path: root/src/Readability.php
Age | Commit message | Author
2024-04-03 | Add config support for ignoring additional elements. | wn_
2021-12-28 | Fixes | FiveFilters.org
    Reported here: https://forum.fivefilters.org/t/readability-php-fatal-error/1283
2021-08-31 | Adhere to standard code style | FiveFilters.org
2021-08-25 | Separating title-like and low-scoring headers | FiveFilters.org
    https://github.com/mozilla/readability/commit/2e620c232ec5b189207da5b7123470e8c8872f54
2021-08-25 | Fix param order | FiveFilters.org
2021-08-25 | Fix headers being removed | FiveFilters.org
    https://github.com/mozilla/readability/commit/11093f011f57fa528a00a78d70b8d9df32c7a32d
2021-08-25 | Skip removing nodes in <code> | FiveFilters.org
    https://github.com/mozilla/readability/commit/290724c68c93359139a4663d18460f6d26e192e6
2021-08-25 | Consider #hash links in link density | FiveFilters.org
    https://github.com/mozilla/readability/commit/3c833899866ffb1f9130767110197fd6f5c08d4c (not all these changes work as expected, so one section has been commented out for now)
2021-08-25 | Code readability fixes | FiveFilters.org
    https://github.com/mozilla/readability/commit/fc78270a308e815429113517a61428e4e56076a8
2021-08-25 | Renamed nextElement to nextNode | FiveFilters.org
    https://github.com/mozilla/readability/commit/60af91f15387fcd5e60fe05212bf6fd1a2d9d36d
2021-08-25 | exclude additional elements based on their role | FiveFilters.org
    https://github.com/mozilla/readability/commit/d5eea06a0095b3138dbd1f6233f656d690200509
2021-08-25 | Update Readability.php | FiveFilters.org
2021-08-25 | Include more ancestors in candidate scoring | FiveFilters.org
    https://github.com/mozilla/readability/commit/3844d8f05b3f114e3df16c3bc3caf44e5ba52181
2021-08-22 | Prefer JSON-LD metadata object, when present | FiveFilters.org
    https://github.com/mozilla/readability/commit/2ca98284e92aecd610bac3008e5e6ff1c0fdad37 (getJSONLD() method based on current Readability.js implementation)
2021-08-21 | Fix lazy-loaded images are not visible in Kinja sites | FiveFilters.org
    https://github.com/mozilla/readability/commit/52ab9b5c8916c306a47b2119270dcdabebf9d203 (some changes based on current Readability.js code)
2021-08-20 | remove nodes with role=complementary | FiveFilters.org
    https://github.com/mozilla/readability/commit/d5621f85e775229332bf0f6f2b1d3d789c638f2d
2021-08-20 | Add method to unwrap img inside noscript | FiveFilters.org
    https://github.com/mozilla/readability/commit/d784bf7e20e25ec1b3a6102a20c83d35fe3ef87d (but code based on current version of Readability.js)
2021-08-19 | Use _getAllNodesWithTag() more | FiveFilters.org
    https://github.com/mozilla/readability/commit/dc34dfd8fa6d5c17801efbc2e115dc368b7117c8 (haven't tested to see if the issue that affected Readability.js affects Readability.php in the same way)
2021-08-19 | preserve children when removing javascript: links | FiveFilters.org
    https://github.com/mozilla/readability/commit/7c91bdd2753adabf801beb5063db6693c54e4e33
2021-08-19 | Added "keepClasses" option to prevent cleaning of classes | FiveFilters.org
    https://github.com/mozilla/readability/commit/2982216913af2c66b0690e88606b03116553ad92
2021-08-19 | Remove sharing elements in fewer situations | FiveFilters.org
    https://github.com/mozilla/readability/commit/9092b2a29c6ad3ce7f9a9e1e4e3ee74a56abd304
2021-08-18 | Add support for detecting lazy-loaded images | FiveFilters.org
    https://github.com/mozilla/readability/commit/814f0a3884350b6f1adfdebb79ca3599e9806605 (but porting the _fixLazyImages() function as it stands now, not as it's shown in this commit)
2021-08-17 | Fix table header missing | FiveFilters.org
    https://github.com/mozilla/readability/commit/9009f64f9ce8b7d593c1ef90864843f72e193cba
2021-08-17 | Fix embedded videos getting removed | FiveFilters.org
    https://github.com/mozilla/readability/commit/6761a7e4124cab16384c0501733edd4c7bf32e6b
2021-08-16 | use default threshold for share elements | FiveFilters.org
    https://github.com/mozilla/readability/commit/681bf0c47b235577fb40848a6092d0d07f7d95dc
2021-08-16 | only remove sharing elements if they contain <500 characters | FiveFilters.org
    https://github.com/mozilla/readability/commit/e76aba3485d6788b07b1781661f6e283d6563276
2021-08-16 | Fix tabular data got removed | FiveFilters.org
    https://github.com/mozilla/readability/commit/6a5066abe27b47e1e6d9972fdfaa8aefe24cc516 (test files still to be updated)
2021-08-16 | Fix whitespace normalization in title metadata | FiveFilters.org
    https://github.com/mozilla/readability/commit/e8bb7f722febdc856abf4ac73846afd4f5f7b83d
2021-08-14 | Update namespace | FiveFilters.org
2021-08-11 | Prevent HTML5PHP from inserting doctype when serialising HTML | FiveFilters.org
2021-08-10 | HTML5 parser, more whitespace aware, backport more Readability.js code | FiveFilters.org
2021-07-28 | Apply changes from pull requests and forks | FiveFilters.org
2019-07-15 | Fix wrong access to first element child | Andres Rey
2019-06-17 | Style CI Changes | Andres Rey
2019-06-17 | Make DOMNodeList more DOMNodeList...y | Andres Rey
2019-06-17 | Remove getChildren function, implement a fake DOMNodeList class. Prepare everything to support newer versions of libxml2 | Andres Rey
2019-05-26 | Add safecheck for getFirstElementChild and replace another instance of firstChild | Andres Rey
2019-05-26 | Add getFirstElementChild function | Andres Rey
2019-03-24 | Merge remote-tracking branch 'origin/development' into development | Andres Rey
    # Conflicts:
    #   test/test-pages/engadget/expected-metadata.json
    #   test/test-pages/nytimes-1/expected-metadata.json
    #   test/test-pages/nytimes-2/expected-metadata.json
2018-12-23 | Add getSiteName feature based on meta tag og:site_name | Maria Luiza Soares
2018-12-01 | Add "image" to the list of keys to search for | Andres Rey
2018-11-29 | Update regex property extractor to avoid matching og:image tags multiple times and overwriting its value (like og:image:width overwriting og:image) | Andres Rey
2018-11-25 | Merge remote-tracking branch 'origin/development' into development | Andres Rey
2018-11-25 | Search for the main image while looking for metadata | Andres Rey
2018-11-22 | Apply fixes from StyleCI | Andres Rey
2018-11-19 | Trim title before saving it + update test expectations | Andres Rey
2018-11-19 | Implement shiftingAwareGetElementsByTagName to avoid node shifting errors | Andres Rey
2018-11-18 | Remove unused variable | Andres Rey
2018-11-18 | Import node attributes by default when setting a new tag | Andres Rey
2018-11-18 | Avoid looping over negative numbers | Andres Rey