Age | Commit message | Author | |
---|---|---|---|
2018-09-01 | Rename wordThreshold to charThreshold and throw deprecation notices | Andres Rey | |
2018-05-05 | Issue #63: Avoid dividing by zero + test case | Andres Rey | |
2018-04-26 | Remove $parseSuccessful flag | Andres Rey | |
2018-04-10 | Remove extra check for DOMDocument nodes + add comment | Andres Rey | |
2018-04-10 | Merge pull request #58 from PedroAmorim/noticeparentOfTopCandidate2 | Andres Rey | |
Fix notice non-object on $parentOfTopCandidate for tumblr.com | |||
2018-04-09 | Fix notice non-object on $parentOfTopCandidate for tumblr.com | Pedro Amorim | |
PHP notice on DOMElement $parentOfTopCandidate: "Trying to get property of non-object" in src/Readability.php on lines 1000 and 1009. Reproduced with this URL: https://clipartx.tumblr.com/post/172752750628/orange-swirl-burnt-orange-orange Config: `$config = new Configuration; $config->setWordThreshold(5)->setSummonCthulhu(true)->setFixRelativeURLs(true)->setOriginalURL($url);` | |||
2018-03-21 | Clean <aside> tags on prepArticle | Andres Rey | |
2018-03-19 | Apply StyleCI diff | Andres Rey | |
2018-03-18 | Merge remote-tracking branch 'origin/development' into development | Andres Rey | |
Conflicts: CHANGELOG.md, composer.json | |||
2018-03-18 | Check for base urls before generating paths for the URL resolver | Andres Rey | |
2018-03-18 | Merge branch 'master' into update-to-8525c6a | Andres Rey | |
2018-03-18 | Use all the article text to determine how many characters were extracted. | Andres Rey | |
2018-03-15 | Override setLogger function to be able to return configuration object | Andres Rey | |
2018-03-14 | Use instanceof DOMDocument | Pedro Amorim | |
2018-03-14 | Fix error C14N | Pedro Amorim | |
I get the error "Call to a member function C14N() on null". To reproduce: (1) try to parse a URL such as http://www.dailymotion.com/video/x6ga6qi that doesn't return any content; (2) this throws an exception and the logs show "[Parsing] Could not parse text, giving up :("; (3) call ->getContent() on the same Readability object. Previously, getContent would return null, but now it calls ->C14N() on a null object. | |||
2018-03-12 | removed class doc-block + method-name-builder switched to sprintf | topot | |
2018-03-10 | Add log messages | Andres Rey | |
2018-03-10 | Save attempts across different runs and try to return at least something before giving up. | Andres Rey | |
2018-03-10 | Clean link tags | Andres Rey | |
2018-03-10 | Failsafe for weird titles | Andres Rey | |
2018-03-10 | Add _cleanClasses function | Andres Rey | |
2018-03-10 | Add missing DOMEntity class | Andres Rey | |
2018-03-10 | StyleCI diff applied | topot | |
2018-03-09 | Added: Configuration parameters array constructor injection | topot | |
2018-03-06 | Rename getContentObject to getDOMDocument | Andres Rey | |
2018-03-06 | Save the full DOMDocument when processing finishes + pull images of the article from the processed object, not the original one | Andres Rey | |
2018-03-06 | Add data-src as a image path source | Andres Rey | |
2018-01-27 | Make sure that we do not allow the DOMDocument to reach the parsing algorithm (because we use/abuse the parentNode call, and a DOMDocument does not have a parent) | Andres Rey | |
2018-01-11 | Merge remote-tracking branch 'origin/logging' into logging | Andres Rey | |
2018-01-11 | Merge branch 'master' into logging | Andres Rey | |
Conflicts: CHANGELOG.md | |||
2018-01-11 | Apply fixes from StyleCI | Andres Rey | |
2018-01-11 | Merge pull request #38 from PedroAmorim/domEntityReference | Andres Rey | |
Add missing DOM classes | |||
2018-01-11 | Add missing DOM class DOMEntityReference. | Pedro Amorim | |
Fix error: Uncaught Error: Call to undefined method DOMEntityReference::getAttribute() in vendor/andreskrey/readability.php/src/Readability.php:528 | |||
2018-01-11 | Remove the data-readability references | Andres Rey | |
2017-12-22 | Check for node type when scanning for better topCandidates | Andres Rey | |
2017-12-10 | Remove logger declaration inside the Configuration | Andres Rey | |
2017-12-10 | Improve logging messages | Andres Rey | |
2017-12-10 | Adding comments everywhere | Andres Rey | |
2017-12-10 | Check for minimum html before parsing metadata | Andres Rey | |
2017-12-10 | Switch to a logger aware trait in the Configuration object | Andres Rey | |
2017-12-10 | Initial approach to logger injection | Andres Rey | |
2017-12-09 | Remove modal as a negative property | Andres Rey | |
2017-12-05 | Search for 'data-orig' in image urls | Andres Rey | |
2017-12-03 | Add function to extract img srcs from other tags that might be used for lazy loading or other types of post-load processing. | Andres Rey | |
2017-12-02 | Apply fixes from StyleCI | Andres Rey | |
2017-12-02 | Add small template on __toString magic method | Andres Rey | |
2017-12-02 | Search for excerpt in case it's not found on HTML metadata | Andres Rey | |
2017-12-01 | Clean up | Andres Rey | |
2017-12-01 | Clean up | Andres Rey | |
2017-12-01 | Move load function below parse function | Andres Rey | |