summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorAndres Rey <[email protected]>2017-11-12 20:14:53 +0000
committerAndres Rey <[email protected]>2017-11-12 20:14:53 +0000
commitfeefb34263bb5811a182cc772ba0d2224ff19da3 (patch)
tree90efc3b04f1b72191db24b1ba894beb6be072061
parent13f5628ae15e1fb43e457a8fd5161ad70ac7ffc1 (diff)
Update Changelog
-rw-r--r--CHANGELOG.md17
1 files changed, 14 insertions, 3 deletions
diff --git a/CHANGELOG.md b/CHANGELOG.md
index 40e5b20..aace2ab 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -4,10 +4,21 @@ All notable changes to this project will be documented in this file.
## Unreleased
- Merged PR #24. Fixes notice when trying to extract `og:image`
-- Up to date to commit [eb221c5](https://github.com/mozilla/readability/commit/c3ff1a2d2c94c1db257b2c9aa88a4b8fbeb221c5) (2017-10-16), which included the following changes:
- - test
+- Up to date to commit [eb221c5](https://github.com/mozilla/readability/commit/c3ff1a2d2c94c1db257b2c9aa88a4b8fbeb221c5) (2017-10-16), which includes the following changes:
+ - New tags added to the unlikelyCandidates regex
+ - Detection and removal of hierarchical separators in titles
+ - Added more tags to clean after parsing the article (`button`, `textarea`, `select`, etc.)
+ - New way to detect empty nodes (including a edge case where a node with a `&nsbp;` was detected as a node with content)
+ - Better approach to find a top candidate (specially when a top candidate is the only child of a parent node, which allows a more accurate joining of sibling elements)
+ - Detect text direction (`ltr` or `rtl`)
+ - Detect and mark data tables to avoid removing them during final clean up
+ - Major fixes when scanning and deleting nodes (no need to traverse backwards anymore)
+ - Node cleaning via regex matches
+ - Clean table attributes during final clean up.
- Added license
+Next release after this one will be v1 and will be a major refactor around Readability and HTMLParser methods and responsibilities.
+
## [v0.2.2](https://github.com/andreskrey/readability.php/releases/tag/v0.2.2)
- Added a safecheck for really nasty HTML
@@ -21,7 +32,7 @@ All notable changes to this project will be documented in this file.
## [v0.2.0](https://github.com/andreskrey/readability.php/releases/tag/v0.2.0)
-We ARE a 100% complete port of Readability.js!
+100% complete port of Readability.js!
- Every test unit passes
- Readability.php produces the same exact output as Readability.js
- I'm happy :)