author Andrew Dolgov <[email protected]> 2024-04-03 17:38:00 +0000
committer Andrew Dolgov <[email protected]> 2024-04-03 17:38:00 +0000
commit 37c0c2ce76aa90e8adafbd5cb0b0332a54df1523
tree e6b44c6309edadf6e933caf81321740a0ace79a1 /README.md
parent 8ac5abdd497b37d2be4833bcf18d6819bba4d9c9
parent 29c099cb9c1a9e2f6871068513fc321828da31d2
Merge branch 'feature/ignore-more-elements' into 'master'

Add config support for ignoring additional elements.

See merge request main/libraries/readability-php!1
Diffstat (limited to 'README.md')
-rw-r--r-- README.md 1
1 files changed, 1 insertions, 0 deletions
diff --git a/README.md b/README.md
index c910714..547178c 100644
--- a/README.md
+++ b/README.md
@@ -117,6 +117,7 @@ Then you pass this Configuration object to Readability. The following options ar
 - **KeepClasses**: default value `false`, which removes all `class="..."` attribute values from HTML elements.
 - **Parser**: default value `html5`, which uses HTML5-PHP for parsing. Set to `libxml` to use that instead (not recommended for modern HTML documents).
 - **SummonCthulhu**: default value `false`, remove all `<script>` nodes via regex. This is not ideal as it might break things, but if you've set the parser to libxml (see above), it might be the only solution to [libxml problems with unescaped javascript](https://github.com/fivefilters/readability.php#known-libxml-parsing-issues).
+- **ExtraIgnoredElements**: default value `[]`, additional DOM elements that should be ignored. `noscript` and `script` are always ignored.
 ### Debug log