summaryrefslogtreecommitdiff
path: root/README.md
diff options
context:
space:
mode:
authorwn_ <[email protected]>2024-04-03 17:23:41 +0000
committerwn_ <[email protected]>2024-04-03 17:25:30 +0000
commit29c099cb9c1a9e2f6871068513fc321828da31d2 (patch)
treee6b44c6309edadf6e933caf81321740a0ace79a1 /README.md
parent8ac5abdd497b37d2be4833bcf18d6819bba4d9c9 (diff)
Add config support for ignoring additional elements.
Diffstat (limited to 'README.md')
-rw-r--r--README.md1
1 files changed, 1 insertions, 0 deletions
diff --git a/README.md b/README.md
index c910714..547178c 100644
--- a/README.md
+++ b/README.md
@@ -117,6 +117,7 @@ Then you pass this Configuration object to Readability. The following options ar
- **KeepClasses**: default value `false`, which removes all `class="..."` attribute values from HTML elements.
- **Parser**: default value `html5`, which uses HTML5-PHP for parsing. Set to `libxml` to use that instead (not recommended for modern HTML documents).
- **SummonCthulhu**: default value `false`, remove all `<script>` nodes via regex. This is not ideal as it might break things, but if you've set the parser to libxml (see above), it might be the only solution to [libxml problems with unescaped javascript](https://github.com/fivefilters/readability.php#known-libxml-parsing-issues).
+- **ExtraIgnoredElements**: default value `[]`, additional DOM elements that should be ignored. `noscript` and `script` are always ignored.
### Debug log