Age | Commit message (Collapse) | Author | |
---|---|---|---|
2019-03-10 | drop hhvm support | Asmir Mustafic | |
2018-12-29 | Allow to pass a charset to the Scanner | Titouan Galopin | |
2018-11-27 | Optimize the processing of text between nodes | Christophe Coevoet | |
Instead of processing the text token one by one in the main loop, it is now processed in batch until the next special token (< and & which have special handling in the main loop and NUL characters which need to report a parse error). | |||
2018-11-27 | Merge pull request #161 from stof/optimize_main_loop | Asmir Mustafic | |
Optimize main loop | |||
2018-11-26 | Optimize the main loop | Christophe Coevoet | |
2018-11-26 | Merge pull request #155 from stof/optimize_attributes | Asmir Mustafic | |
Optimize the parsing of unquoted attributes | |||
2018-11-26 | Merge pull request #154 from stof/optimize_token_comparison | Asmir Mustafic | |
Optimize the token comparison | |||
2018-11-26 | Remove useless condition for the parsing of cdata | Christophe Coevoet | |
The caller already ensures that the current token is the right one. | |||
2018-11-26 | Simplify the doctype matching | Christophe Coevoet | |
- the doctype() function is only called for a D or d token, so there is no need to check again inside the method - checking that we have the DOCTYPE string can use a sequence matching | |||
2018-11-26 | Optimize the handling of the EOF detection in the main loop | Christophe Coevoet | |
The eof() method is a no-op when the token is not false. As the main loop already needs to identify that case anyway, skipping the method call allows to reduce the cost of parsing text tokens. | |||
2018-11-26 | Optimize the parsing of unquoted attributes | Christophe Coevoet | |
2018-11-26 | Optimize the token comparison | Christophe Coevoet | |
Tokens are always a single char. Using strspn to find whether they belong to a fixed list is slower than comparing them directly. | |||
2018-11-26 | Replace next calls with consume calls when the return value is ignored | Christophe Coevoet | |
2018-11-25 | Normalize PHPDoc comments | Titouan Galopin | |
2018-11-25 | Fix coding style | Titouan Galopin | |
2018-11-24 | Merge pull request #153 from stof/optimize_whitespace | Asmir Mustafic | |
Optimize consuming whitespaces | |||
2018-11-24 | Merge pull request #156 from stof/fix_typo | Asmir Mustafic | |
Fix typos in comments | |||
2018-11-24 | Fix typos in comments | Christophe Coevoet | |
2018-11-24 | Optimize the check for the element types during adding. | Christophe Coevoet | |
2018-11-24 | Optimize consuming whitespaces | Christophe Coevoet | |
Places consuming whitespaces don't care about the matched substring. They either need its length, or nothing. Returning only the length directly avoids computing the substring. | |||
2018-11-24 | Merge pull request #150 from Masterminds/backport | Asmir Mustafic | |
Backport few changes from old master branch | |||
2018-11-24 | Merge pull request #152 from stof/fix_typo | Asmir Mustafic | |
Fix typo in an error message | |||
2018-11-24 | Fix typo in an error message | Christophe Coevoet | |
2018-11-24 | Optimize the handling of references when consuming data | Christophe Coevoet | |
2018-11-23 | The element mask is 0 rather than false if element not fount. | Matt Farina | |
2018-11-23 | Setting variable for local namespaces to protected in writer. | Matt Farina | |
The namespaces for mathml, html, and svg are set to protected so they cannot be altered. These are part of the spec. | |||
2018-11-08 | move sequenceMatches to the Scanner | Asmir Mustafic | |
2018-11-08 | improve consume speed | Asmir Mustafic | |
2018-11-08 | Merge pull request #147 from tgalopin/tokenizer-perfs | Asmir Mustafic | |
Improve the Tokenizer performance | |||
2018-11-06 | Remove another current call | Titouan Galopin | |
2018-11-05 | Inline tag open in Tokenizer to further improve performances | Titouan Galopin | |
2018-11-05 | Improve Tokenizer performance by inlining text parsing and removing some ↵ | Titouan Galopin | |
Scanner::current calls | |||
2018-11-03 | Improve performance by relying on a native string instead of InputStream | Titouan Galopin | |
2018-11-02 | Add more extensions on composer.json, improve phpdocs and remove dead code | Titouan Galopin | |
2018-10-08 | audio is not a block tag | Asmir Mustafic | |
see https://developer.mozilla.org/en-US/docs/Web/HTML/Element/audio | |||
2017-12-04 | #136: Respect self-closing tags only on foreign elements | Albert Peschar | |
2017-09-01 | Merge pull request #134 from Masterminds/ampersand-in-urls | Asmir Mustafic | |
Raw & in attributes | |||
2017-08-31 | reduce number of times "current" is invoked | Asmir Mustafic | |
2017-08-28 | Fixes https://github.com/Masterminds/html5-php/issues/124 | Asmir Mustafic | |
Reference: https://www.w3.org/TR/html52/syntax.html#character-reference-state If the character reference was consumed as part of an attribute (return state is either attribute value (double-quoted) state, attribute value (single-quoted) state or attribute value (unquoted) state), and the last character matched is not a U+003B SEMICOLON character (;), and the next input character is either a U+003D EQUALS SIGN character (=) or an alphanumeric ASCII character, then, for historical reasons, switch to the character reference end state. If the last character matched is not a U+003B SEMICOLON character (;), this is a parse error. | |||
2017-08-25 | handle special "img" "image" handling | Asmir Mustafic | |
2017-07-26 | Fix https://github.com/Masterminds/html5-php/issues/131 | Asmir Mustafic | |
2016-09-22 | don't close optgroup when followed by an option | Valentin Kunz | |
Make sure options are children of optgroup, not the parent select. Otherwise we end up with a bunch of empty optgroups. | |||
2016-09-22 | Merge pull request #116 from Masterminds/case-sensitive-names | Asmir Mustafic | |
In XML mode, tags are case sensitive | |||
2016-08-19 | Alternative implementation of https://github.com/Masterminds/html5-php/pull/115 | Asmir Mustafic | |
2016-08-17 | In XML mode, tags are case sensitive | Asmir Mustafic | |
Fixes #114 | |||
2016-08-17 | When outputting a TEXT_RAW node, make sure each child is an instance of ↵ | Danny Smart | |
DOMCharacterData | |||
2016-05-09 | Correct address to reflect proper specfiication. | William Hearn | |
https://github.com/Masterminds/html5-php/issues/109 | |||
2016-03-28 | Merge pull request #99 from Masterminds/fix/98 | Asmir Mustafic | |
Fixes #98: a DOM is allowed within noscript tags | |||
2016-03-19 | Use mb_substitute_character instead of ini_set | Der Mundschenk & Compagnie | |
2016-03-18 | Fixes #98: a DOM is allowed within noscript tags | Matt Farina | |
When JavaScript is disabled noscript tags become transparent to to the DOM. For more details see: https://w3c.github.io/html/semantics-scripting.html#the-noscript-element While this notes limitations in the values in the head there are different sets elsewhere such as in: https://w3c.github.io/html/syntax.html#the-in-head-noscript-insertion-mode Since this is not a validating parser this handling it to make it transparent. |