index
:
html5-php.git
master
html5-php fork with PHP 8.1 fixes
Linux User
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
/
HTML5
/
Parser
Age
Commit message (
Collapse
)
Author
2013-04-23
Added an instruction processor for PIs.
Technosophos
2013-04-19
Stubs for tree builder and tests.
Technosophos
2013-04-19
Finished minor refactoring of tokenizer.
Technosophos
2013-04-19
Added consume() to scanner, refactoring Tokenizer.
Technosophos
2013-04-19
Full support for rawtext. Unit tests finished.
Technosophos
2013-04-19
Added support for raw text.
Technosophos
2013-04-18
Fixed attribute tokenizing for 8.1.2.3.
Technosophos
2013-04-18
Updating Parser docs.
Technosophos
2013-04-18
Tokenizer now handles sophisticated tags.
Technosophos
2013-04-18
Well-formed attribute values are working.
Technosophos
2013-04-17
Fixed bug in whitespace consumer.
Technosophos
2013-04-17
Added uppercase tests.
Technosophos
2013-04-17
Fixed broken tag test.
Technosophos
2013-04-16
Working on simple tags.
Technosophos
2013-04-16
Added support for processing instructions.
Technosophos
2013-04-16
Added a FileInputStream for anything that can be grabbed by file_get_contents.
Matt Farina
2013-04-15
Unit tests for DOCTYPE are all passing.
Technosophos
2013-04-15
First shot at DOCTYPE parsing and testing.
Technosophos
2013-04-15
Updated event handler interface.
Technosophos
2013-04-15
UNFINISHED: DOCTYPE parser is in progress.
Matt Butcher
2013-04-13
Fixed CDATA termination.
Matt Butcher
2013-04-12
CDATA handling is complete. DOCTYPE is begun.
Matt Butcher
2013-04-12
DOCTYPE bogus comments handled.
Technosophos
2013-04-12
BogusCOmments. How cool.
Technosophos
2013-04-11
Working on comments.
Technosophos
2013-04-11
endTag is done.
Technosophos
2013-04-11
Working on closing tag and bogus comments.
Technosophos
2013-04-11
Addressed UTF-8 encoding issues.
Technosophos
Neither iconv nor mb seem to be able to convert UTF-8 surrogates into UTF-8. As I understand it, this is an extreme edge case. Still, the behavior in both cases is that the surrogates are stripped from the string. We test for that condition, now.
2013-04-11
Moved UTF-8 character check out to UTF8Utils.
Technosophos
2013-04-11
Working on tag parsing.
Matt Butcher
2013-04-11
Started StringInputStream tests.
Matt Farina
2013-04-10
Streamlining recursion.
Technosophos
2013-04-10
Updated documentation in EventHandler.
Technosophos
2013-04-10
Instead of throwing parse errors, we now send as events.
Technosophos
2013-04-10
Finishing tests on entities.
Technosophos
2013-04-10
Namespace fixed for ParseError.
Technosophos
2013-04-10
Working on entity resolution.
Technosophos
2013-04-10
Finished CharacterReference class.
Technosophos
2013-04-10
Added pass through commands on the scanner to the column, row, and ↵
Matt Farina
characters remaining methonds.
2013-04-10
Added bounds checking to the scanner current method.
Matt Farina
2013-04-10
Moved the scanner to the new Parser InputStream and updated the unit tests ↵
Matt Farina
to use StringInputStream
2013-04-10
Added main parsing loop.
Technosophos
2013-04-10
Merge branch 'master' of github.com:technosophos/HTML5-PHP
Technosophos
2013-04-10
Added CharacterReference utility class.
Technosophos
2013-04-10
Moved the scanner to the new string input parser. The current test will fail ↵
Matt Farina
until the parser is updated to handle positioning correctly.
2013-04-10
Added tests for the scanner getHex method.
Matt Farina
2013-04-10
Use UTF8Utils.
Matt Butcher
2013-04-10
Merge branch 'master' of github.com:technosophos/HTML5-PHP
Matt Butcher
2013-04-10
Refactoring InputStream.
Matt Butcher
2013-04-10
Added more documentation and tests to the Scanner.
Matt Farina
[prev]
[next]