Concatenate character tokens #98


Merged
merged 1 commit into from
Jul 18, 2017

@RReverser commented Jul 17, 2017
Contributor

Looks like these few places were missed when the ParseError token type was removed.

This PR fixes them (fixes #96) to restore the state promised in the README:

> All adjacent character tokens are coalesced into a single ["Character", data] token.
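
For illustration only, here is a minimal sketch of the coalescing rule quoted from the README. It is not code from this PR or from the test suite: the function name `coalesce_character_tokens` is hypothetical, and it assumes tokens are lists whose first element is the token type, as in `["Character", data]`.

```python
def coalesce_character_tokens(tokens):
    """Merge each run of adjacent ["Character", data] tokens into one token.

    Hypothetical helper for illustration; assumes every token is a list
    whose first element is the token type string.
    """
    merged = []
    for token in tokens:
        if merged and token[0] == "Character" and merged[-1][0] == "Character":
            # Extend the previous Character token instead of appending a new one.
            merged[-1] = ["Character", merged[-1][1] + token[1]]
        else:
            # Copy the token so the input list is left unmodified.
            merged.append(list(token))
    return merged


example = [
    ["Character", "a"],
    ["Character", "b"],
    ["StartTag", "p", {}],
    ["Character", "c"],
]
coalesced = coalesce_character_tokens(example)
```

Under these assumptions, the two leading Character tokens collapse into `["Character", "ab"]`, while the Character token after the start tag stays separate because it is not adjacent to another Character token.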

@inikulin
Member

I believe there are more places where we need to do that; see #96.
@RReverser
Contributor Author

Updated with the other places I found.

@inikulin merged commit 0d35eb6 into html5lib:master on Jul 18, 2017

@inikulin
Member

Thanks!

Development

Successfully merging this pull request may close these issues.

Question about the logic of merged "Character" tokens after PR #92 in tokenizer tests
2 participants