Update documentLoader to return parsed document #85

gkellogg · 2019-05-02T20:35:29Z

The documentation for documentLoader documentation is a bit lax, but it seems to provide for retrieving processed documents.

In the API, the documentLoader option is of type DocumentLoaderCallback, which is a Promise<USVString> (USVString url);. Adding an optional options parameter can allow to provide more information, such as for use in processing contexts, or when extractAllScripts is specified. But, it indicates that the promise returns a USVString.

We also define a RemoteDocument type, which is really what the documentLoader promise should result in. RemoteDocument has a document attribute of any, and is stated to either return the raw payload or parsed document. It would be useful for us to limit this to the parsed document, which would allow us to encapsulate more of the HTML processing within this definition (see w3c/json-ld-syntax#167 (comment)). But, this might be considered breaking the 1.0 API contract.

The Context Processing Algorithm doesn't explicitly describe using the documentLoader promise to load remote contexts, although it is implicit in the description of the documentLoader. This is somewhat complicated by the attempt to keep all WebIDL information, such as promises, out of the algorithms themselves, which remains a challenging aspect.

With suitable wording, I propose that we move all of the HTML-specific processing rules into the definition of RemoteDocument and suitably parameterize calls to documentLoader (including within the Context Processing Algorithm) to handle the various cases, and result in the internal JSON representation. We can handle the 1.0 contract, which allows an external information to simply return the raw document, by wrapping the callback in code which detects this, and provides it's own processing of the results.

The text was updated successfully, but these errors were encountered:

gkellogg · 2019-06-20T20:39:23Z

PR #87 reviewed by @dlongley. Closing.

gkellogg added spec:enhancement needs discussion labels May 2, 2019

gkellogg mentioned this issue May 3, 2019

Rewrite LoadDocumentCallback #87

Merged

gkellogg added propose closing and removed needs discussion labels May 6, 2019

gkellogg closed this as completed Jun 20, 2019

azaroth42 added the satisfied label Nov 18, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update documentLoader to return parsed document #85

Update documentLoader to return parsed document #85

gkellogg commented May 2, 2019 •

edited by davidlehn

Loading

gkellogg commented Jun 20, 2019

Uh oh!

Update documentLoader to return parsed document #85

Update documentLoader to return parsed document #85

Comments

gkellogg commented May 2, 2019 • edited by davidlehn Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

gkellogg commented Jun 20, 2019

Uh oh!

gkellogg commented May 2, 2019 •

edited by davidlehn

Loading