Accept objects for encoded typedarrays in data_array valType #5230

archmoj · 2020-10-22T20:02:10Z

Architecture Details

The approach is to perform the typed array decoding in coerce (before the supplyDefaults logic in the layout and traces). This has the advantage that none of the supply defaults or calc logic needs to be aware of the typed array specification objects.

For performance, the ArrayBuffer instances resulting from the decoding process are cached by trace uid and property name. To keep the cache bounded, only one ArrayBuffer is cached per trace per property. Playing with the isosurface example in #5230, building the typed array with this caching approach is about 10x faster than performing the base64 decoding each time.
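A minimal sketch of the caching idea described above, not the code in this PR. The names `bufferCache`, `base64ToArrayBuffer` and `getDecodedBuffer` are hypothetical, and comparing the stored base64 string to detect a stale entry is an assumption (the thread does not say how entries are validated). It also assumes a runtime with a global `atob` (browsers, recent Node).

```javascript
// Hypothetical cache: one entry per "<trace uid>:<property name>" key.
const bufferCache = new Map();

// Decode a base64 string into a fresh ArrayBuffer.
function base64ToArrayBuffer(b64) {
    const binary = atob(b64);
    const bytes = new Uint8Array(binary.length);
    for (let i = 0; i < binary.length; i++) {
        bytes[i] = binary.charCodeAt(i);
    }
    return bytes.buffer;
}

// Return the cached buffer when the same base64 payload is seen again for
// this trace/property; otherwise decode and overwrite the previous entry,
// so at most one buffer is kept per trace per property.
function getDecodedBuffer(traceUid, propName, b64) {
    const key = traceUid + ':' + propName;
    const hit = bufferCache.get(key);
    if (hit && hit.b64 === b64) return hit.buffer;

    const buffer = base64ToArrayBuffer(b64);
    bufferCache.set(key, {b64: b64, buffer: buffer});
    return buffer;
}
```

Because a new payload for the same key replaces the old one, the cache size is bounded by the number of trace/property pairs rather than by the number of updates.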

Implementation Details

This PR handles decoding N-dimensional arrays into a nested collection of Arrays. The innermost layer contains TypedArrays that are backed by the original decoded ArrayBuffer, so no copying is performed. So far, all the 2D traces I've played with have worked fine with this nesting arrangement.
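As an illustration of that nesting arrangement, here is a sketch under stated assumptions: `nestTypedArray` is a hypothetical helper, and row-major (C-order) layout is assumed since the description above does not specify the memory order.

```javascript
// Build nested plain Arrays whose innermost rows are typed-array views
// onto the same ArrayBuffer (no bytes are copied).
// Assumes row-major layout; `shape` is an array such as [2, 3].
function nestTypedArray(buffer, TypedArrayCtor, shape) {
    const itemSize = TypedArrayCtor.BYTES_PER_ELEMENT;

    function build(dim, byteOffset) {
        const n = shape[dim];
        if (dim === shape.length - 1) {
            // Innermost dimension: a view of n items starting at byteOffset.
            return new TypedArrayCtor(buffer, byteOffset, n);
        }
        // Number of bytes spanned by one child along this dimension.
        let stride = itemSize;
        for (let d = dim + 1; d < shape.length; d++) stride *= shape[d];

        const out = new Array(n);
        for (let i = 0; i < n; i++) {
            out[i] = build(dim + 1, byteOffset + i * stride);
        }
        return out;
    }

    return build(0, 0);
}

// Usage: a 2x3 grid viewed over a flat Float64Array.
const flatGrid = new Float64Array([0, 1, 2, 3, 4, 5]);
const grid = nestTypedArray(flatGrid.buffer, Float64Array, [2, 3]);
```

Since every row is a view, writing to `flatGrid` is visible through `grid` and vice versa.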

More testing is needed, but hopefully nothing in supplyDefaults or calc or rendering will need to change in any of the traces (apart from potential bug fixes for TypedArray handling).

Latest demo

codepen

Previous attempt

The initial commit dc9f4e5 was inspired by @jonmmease's PR #2911.

@plotly/plotly_js

- move decoding step to the start of calc data

archmoj · 2020-11-02T14:03:34Z

@jonmmease are you interested to review & possibly work on this PR?

almarklein · 2020-11-18T10:40:43Z

I'd love for this functionality to land, in particular to communicate images and volumetric data more efficiently. In addition to being more memory efficient, it would also improve the speed of the JSON encoding, as we found during our work on improving the performance of the JSON encoder in #2880.

I'm happy to get my hands dirty to help move this effort forward, e.g. by implementing the encoding step for the Python JSON encoder, though that should come after this. In the meantime, I could perhaps help implement support for other traces?

almarklein · 2020-11-18T10:43:44Z

I just did some benchmarking on the encoding of ndarray data in Python. Perhaps this helps create some enthusiasm for this feature :)

I used a test array of 1000x1000 containing random data. I compared encoding the data in the form proposed in this PR against letting the PlotlyJSONEncoder convert it to lists. The timing includes the time of the base64 encoding.

The size is 28% of the list-encoded size (more than 3x smaller).
Encoding is about 10x faster (from ~1 s to ~0.1 s for this data size).

(This is measured with the improved PlotlyJSONEncoder of #2880; before that, the speed difference was even larger, a factor of 25.)
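For readers who want to try a similar size comparison on the JavaScript side, a rough sketch follows. It is not the Python benchmark above and is not expected to reproduce those numbers. `typedArrayToBase64` is a hypothetical helper, and the key names `dtype` and `bdata` are placeholders, since the exact key names are not spelled out in this thread.

```javascript
// Encode the bytes behind a typed array as base64.
// Works in chunks so String.fromCharCode is never called with too many arguments.
function typedArrayToBase64(arr) {
    const bytes = new Uint8Array(arr.buffer, arr.byteOffset, arr.byteLength);
    const chunk = 0x8000;
    let binary = '';
    for (let i = 0; i < bytes.length; i += chunk) {
        binary += String.fromCharCode.apply(null, bytes.subarray(i, i + chunk));
    }
    return btoa(binary);
}

// Random test data, analogous in spirit to the benchmark above.
const values = new Float64Array(1000);
for (let i = 0; i < values.length; i++) values[i] = Math.random();

// JSON payload as a plain list versus as a base64 object (placeholder keys).
const asList = JSON.stringify(Array.from(values));
const asBase64 = JSON.stringify({dtype: 'f8', bdata: typedArrayToBase64(values)});

// Fraction of the list-based size taken by the base64 form.
const sizeRatio = asBase64.length / asList.length;
console.log('base64 / list size ratio:', sizeRatio.toFixed(2));
```

The ratio depends heavily on the dtype and on how many digits each number needs when printed as text.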

src/plots/plots.js

jonmmease · 2020-11-18T10:59:24Z

> @jonmmease are you interested to review & possibly work on this PR?

Yeah, I'm interested in picking this up again. Probably early December.

@almarklein Thanks for chiming in and for trying out those mini benchmarks!

One not yet settled question, as far as I understand, is what multi-dimensional array buffers are going to be de-serialized into in JavaScript (since TypedArrays are 1D only). @archmoj, I believe we already have a dependency on ndarray (https://github.com/scijs/ndarray) through the WebGL stuff. Do you think we could make plotly.js accept these as input for 2D/3D arrays (heatmap, volume, etc.)?

archmoj · 2020-11-18T12:22:37Z

> > @jonmmease are you interested to review & possibly work on this PR?
>
> Yeah, I'm interested in picking this up again. Probably early December.
>
> @almarklein Thanks for chiming in and for trying out those mini benchmarks!
>
> One not yet settled question, as far as I understand, is what multi-dimensional array buffers are going to be de-serialized into in JavaScript (since TypedArrays are 1D only). @archmoj, I believe we already have a dependency on ndarray (https://github.com/scijs/ndarray) through the WebGL stuff. Do you think we could make plotly.js accept these as input for 2D/3D arrays (heatmap, volume, etc.)?

Right now one could pass 2D/3D arrays to volume, surface, etc. using 'shape' as mentioned in the modified attribute description.
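To make the 'shape' idea concrete, here is a hedged sketch of how a 2D array might be packed into such an object. Only `shape` is named in this thread; the key names `dtype` and `bdata`, the comma-separated string form of `shape`, and the helper `encode2D` are all assumptions for illustration.

```javascript
// Flatten a rectangular 2D array into a Float32Array (row-major) and
// wrap it in a specification object carrying the original shape.
// Key names and the shape string format are placeholders, not a confirmed API.
function encode2D(rows) {
    const nRows = rows.length;
    const nCols = rows[0].length;

    const flat = new Float32Array(nRows * nCols);
    for (let i = 0; i < nRows; i++) {
        flat.set(rows[i], i * nCols); // copy row i at its row-major offset
    }

    const bytes = new Uint8Array(flat.buffer);
    let binary = '';
    for (let i = 0; i < bytes.length; i++) {
        binary += String.fromCharCode(bytes[i]);
    }

    return {dtype: 'f4', bdata: btoa(binary), shape: nRows + ',' + nCols};
}

// Usage: a 2x3 grid that could be handed to a 2D trace attribute such as z.
const zSpec = encode2D([[1, 2, 3], [4, 5, 6]]);
```

The decoder can then split the flat buffer back into rows using the shape, without the sender ever emitting nested JSON lists.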

src/plots/plots.js

jonmmease · 2020-11-28T14:44:44Z

> Unlike #2911, the decoding process is done at calc step instead of supplyDefaults: 636e644

@archmoj Could you explain your motivation here? If I remember correctly, it seemed easier to me to do this in the top-level supplyDefaults so that all of the individual traces wouldn't have to be updated to deal with the base64 object.

Thanks again for resurrecting this!

archmoj · 2020-11-28T15:35:13Z

> > Unlike #2911, the decoding process is done at calc step instead of supplyDefaults: 636e644
>
> @archmoj Could you explain your motivation here? If I remember correctly, it seemed easier to me to do this in the top-level supplyDefaults so that all of the individual traces wouldn't have to be updated to deal with the base64 object.
>
> Thanks again for resurrecting this!

supplyDefaults is called during interactions, so we don't want to run slow processes on arrays in there.

jonmmease · 2020-11-28T18:11:30Z

One issue I'm running into with having decoding happen in calc: it does prevent the decoding from happening during interactions, but since supplyDefaults does run during interactions, the typed arrays get overridden by the buffer specification in fullData.

You can see the effect of this in your demo. When you click to change the modebar tool, the isosurface disappears. This is because fullData ends up with the buffer specification object instead of the typed array.

Not sure of the best way forward here. Is there some way to skip supplyDefaults on an attribute that isn't going to have calc run on it?
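The ordering problem can be reproduced with a toy model. This is only a simulation of the sequence of steps described above: `toySupplyDefaults`, `toyCalc` and `toyDecode` are hypothetical stand-ins and do not reflect plotly.js internals, and `bdata` is a placeholder key name.

```javascript
// Toy stand-in: rebuilds the "full" trace from user input, copying the spec as-is.
function toySupplyDefaults(userTrace) {
    return {z: userTrace.z};
}

// Toy stand-in: turns a spec object into a typed array, leaves anything else alone.
function toyDecode(value) {
    return (value && typeof value.bdata === 'string') ?
        new Uint8Array([1, 2, 3]) :
        value;
}

// Toy stand-in: decoding happens here, mutating the full trace.
function toyCalc(fullTrace) {
    fullTrace.z = toyDecode(fullTrace.z);
}

const userTrace = {z: {dtype: 'u1', bdata: 'AQID'}};

// Initial render: defaults, then calc. The full trace holds a typed array.
let fullTrace = toySupplyDefaults(userTrace);
toyCalc(fullTrace);
const typedAfterFirstRender = ArrayBuffer.isView(fullTrace.z);

// Interaction: defaults run again, but calc does not.
fullTrace = toySupplyDefaults(userTrace);
const typedAfterInteraction = ArrayBuffer.isView(fullTrace.z);

console.log(typedAfterFirstRender, typedAfterInteraction);
```

In this model the second pass leaves the raw specification object in the full trace, which matches the symptom reported for the demo.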


jacksongoode · 2023-10-03T02:07:18Z

Given the speed improvements, this would be great to get in!


alexcjohnson

💃 Let's do it! Great work, this is a long time coming!

nicolaskruchten · 2024-01-05T17:02:40Z

Wooot! nice job folks! :)

accept objects for encoded typedarrays in data_array valType

dc9f4e5

- move decoding step to the start of calc data

archmoj added feature something new status: reviewable and removed status: reviewable labels Oct 22, 2020

almarklein mentioned this pull request Nov 6, 2020

Todo plotly/dash-slicer#1

Closed

16 tasks

almarklein reviewed Nov 18, 2020

src/plots/plots.js (outdated)

almarklein reviewed Nov 18, 2020

src/plots/plots.js (outdated)

archmoj added status: reviewable and removed status: in progress labels Nov 25, 2020

archmoj requested a review from alexcjohnson November 25, 2020 19:01

Revert trace changes

67ccac9

jonmmease added 3 commits November 28, 2020 13:23

WIP of decoding in calc.

712a124

Works for initial render, but typed array is overwritten when supplyDefaults is run again.

Back to decoding in coerce

4d5b382

Cache base64 decoded ArrayBuffers and keep decoding in coerce / supplyDefaults

7233f34

jonmmease mentioned this pull request Nov 28, 2020

WIP: Typed array encoding in supplyDefaults with Caching #5308

Merged

jonmmease added 2 commits November 28, 2020 15:09

bvals -> buffer

5079fc7

remove big for consistency

38e10bb

archmoj added type: duplicate and removed status: reviewable labels Nov 28, 2020

jonmmease added 2 commits November 28, 2020 16:05

Revert "bvals -> buffer"

1043827

This reverts commit 5079fc7

Run isArray1D on decoded array not spec

8d84f3e

archmoj added 14 commits March 13, 2023 12:11

Revert "handle typed arrays in parcats"

5dfe2c9

This reverts commit 0f2ef23.

Revert "handle typed arrays in scattergeo"

9767454

This reverts commit 142e353.

Revert "handle typed arrays in opacityscale"

dfa65db

This reverts commit 60697dc.

improve packing integers in b64 image test

43d0780

handle typed arrays in transform filter value

b93033f

improve b64 test coverage include 2 item arrays too

6e3f987

handle typed arrays in parcats categoryarray

4bab8de

handle typed arrays in scattergeo locations

9b0be90

test b64 on arrays with only 1 item as well

509f7df

add b64 test dashboard - add script to convert mocks to b64

25e8678

place b64 mocks in the same folder so that devtools.js finds them

e8d82f4

lint env_image script

e9e221e

Merge remote-tracking branch 'origin/master' into handle-coded-typed-arrays

c7ec94f

Merge remote-tracking branch 'origin/master' into handle-coded-typed-arrays

cef2785

archmoj added 2 commits October 16, 2023 14:00

Merge remote-tracking branch 'origin/master' into handle-coded-typed-arrays

02557e5

Merge remote-tracking branch 'origin/master' into handle-coded-typed-arrays

3314377

mjainGH added the cycle-12 label Nov 9, 2023

Merge remote-tracking branch 'origin/master' into handle-coded-typed-arrays

cae76aa

archmoj mentioned this pull request Dec 21, 2023

Use plotly.js base64 API to store and pass typed arrays declared by numpy, pandas, etc. plotly/plotly.py#4470

Merged

mjainGH added the Cycle-13 label Jan 4, 2024

alexcjohnson approved these changes Jan 5, 2024


archmoj merged commit 7536e78 into master Jan 5, 2024

archmoj deleted the handle-coded-typed-arrays branch January 5, 2024 16:15

archmoj mentioned this pull request May 8, 2024

Add support for numeric font weight #6990

Merged

6 tasks

FlorentinTh mentioned this pull request May 24, 2024

[Snyk] Upgrade plotly.js-cartesian-dist-min from 2.12.1 to 2.32.0 FlorentinTh/LE2ML-GUI#762

Open

FlorentinTh mentioned this pull request Jul 17, 2024

[Snyk] Upgrade plotly.js-cartesian-dist-min from 2.12.1 to 2.33.0 FlorentinTh/LE2ML-GUI#768

Open

GregBrimble mentioned this pull request Aug 4, 2024

[Snyk] Upgrade plotly.js from 1.54.1 to 2.33.0 GregBrimble/workers.sh#581

Open

FlorentinTh mentioned this pull request Aug 9, 2024

[Snyk] Upgrade plotly.js-cartesian-dist-min from 2.12.1 to 2.34.0 FlorentinTh/LE2ML-GUI#773

Open
