Search: Use serializers to parse search results #7157

stsewd · 2020-06-04T04:37:13Z

This is towards #5821, #6341, and #5966, and towards having a stable, understandable API for search.

I'm not using these serializers in the API yet. I want to handle the update to the JS and to the server in another PR, making sure it doesn't break for users with a cached JS file.

I had to update some API tests because they depend on data shared with the view tests, but that change can be reverted once the serializers are used in the API.

stsewd · 2020-06-04T17:28:44Z

readthedocs/search/views.py

-                for result in results:
-                    inner_hits = result.meta.inner_hits
-                    sections = inner_hits.sections or []
-                    domains = inner_hits.domains or []
-                    all_results = itertools.chain(sections, domains)
-
-                    sorted_results = utils._get_sorted_results(
-                        results=all_results,
-                        source_key='source',
-                    )
-
-                    result.meta.inner_hits = sorted_results


This logic was moved to the serializer itself.
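For readers who don't have the removed view code in front of them, the merge-and-sort step can be sketched in plain Python. This is only an illustration: it assumes each hit carries a numeric `_score` and that sorting is by descending score, neither of which the diff shows, and `merge_and_sort_inner_hits` is a hypothetical name rather than the helper used in the PR.

```python
import itertools


def merge_and_sort_inner_hits(sections, domains, score_key="_score"):
    """Combine section and domain hits and order them by score, best first.

    Hypothetical stand-in for the logic that moved into the serializer;
    the real helper may rank hits differently.
    """
    # Either group may be missing, mirroring the `or []` guards in the old view.
    all_hits = itertools.chain(sections or [], domains or [])
    return sorted(all_hits, key=lambda hit: hit[score_key], reverse=True)


sections = [
    {"type": "section", "_score": 1.5},
    {"type": "section", "_score": 0.2},
]
domains = [{"type": "domain", "_score": 0.9}]
merged = merge_and_sort_inner_hits(sections, domains)
```

Doing this inside a serializer method keeps the view free of result-shaping code, which seems to be the point of the move.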

stsewd · 2020-06-04T17:29:46Z

readthedocs/search/serializers.py

+    inner_hits = serializers.SerializerMethodField()
+
+    def get_link(self, obj):
+        # TODO: optimize this to not query the db for each result.


We are already doing one query per result, so this isn't something I'm introducing in this PR. We can fix this when merging the serializer from the api.

It's doing more than 1 query if we're calling resolve. We probably want to add this to the index, instead of computing it on render -- is that the plan?

> We probably want to add this to the index

My plan was to do something similar to subprojects: fetch the project once, resolve the main domain for that project, and then pass it down here.
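The "resolve once, pass it down" idea could look roughly like the sketch below. It is plain Python with no database; `DomainCache`, `build_links`, and the `resolve_domain` callable are all hypothetical names standing in for whatever the project resolver actually provides.

```python
class DomainCache:
    """Resolve each project's domain at most once per response."""

    def __init__(self, resolve_domain):
        # Hypothetical callable that performs the expensive lookup.
        self._resolve_domain = resolve_domain
        self._cache = {}

    def get(self, project_slug):
        if project_slug not in self._cache:
            self._cache[project_slug] = self._resolve_domain(project_slug)
        return self._cache[project_slug]


def build_links(results, cache):
    """Build one link per result, reusing the cached domain per project."""
    return [
        f"{cache.get(result['project'])}/{result['path'].lstrip('/')}"
        for result in results
    ]


calls = []


def fake_resolve(slug):
    # Records every lookup so we can see how often the resolver is hit.
    calls.append(slug)
    return f"https://{slug}.example.com"


results = [
    {"project": "docs", "path": "index.html"},
    {"project": "docs", "path": "/api/v2.html"},
    {"project": "kong", "path": "install.html"},
]
links = build_links(results, DomainCache(fake_resolve))
```

With three results across two projects, the resolver runs twice instead of three times; in a serializer the cache could be handed down through the serializer context.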

stsewd · 2020-06-04T17:30:23Z

readthedocs/search/tests/test_api.py

+OLD_TYPES = {
+    'domain': 'domains',
+    'section': 'sections',
+}
+OLD_FIELDS = {
+    'docstring': 'docstrings',
+}
+


This can be removed once we start using the same serializer in the API.
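As a rough illustration of what these mappings allow, a test could translate the serializer's names back to the old dotted keys that the API still returns. `old_highlight_key` is a hypothetical helper, not something from the PR, and the `title` field used in the example is an assumption.

```python
OLD_TYPES = {
    "domain": "domains",
    "section": "sections",
}
OLD_FIELDS = {
    "docstring": "docstrings",
}


def old_highlight_key(new_type, new_field):
    """Map a (type, field) pair from the serializer to the old dotted key."""
    # Names without an entry in the mappings are assumed to be unchanged.
    old_type = OLD_TYPES.get(new_type, new_type)
    old_field = OLD_FIELDS.get(new_field, new_field)
    return f"{old_type}.{old_field}"


docstring_key = old_highlight_key("domain", "docstring")
name_key = old_highlight_key("domain", "name")
title_key = old_highlight_key("section", "title")  # field name is an assumption
```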

stsewd · 2020-06-04T17:34:53Z

readthedocs/search/serializers.py

+    def to_representation(self, instance):
+        return {
+            'name': getattr(instance, 'domains.name', []),
+            'docstring': getattr(instance, 'domains.docstrings', []),


This is named docstrings, but we only store one docstring per domain. (Note that it defaults to a list here because this is the result of the highlight.)
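The `getattr` calls with a dotted name may look odd, so here is a small sketch of why they work. The highlight keys are literal strings such as `domains.name`, which cannot be written as `instance.domains.name`, but `getattr` passes the whole string through. `Highlight` below is a hypothetical stand-in for the object returned by the Elasticsearch client, not its real class.

```python
class Highlight:
    """Minimal stand-in for a highlight object whose keys contain dots."""

    def __init__(self, data):
        self._data = data

    def __getattr__(self, name):
        # Only called when normal attribute lookup fails, so `_data` is safe.
        try:
            return self._data[name]
        except KeyError:
            raise AttributeError(name)


def domain_highlight(instance):
    """Mirror the to_representation body from the diff above."""
    return {
        # The third argument to getattr is returned when the key is absent.
        "name": getattr(instance, "domains.name", []),
        "docstring": getattr(instance, "domains.docstrings", []),
    }


highlight = Highlight({"domains.name": ["<span>foo</span>"]})
representation = domain_highlight(highlight)
```

Here `representation["docstring"]` falls back to the empty list because that key was never highlighted, which matches the note about the default being a list.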

ericholscher

This looks like a great start.

A good way to enable this on the API might be adding a new API endpoint that returns these results, and we can keep serving the old endpoint for a release after the deploy, to allow the JS to switch over.

ericholscher · 2020-06-15T22:36:44Z

readthedocs/search/serializers.py

+    path = serializers.CharField(source='full_path')
+    link = serializers.SerializerMethodField()
+    highlight = PageHighlightSerializer(source='meta.highlight', default=dict)
+    inner_hits = serializers.SerializerMethodField()


I wonder if this should just be called hits? Or something even more explicit? search_results or something?

What about sections or blocks? This should be something that includes the results from domains and sections (and maybe images/codeblocks in the future?)

Sure, but they are still results, no? I guess I don't see what value inner adds here, and it seems confusing.

Yeah, they are results (and the object itself is a result, so that was kind of my idea for dropping the results suffix). And yeah, inner hits is more of an ES thing than something users should care about.

ericholscher · 2020-06-15T22:36:59Z

readthedocs/search/serializers.py

+    inner_hits = serializers.SerializerMethodField()
+
+    def get_link(self, obj):
+        # TODO: optimize this to not query the db for each result.


It's doing more than 1 query if we're calling resolve. We probably want to add this to the index, instead of computing it on render -- is that the plan?

ericholscher

This makes sense to me. 👍

I do think we likely want some more documentation on the fields and what they mean, but that can probably come with the public API.

stsewd added 3 commits June 3, 2020 23:25

Use serializers to parse search results

a035ba2

Update templates

c57df3f

Update tests

3c0d2a7

stsewd added the "PR: work in progress" label (pull request is not ready for full review) Jun 4, 2020

stsewd added 4 commits June 4, 2020 10:35

Merge branch 'master' into api-for-search

078adf6

Update

1cc6d1a

Typo

388ce8c

update docstring

b2bed6d

stsewd commented Jun 4, 2020

stsewd removed the "PR: work in progress" label (pull request is not ready for full review) Jun 4, 2020

Linter

db8e203

stsewd requested a review from a team June 4, 2020 17:38

Merge branch 'master' into api-for-search

1197370

ericholscher reviewed Jun 15, 2020

stsewd added 3 commits June 22, 2020 11:00

Merge branch 'master' into api-for-search

7ac6194

Feedback from review

445bfc2

Add todo

5ec5e36

stsewd requested a review from ericholscher June 22, 2020 19:19

ericholscher approved these changes Jun 22, 2020

stsewd merged commit 9846324 into master Jun 23, 2020

stsewd deleted the api-for-search branch June 23, 2020 19:25
