Search: return relatives URLS #7376

stsewd · 2020-08-10T19:27:31Z

ericholscher

This looks good except for the hacky logic to get the path & domain :)

ericholscher · 2020-08-13T20:36:01Z

readthedocs/search/serializers.py

    highlights = PageHighlightSerializer(source='meta.highlight', default=dict)
    blocks = serializers.SerializerMethodField()

-    def get_link(self, obj):
+    def get_domain(self, obj):


This definitely seems like it's adding a bunch of queries in both of these functions to do the full resolve and then ignore parts of it (e.g. we don't care about subprojects for the domain). Is there a reason not to just call the resolver's resolve_path and resolve_domain directly here?

The result from _get_full_path is cached, and we already pass project_data into the context, so this won't generate any extra queries.

readthedocs.org/readthedocs/search/api.py

Lines 379 to 382 in 7274123

    def get_serializer_context(self):
        context = super().get_serializer_context()
        context['projects_data'] = self._get_all_projects_data()
        return context

Calling resolve_path and resolve_domain here will generate extra queries.

It seems like projects_data is only used in this one place, so I don't understand why we're caching it prior to calling this code? Seems like we could just remove all the pre-setting and only set it here when we actually use it?

The serializer only knows about one object, not about all of them. But the caller of this class has the list of all objects that the serializer is going to use, so it can retrieve all the data in one query.
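To make the point above concrete, here is a minimal sketch of the pattern in plain Python. It is not the Read the Docs code: `FakeDB`, `ResultSerializer`, and `serialize_results` are hypothetical stand-ins, and only the `projects_data` context key comes from the snippet quoted earlier. The idea is that the caller, which sees every result, fetches the project data once and hands it to the per-object serializer through its context.

```python
class FakeDB:
    """Hypothetical stand-in for the database; counts queries issued."""

    def __init__(self, rows):
        self.rows = rows  # {project_slug: docs_url}
        self.queries = 0

    def get_many(self, slugs):
        # One query returns the data for every requested project.
        self.queries += 1
        return {slug: self.rows[slug] for slug in slugs}


class ResultSerializer:
    """Serializes one search result; it only ever sees a single object."""

    def __init__(self, context):
        self.context = context

    def to_dict(self, result):
        # Read from the data the caller prefetched; no query happens here.
        docs_url = self.context['projects_data'][result['project']]
        return {'title': result['title'], 'domain': docs_url}


def serialize_results(db, results):
    """The caller has the full list, so it can fetch everything at once."""
    slugs = {result['project'] for result in results}
    context = {'projects_data': db.get_many(slugs)}
    serializer = ResultSerializer(context)
    return [serializer.to_dict(result) for result in results]
```

With this shape the number of queries stays constant regardless of how many results are serialized, at the cost of the extra plumbing that the review is questioning.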

Sure, but you're doing queries for every Domain for every subproject with this approach, instead of querying the doctype for every Version, which will lead to the same number of queries?

Ok, I see what you mean, yeah, that can be optimized to query the domain only once.
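One possible shape for "query the domain only once" is a small per-request cache keyed by project. This is only a sketch under assumptions: `DomainLookup` and the `query_domain` callable are hypothetical names, not part of the resolver or the search code.

```python
class DomainLookup:
    """Caches the domain per project so each project is queried once."""

    def __init__(self, query_domain):
        # query_domain is a hypothetical callable: project slug -> domain.
        self._query_domain = query_domain
        self._cache = {}
        self.queries = 0

    def get(self, project_slug):
        if project_slug not in self._cache:
            # Only the first request for a project reaches the backend.
            self.queries += 1
            self._cache[project_slug] = self._query_domain(project_slug)
        return self._cache[project_slug]
```

Every result that belongs to the same project or subproject then reuses the cached value instead of resolving the domain again.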

I don't think we need to worry too much about making this super efficient -- I'm actually saying we should make the code simpler rather than try and make it super fast. We don't do that many searches, so having simple code is probably better. I guess it might matter for projects with a lot of subprojects.

I also think we need to think a bit more deeply about how to make this stuff faster at the resolver level, rather than trying to optimize specific areas. We've done this a few times, and really the solution should be "calling the resolver is always fast".

ericholscher

This seems fine for now, though a little complicated :)

Search: return relatives URLS

1f3cd13

Closes #7311

ericholscher reviewed Aug 13, 2020


ericholscher approved these changes Aug 13, 2020


stsewd merged commit 393c7ed into master Aug 13, 2020

stsewd deleted the search-relative-urls branch August 13, 2020 23:11
