gRPC: add more unit tests for `Stream` and `Datastore` #1935

var-const · 2018-10-12T02:06:52Z

No description provided.

…nit-tests-domain

var-const · 2018-10-12T02:12:06Z

Firestore/core/test/firebase/firestore/remote/datastore_test.mm

+  Shutdown();
+  datastore.reset();
+
+  EXPECT_NO_THROW(credentials.InvokeGetToken());


This test currently fails, I'll fix before merging.

Hmm, this actually looks like a non-trivial issue.

The root of the problem is that there's an implicit dependency between grpc::CompletionQueue and grpc::ByteBuffer's lifetimes. The smallest repro is just:

{ grpc::Slice slice{"foo"}; grpc::ByteBuffer b{&slice, 1}; // Buffer must be non-empty grpc::CompletionQueue cq; // Assuming it's the only gRPC-related object around } // Once the scope ends, assertion will be triggered, because cq was destroyed before b

Details:

In gRPC, C core is initialized once and shut down once. All C++ classes that need the core to be initialized inherit from GrpcLibraryCodegen. GrpcLibraryCodegen essentially makes C core reference-counted; each constructor increments, and each destructor decrements, the number of references to C core, and once the last reference is destroyed, the C core is shut down.

In this case, destroying Datastore destroys grpc::CompletionQueue, which happens to be the last reference to C core, so the line 154 (datastore.reset()) leads to global shutdown. The global shutdown, among other things, shuts down ExecCtx.

When EmptyCredentialsProvider::GetToken is called, the TokenListener (a std::function) is passed by value, so at the end of the call the destructor of TokenListener is called, which leads to the destruction of a lambda created by Datastore that contains a grpc::ByteBuffer. When a grpc::ByteBuffer is destroyed, it creates an ExecCtx, which fails because global shutdown has already been called on ExecCtx, leading to an assertion failure and a crash.

(Note that the fact that GetToken takes its argument by value isn't really an issue here; if the argument were taken by reference, the problem would surface when the credentials provider is destroyed. The root of the problem seems that the ByteBuffer-containing lambda may outlive gRPC core).

@wilhuff Re. the above:

As far as gRPC is concerned, do you feel it might be a bug? (and hence, worth reporting)

I presume we care about this case (Auth outlives Firestore) and don't want a crash there -- let me know if I misunderstand.

Submitted an issue to gRPC repo: grpc/grpc#16875

Re (1): this could be a bug, but realistically I think it's one we'll have to work around. Possibly this means that we should not be using ByteBuffers for anything except directly sending into/out of gRPC calls such that the construction order you're describing never happens. However, it's also possible I'm misunderstanding, because it seems like we really shouldn't get into a state where we've destroyed the completion queue before the last byte buffer we might have submitted into it.

Re (2): I don't think the issue is that we care so much about auth outliving firestore as we want to handle races where Firestore may be asked to shutdown while an auth request is pending. We should not crash in this circumstance.

In our public API shutdown is asynchronous, so we could work around this by performing teardown in two passes: a first pass to quiesce the system, inhibiting new requests and waiting for any outstanding ones and then tearing things down.

Alternatively, for any request that might outlive the system add some way to disconnect it such that when it calls back it doesn't attempt any action on the already destroyed system.

var-const · 2018-10-12T02:14:03Z

Firestore/core/test/firebase/firestore/remote/datastore_test.mm

+}
+
+/*
+TEST_F(DatastoreTest, AuthWhenDatastoreHasBeenShutDown) {


This will currently fail; I left it mainly for discussion. I don't know if it can be an issue -- it would depend on how likely it is for Datastore to be shut down but not destroyed, so that Auth has a chance to invoke its callback in-between.

Well, I don't know how likely that is, but I suppose you could have Datastore record the fact that it's shutdown, and then check that in the callbacks that auth invokes? That way, if we ever end up in that situation, we can either (a) abort, or (b) do something intelligent, rather than just blinding proceeding as if the Datastore is still active.

zxu123

Can you do a sync? Some changes I've saw in your other merged PR and do not belong to this PR.

var-const · 2018-10-12T17:08:59Z

Can you do a sync? Some changes I've saw in your other merged PR and do not belong to this PR.

@zxu123 Hmm, looks correct to me. Can you point me to the duplicate changes? (perhaps you mean very similar tests between grpc_stream_test.h and stream_test.h?)

rsgowman · 2018-10-12T19:36:31Z

Firestore/core/test/firebase/firestore/remote/datastore_test.mm

+}
+
+/*
+TEST_F(DatastoreTest, AuthWhenDatastoreHasBeenShutDown) {


Well, I don't know how likely that is, but I suppose you could have Datastore record the fact that it's shutdown, and then check that in the callbacks that auth invokes? That way, if we ever end up in that situation, we can either (a) abort, or (b) do something intelligent, rather than just blinding proceeding as if the Datastore is still active.

…eBuffer` right until the call is started (#1949) (see [here](#1935 (comment)) and [here](grpc/grpc#16875) for context) The problem with serializing a domain object immediately is that the resulting `ByteBuffer` is stored in a `std::function` within Auth. `ByteBuffer`s become invalid once gRPC core shuts down, so if Auth happens to outlive Firestore, once the `ByteBuffer`'s destructor is invoked, the app will crash.

var-const added 15 commits October 9, 2018 18:33

lint

ce6949c

small fixes

f6f3f84

Add grpc_connection_test

282009c

Missed test cases

635c473

Review feedback

59f070c

Forgotten files

3139cb4

linter

7170b70

Remove unnecessary split

a0f061f

quick fix

5bfc05b

Initial

1d38fb1

Delete some of the more excessive tests

dd3d10c

Fix bug in WriteAndFinish

23070bb

Merge branch 'varconst/grpc-unit-tests-wrappers' into varconst/grpc-u…

9dca0f9

…nit-tests-domain

compiles

928953a

Merge branch 'master' into varconst/grpc-unit-tests-domain

29e726c

var-const added the api: firestore label Oct 12, 2018

var-const assigned rsgowman and zxu123 Oct 12, 2018

var-const requested review from rsgowman and zxu123 October 12, 2018 02:06

googlebot added the cla: yes label Oct 12, 2018

linter

40b6260

var-const commented Oct 12, 2018

View reviewed changes

zxu123 reviewed Oct 12, 2018

View reviewed changes

Merge branch 'master' into varconst/grpc-unit-tests-domain

7a86c73

zxu123 approved these changes Oct 12, 2018

View reviewed changes

var-const unassigned zxu123 Oct 12, 2018

rsgowman approved these changes Oct 12, 2018

View reviewed changes

rsgowman assigned var-const and unassigned rsgowman Oct 12, 2018

var-const added 3 commits October 12, 2018 18:12

Simple test fixes

4c57926

Disable failing test, reenable fixed test

7687464

style

5c00ca6

var-const merged commit d8bb9b3 into master Oct 14, 2018

var-const mentioned this pull request Oct 15, 2018

gRPC: in Datastore, delay serializing domain object to a grpc::ByteBuffer right until the call is started #1949

Merged

paulb777 deleted the varconst/grpc-unit-tests-domain branch May 26, 2019 20:48

firebase locked and limited conversation to collaborators Oct 26, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

gRPC: add more unit tests for `Stream` and `Datastore` #1935

gRPC: add more unit tests for `Stream` and `Datastore` #1935

Uh oh!

var-const commented Oct 12, 2018

Uh oh!

var-const Oct 12, 2018

Uh oh!

var-const Oct 12, 2018

Uh oh!

var-const Oct 12, 2018

Uh oh!

var-const Oct 14, 2018

Uh oh!

wilhuff Oct 15, 2018 •

edited

Loading

Uh oh!

var-const Oct 12, 2018

Uh oh!

rsgowman Oct 12, 2018

Uh oh!

zxu123 left a comment

Uh oh!

var-const commented Oct 12, 2018

Uh oh!

rsgowman Oct 12, 2018

Uh oh!

Uh oh!

gRPC: add more unit tests for Stream and Datastore #1935

gRPC: add more unit tests for Stream and Datastore #1935

Uh oh!

Conversation

var-const commented Oct 12, 2018

Uh oh!

var-const Oct 12, 2018

Choose a reason for hiding this comment

Uh oh!

var-const Oct 12, 2018

Choose a reason for hiding this comment

Uh oh!

var-const Oct 12, 2018

Choose a reason for hiding this comment

Uh oh!

var-const Oct 14, 2018

Choose a reason for hiding this comment

Uh oh!

wilhuff Oct 15, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

var-const Oct 12, 2018

Choose a reason for hiding this comment

Uh oh!

rsgowman Oct 12, 2018

Choose a reason for hiding this comment

Uh oh!

zxu123 left a comment

Choose a reason for hiding this comment

Uh oh!

var-const commented Oct 12, 2018

Uh oh!

rsgowman Oct 12, 2018

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gRPC: add more unit tests for `Stream` and `Datastore` #1935

gRPC: add more unit tests for `Stream` and `Datastore` #1935

wilhuff Oct 15, 2018 •

edited

Loading