Firestore: Optimize local cache sync when resuming a query that had docs deleted #4982

milaGGL · 2023-05-05T13:33:12Z

Ported from firebase/firebase-js-sdk#7229

Implement an optimization in Firestore when resuming a query where documents have either been deleted or no longer match the query on the server (a.k.a. "removed"). The optimization avoids re-running the entire query just to figure out which documents were deleted/removed in most cases.

Background Information

When a Firestore query is sent to the server, the server replies with the documents in the result set and a "resume token". The result set and the resume token are stored locally on the client. If the same query is resumed at a later time, such as by a later call to Query.get() or a listener registered via Query.addSnapshotListener() reconnects, then the client sends the same query to the server, but this time includes the resume token. To save on network bandwidth, the server only replies with the documents that have changed since the timestamp encoded in the resume token. Additionally, if the query is resumed within 30 minutes, and persistence is enabled, then the customer is only billed for the delta, and not the entire result set (see https://firebase.google.com/docs/firestore/pricing#listens for the official and most up-to-date details on pricing).

The problem is that if some documents in the result set were deleted or removed (i.e. changed to no longer match the query) then the server simply does not observe their presence in the result set and does not send updates for them. This leaves the client's cache in an inconsistent state because it still contains the deleted/removed documents. To work around this cache inconsistency, the server also replies with an "existence filter", a count of the documents that matched the query on the server. The client then compares this count with the number of documents that match the query in its local cache. If those counts are the same then all is good and the result set is raised via a snapshot; however, if the counts do not match then this is called an "existence filter mismatch" and the client re-runs the entire query from scratch, without a resume token, to figure out which documents in its local cache were deleted or removed. Then, the deleted or removed documents go into "limbo" and individual document reads are issued for each of the limbo documents to bring them into sync with the server.

The inefficiency is realized when the client "re-runs the entire query from scratch". This is inefficient for 2 reasons: (1) it re-transmits documents that were just sent when the query was resumed, wasting network bandwidth and (2) it results in being billed for document reads of the entire result set.

The Optimization

To avoid this expensive re-running of the query from scratch the server has been modified to also reply with the names of the documents that had not changed since the timestamp encoded in the resume token. With this additional information, the client can determine which documents in its local cache were deleted or removed, and directly put them into "limbo" without having to re-run the entire query from scratch.

The document names sent from the server are encoded in a data structure called a "bloom filter". A bloom filter is a size-efficient way to encode a "set" of strings. The size efficiency comes at the cost of correctness; that is, when testing for membership in a bloom filter it may incorrectly report that a value is contained in the bloom filter when in fact it is not (a.k.a. a "false positive"). The probability of this happening is made to be exceptionally low by tweaking the parameters of the bloom filter. However, when a false positive does happen then the client is forced to fall back to a full requery. But eliminating the vast majority of the full requeries is an overall win.

Googlers see go/firestore-ttl-deletion-protocol-changes for full details.

…requery (#4768)

…omFilter

…e bloom filter support has now been deployed to production. (#4871)

github-actions · 2023-05-05T13:33:27Z

📝 PRs merging into main branch

Our main branch should always be in a releasable state. If you are working on a larger change, or if you don't want this change to see the light of the day just yet, consider using a feature branch first, and only merge into the main branch when the code complete and ready to be released.

google-oss-bot · 2023-05-05T13:40:26Z

Coverage Report ¹

Affected Products

`firebase-firestore`

Overall coverage changed from 44.22% (41890a0) to 44.32% (4a18354) by +0.10%.

13 individual files with coverage change

Filename	Base (`41890a0`)	Merge (`4a18354`)	Diff
AutoValue_TestingHooks_ExistenceFilterBloomFilterInfo.java	?	20.00%	?
BitSequence.java	?	43.48%	?
BitSequenceOrBuilder.java	?	0.00%	?
BloomFilter.java	?	87.72%	?
BloomFilterOrBuilder.java	?	0.00%	?
BloomFilterProto.java	?	0.00%	?
ExistenceFilter.java	80.00%	90.00%	+10.00%
LruGarbageCollector.java	97.27%	93.64%	-3.64%
RemoteSerializer.java	79.18%	79.45%	+0.26%
RemoteStore.java	88.49%	88.80%	+0.31%
TargetData.java	77.50%	77.78%	+0.28%
TestingHooks.java	45.00%	64.52%	+19.52%
WatchChangeAggregator.java	98.26%	98.60%	+0.35%

Test Logs

https://storage.googleapis.com/firebase-sdk-metric-reports/K0OSqdmyes.html

github-actions · 2023-05-05T13:42:17Z

Unit Test Results

  162 files +    88   162 suites +88 1m 57s ⏱️ - 3m 10s
1 158 tests +  918 1 142 ✔️ +  903 16 💤 +16 0 ❌ - 1
2 316 runs +1 948 2 284 ✔️ +1 917 32 💤 +32 0 ❌ - 1

Results for commit d9a59e2. ± Comparison against base commit 41890a0.

This pull request removes 240 and adds 1158 tests. Note that renamed tests count towards both.

com.google.firebase.appcheck.FirebaseAppCheckRegistrarTest ‑ testGetComponents
com.google.firebase.appcheck.FirebaseAppCheckTest ‑ testGetInstance_defaultFirebaseAppName_matchesDefaultGetter
com.google.firebase.appcheck.FirebaseAppCheckTest ‑ testGetInstance_otherFirebaseAppName_doesNotMatch
com.google.firebase.appcheck.debug.DebugAppCheckProviderFactoryTest ‑ testGetInstance_callTwice_sameInstance
com.google.firebase.appcheck.debug.FirebaseAppCheckDebugRegistrarTest ‑ testGetComponents
com.google.firebase.appcheck.debug.internal.DebugAppCheckProviderTest ‑ exchangeDebugToken_onFailure_setsTaskException
com.google.firebase.appcheck.debug.internal.DebugAppCheckProviderTest ‑ exchangeDebugToken_onSuccess_setsTaskResult
com.google.firebase.appcheck.debug.internal.DebugAppCheckProviderTest ‑ testDetermineDebugSecret_noStoredSecret_createsNewSecret
com.google.firebase.appcheck.debug.internal.DebugAppCheckProviderTest ‑ testDetermineDebugSecret_storedSecret_usesExistingSecret
com.google.firebase.appcheck.debug.internal.DebugAppCheckProviderTest ‑ testPublicConstructor_nullFirebaseApp_expectThrows
…

com.google.firebase.TimestampTest ‑ testCompare
com.google.firebase.TimestampTest ‑ testFromDate
com.google.firebase.TimestampTest ‑ testRejectBadDates
com.google.firebase.TimestampTest ‑ testTimestampParcelable
com.google.firebase.firestore.AggregateQuerySnapshotTest ‑ createWithCountShouldReturnInstanceWithTheGivenQueryAndCount
com.google.firebase.firestore.AggregateQueryTest ‑ testSourceMustNotBeNull
com.google.firebase.firestore.BlobTest ‑ testComparison
com.google.firebase.firestore.BlobTest ‑ testEquals
com.google.firebase.firestore.BlobTest ‑ testMutableBytes
com.google.firebase.firestore.CollectionReferenceTest ‑ testEquals
…

♻️ This comment has been updated with latest results.

google-oss-bot · 2023-05-05T13:46:22Z

Size Report ¹

Affected Products

firebase-firestore
Type Base (41890a0) Merge (4a18354) Diff
aar 1.34 MB 1.36 MB +21.6 kB (+1.6%)
apk (aggressive) 518 kB 520 kB +1.95 kB (+0.4%)
apk (release) 3.94 MB 3.95 MB +8.75 kB (+0.2%)

Test Logs

https://storage.googleapis.com/firebase-sdk-metric-reports/mov1Uu0hVN.html

google-oss-bot · 2023-05-05T14:08:34Z

Startup Time Report ¹

Note: Layout is sometimes suboptimal due to limited formatting support on GitHub. Please check this report on GCS.

Notes

This report is for comparing the base commit (41890a0) and the CI merge commit (4a18354)
Please check below reports for each individual commit to find more details (Perfetto traces, histograms, detailed measurements)
- 41890a0: https://storage.googleapis.com/firebase-sdk-metric-reports/2IEpyZqRz2/index.html
- 4a18354: https://storage.googleapis.com/firebase-sdk-metric-reports/ZLMDOfOnqn/index.html

Startup Times

`fire-fst`

Device Statistics Distributions

oriole-32

Percentile	`41890a0`	`4a18354`	Diff	Significant (?)
p10	325 ±32 μs	351 ±80 μs	+26.4 μs (+8.1%)	NO
p25	338 ±47 μs	369 ±96 μs	+30.6 μs (+9.1%)	NO
p50	355 ±63 μs	398 ±114 μs	+43.3 μs (+12.2%)	NO
p75	381 ±73 μs	484 ±209 μs	+103 μs (+27.0%)	NO
p90	423 ±98 μs	636 ±432 μs	+213 μs (+50.4%)	NO

20 test runs in comparison

Commit	Test Runs
`41890a0`	2023-05-05_04:15:40.081227_YyuI 2023-05-05_04:15:40.083518_PZSZ 2023-05-05_04:15:40.083530_wNXk 2023-05-05_04:15:40.083536_tnaZ 2023-05-05_04:15:40.083542_wmss 2023-05-05_04:15:40.083547_UyBC 2023-05-05_04:15:40.083553_hXuX 2023-05-05_04:15:40.083558_eaTI 2023-05-05_04:15:40.083563_NUxE 2023-05-05_04:15:40.083569_BRjb
`4a18354`	2023-05-08_17:36:57.916586_bADU 2023-05-08_17:36:57.920282_nsGU 2023-05-08_17:36:57.920297_ZxcM 2023-05-08_17:36:57.920305_cdUZ 2023-05-08_17:36:57.920310_xQmU 2023-05-08_17:36:57.920316_Tnou 2023-05-08_17:36:57.920321_qVtH 2023-05-08_17:36:57.920326_apzT 2023-05-08_17:36:57.920331_uanA 2023-05-08_17:36:57.920336_cNVp

redfin-30

Percentile	`41890a0`	`4a18354`	Diff	Significant (?)
p10	642 ±31 μs	637 ±28 μs	-4.65 μs (-0.7%)	NO
p25	661 ±34 μs	655 ±30 μs	-5.58 μs (-0.8%)	NO
p50	688 ±39 μs	680 ±39 μs	-7.80 μs (-1.1%)	NO
p75	726 ±48 μs	712 ±53 μs	-14.1 μs (-1.9%)	NO
p90	768 ±60 μs	779 ±106 μs	+10.7 μs (+1.4%)	NO

20 test runs in comparison

Commit	Test Runs
`41890a0`	2023-05-05_04:15:40.081227_YyuI 2023-05-05_04:15:40.083518_PZSZ 2023-05-05_04:15:40.083530_wNXk 2023-05-05_04:15:40.083536_tnaZ 2023-05-05_04:15:40.083542_wmss 2023-05-05_04:15:40.083547_UyBC 2023-05-05_04:15:40.083553_hXuX 2023-05-05_04:15:40.083558_eaTI 2023-05-05_04:15:40.083563_NUxE 2023-05-05_04:15:40.083569_BRjb
`4a18354`	2023-05-08_17:36:57.916586_bADU 2023-05-08_17:36:57.920282_nsGU 2023-05-08_17:36:57.920297_ZxcM 2023-05-08_17:36:57.920305_cdUZ 2023-05-08_17:36:57.920310_xQmU 2023-05-08_17:36:57.920316_Tnou 2023-05-08_17:36:57.920321_qVtH 2023-05-08_17:36:57.920326_apzT 2023-05-08_17:36:57.920331_uanA 2023-05-08_17:36:57.920336_cNVp

`timeToInitialDisplay`

Device Statistics Distributions

oriole-32

Percentile	`41890a0`	`4a18354`	Diff	Significant (?)
p10	193 ±10 ms	207 ±25 ms	+14.1 ms (+7.3%)	NO
p25	201 ±17 ms	213 ±25 ms	+12.4 ms (+6.2%)	NO
p50	210 ±22 ms	222 ±31 ms	+12.0 ms (+5.7%)	NO
p75	219 ±26 ms	233 ±34 ms	+13.7 ms (+6.3%)	NO
p90	227 ±30 ms	250 ±39 ms	+22.6 ms (+10.0%)	NO

20 test runs in comparison

Commit	Test Runs
`41890a0`	2023-05-05_04:15:40.081227_YyuI 2023-05-05_04:15:40.083518_PZSZ 2023-05-05_04:15:40.083530_wNXk 2023-05-05_04:15:40.083536_tnaZ 2023-05-05_04:15:40.083542_wmss 2023-05-05_04:15:40.083547_UyBC 2023-05-05_04:15:40.083553_hXuX 2023-05-05_04:15:40.083558_eaTI 2023-05-05_04:15:40.083563_NUxE 2023-05-05_04:15:40.083569_BRjb
`4a18354`	2023-05-08_17:36:57.916586_bADU 2023-05-08_17:36:57.920282_nsGU 2023-05-08_17:36:57.920297_ZxcM 2023-05-08_17:36:57.920305_cdUZ 2023-05-08_17:36:57.920310_xQmU 2023-05-08_17:36:57.920316_Tnou 2023-05-08_17:36:57.920321_qVtH 2023-05-08_17:36:57.920326_apzT 2023-05-08_17:36:57.920331_uanA 2023-05-08_17:36:57.920336_cNVp

redfin-30

Percentile	`41890a0`	`4a18354`	Diff	Significant (?)
p10	232 ±5 ms	255 ±3 ms	+23.5 ms (+10.1%)	MAYBE
p25	238 ±5 ms	261 ±4 ms	+23.1 ms (+9.7%)	MAYBE
p50	245 ±6 ms	269 ±4 ms	+24.2 ms (+9.9%)	MAYBE
p75	253 ±6 ms	279 ±6 ms	+26.7 ms (+10.6%)	MAYBE
p90	262 ±6 ms	292 ±8 ms	+30.6 ms (+11.7%)	MAYBE

20 test runs in comparison

Commit	Test Runs
`41890a0`	2023-05-05_04:15:40.081227_YyuI 2023-05-05_04:15:40.083518_PZSZ 2023-05-05_04:15:40.083530_wNXk 2023-05-05_04:15:40.083536_tnaZ 2023-05-05_04:15:40.083542_wmss 2023-05-05_04:15:40.083547_UyBC 2023-05-05_04:15:40.083553_hXuX 2023-05-05_04:15:40.083558_eaTI 2023-05-05_04:15:40.083563_NUxE 2023-05-05_04:15:40.083569_BRjb
`4a18354`	2023-05-08_17:36:57.916586_bADU 2023-05-08_17:36:57.920282_nsGU 2023-05-08_17:36:57.920297_ZxcM 2023-05-08_17:36:57.920305_cdUZ 2023-05-08_17:36:57.920310_xQmU 2023-05-08_17:36:57.920316_Tnou 2023-05-08_17:36:57.920321_qVtH 2023-05-08_17:36:57.920326_apzT 2023-05-08_17:36:57.920331_uanA 2023-05-08_17:36:57.920336_cNVp

https://storage.googleapis.com/firebase-sdk-metric-reports/3or0f3IOJF/index.html

firebase-firestore/CHANGELOG.md

firebase-firestore/src/main/java/com/google/firebase/firestore/remote/BloomFilter.java

firebase-firestore/CHANGELOG.md

…feature has been merged into the android sdk in firebase/firebase-android-sdk#4982

…feature has been merged into the android sdk in firebase/firebase-android-sdk#4982 (#7285)

dconeybe · 2024-01-16T15:25:27Z

For a discussion about the implementation details of this PR, see firebase/firebase-ios-sdk#12270.

milaGGL and others added 30 commits January 17, 2023 10:29

Update protos to include bloom filter (#4564)

b58e756

Merge branch 'master' into mila/BloomFilter

1b0b3ed

Merge branch 'master' into mila/BloomFilter

fd907e5

Implement BloomFilter class (#4524)

b306135

Merge branch 'master' into mila/BloomFilter

82280ca

Add expected count to target (#4574)

2139f5e

Apply bloom filter on existence filter mismatch (#4601)

078c414

Merge branch 'master' into mila/BloomFilter

3862731

Merge branch 'master' into mila/BloomFilter

9e7023d

Update TargetData.java

7ea3c34

Add integration test for bloom filter (#4696)

8696c17

Merge branch 'master' into mila/BloomFilter

a9a2b42

Merge branch 'master' into mila/BloomFilter

8db1af7

fromat

9f66dea

Update MockDatastore.java

70326ee

Merge remote-tracking branch 'origin/master' into HEAD

3d406f2

Merge branch 'master' into mila/BloomFilter

9e889fe

Update WatchChangeAggregator.java

97dafee

Merge branch 'master' into mila/BloomFilter

90d57dd

update queryTest to be consistent with master

1bbf61b

Add new goog-listen-tag for bloom filter (#4777)

06479a0

Update the integration test to verify that bloom filter averted full …

59f2859

…requery (#4768)

Merge remote-tracking branch 'origin/master' into mila/BloomFilter

443289c

Merge remote-tracking branch 'origin/master' into mila/BloomFilter

9716326

Merge commit '687e079d401db1a75d2559038b879726a476ca3e' into mila/Blo…

36a1643

…omFilter

Merge remote-tracking branch 'origin/master' into mila/BloomFilter

ecda2ba

Merge branch 'master' into mila/BloomFilter

79cc918

Improve bloom filter application test coverage (#4828)

d63be2b

Merge remote-tracking branch 'origin/master' into mila/BloomFilter

8c6677f

QueryTest.java: Remove check for getTargetBackend() != NIGHTLY sinc…

158c084

…e bloom filter support has now been deployed to production. (#4871)

dconeybe and others added 3 commits May 2, 2023 15:00

Merge remote-tracking branch 'origin/master' into mila/BloomFilter

6b22865

Merge remote-tracking branch 'origin/master' into mila/BloomFilter

0a67ba1

Merge branch 'master' into mila/BloomFilter

9c12d19

milaGGL requested a review from dconeybe May 5, 2023 13:33

milaGGL self-assigned this May 5, 2023

milaGGL assigned dconeybe May 5, 2023

google-oss-bot added the size/XXL label May 5, 2023

Update CHANGELOG.md

2573baa

dconeybe reviewed May 7, 2023

View reviewed changes

firebase-firestore/CHANGELOG.md Outdated Show resolved Hide resolved

dconeybe reviewed May 7, 2023

View reviewed changes

firebase-firestore/src/main/java/com/google/firebase/firestore/remote/BloomFilter.java Outdated Show resolved Hide resolved

dconeybe mentioned this pull request May 7, 2023

Various improvements to Bloom Filter work #4986

Merged

milaGGL and others added 2 commits May 8, 2023 10:33

update CHANGELOG + format

cc0428a

Random improvements. (#4986)

d0849f4

dconeybe mentioned this pull request May 8, 2023

Firestore: Optimize local cache sync when resuming a query that had docs deleted firebase/firebase-js-sdk#7229

Merged

dconeybe removed their assignment May 8, 2023

dconeybe reviewed May 8, 2023

View reviewed changes

firebase-firestore/CHANGELOG.md Outdated Show resolved Hide resolved

Update CHANGELOG.md

d9a59e2

dconeybe approved these changes May 8, 2023

View reviewed changes

milaGGL merged commit c83d5e5 into master May 8, 2023

milaGGL deleted the mila/BloomFilter branch May 8, 2023 18:05

dconeybe added a commit to firebase/firebase-js-sdk that referenced this pull request May 8, 2023

Remove the no-android tag from bloom filter spec tests, now that the …

5b68ce2

…feature has been merged into the android sdk in firebase/firebase-android-sdk#4982

dconeybe mentioned this pull request May 8, 2023

Firestore: Remove the no-android tag from bloom filter spec tests firebase/firebase-js-sdk#7285

Merged

dconeybe added a commit to firebase/firebase-js-sdk that referenced this pull request May 8, 2023

Remove the no-android tag from bloom filter spec tests, now that the …

e147219

…feature has been merged into the android sdk in firebase/firebase-android-sdk#4982 (#7285)

firebase locked and limited conversation to collaborators Jun 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Firestore: Optimize local cache sync when resuming a query that had docs deleted #4982

Firestore: Optimize local cache sync when resuming a query that had docs deleted #4982

milaGGL commented May 5, 2023 •

edited by dconeybe

Loading

github-actions bot commented May 5, 2023 •

edited

Loading

google-oss-bot commented May 5, 2023 •

edited

Loading

`firebase-firestore`

github-actions bot commented May 5, 2023 •

edited

Loading

google-oss-bot commented May 5, 2023 •

edited

Loading

`firebase-firestore`

google-oss-bot commented May 5, 2023 •

edited

Loading

`fire-fst`

`timeToInitialDisplay`

dconeybe commented Jan 16, 2024

Firestore: Optimize local cache sync when resuming a query that had docs deleted #4982

Firestore: Optimize local cache sync when resuming a query that had docs deleted #4982

Conversation

milaGGL commented May 5, 2023 • edited by dconeybe Loading

Background Information

The Optimization

github-actions bot commented May 5, 2023 • edited Loading

📝 PRs merging into main branch

google-oss-bot commented May 5, 2023 • edited Loading

Coverage Report 1

Affected Products

firebase-firestore

Test Logs

github-actions bot commented May 5, 2023 • edited Loading

Unit Test Results

google-oss-bot commented May 5, 2023 • edited Loading

Size Report 1

Affected Products

firebase-firestore

Test Logs

google-oss-bot commented May 5, 2023 • edited Loading

Startup Time Report 1

Notes

Startup Times

fire-fst

timeToInitialDisplay

dconeybe commented Jan 16, 2024

milaGGL commented May 5, 2023 •

edited by dconeybe

Loading

github-actions bot commented May 5, 2023 •

edited

Loading

google-oss-bot commented May 5, 2023 •

edited

Loading

Coverage Report ¹

`firebase-firestore`

github-actions bot commented May 5, 2023 •

edited

Loading

google-oss-bot commented May 5, 2023 •

edited

Loading

Size Report ¹

`firebase-firestore`

google-oss-bot commented May 5, 2023 •

edited

Loading

Startup Time Report ¹

`fire-fst`

`timeToInitialDisplay`