query: add OnMaxTries #308

kcalvinalvin · 2025-02-17T05:45:19Z

OnMaxTries allows an outside caller to dictate what should happen to a given worker in case of a timeout.
This allows for peer selection strategies to be made outside of the package, allowing for greater flexibility.

EDIT:
This PR is a dependency for btcsuite/btcd#2226

OnMaxTries config lets the caller choose what happens when a query goes over the maximum allowed tries. This config helps meet the needs of different callers as some may choose to immediately disconnect vs others that may choose to try again.

The peer may already be disconnected but the go runtime could have chosen to handle the message. To ensure that the handleResponse doesn't get called when the peer is already disconnected, add a select before handleResponse.

saubyk · 2025-05-13T00:07:00Z

cc: @mohamedawnallah for review

ellemouton · 2025-05-19T12:29:41Z

query/workmanager.go

@@ -94,6 +94,10 @@ type Config struct {
 	// make this configurable to easily mock the worker used during tests.
 	NewWorker func(Peer) Worker

+	// OnMaxTries is function closure that's called on max retries on
+	// workers.
+	OnMaxTries func(string)


think we should instead use a stricter type here (net.Addr) and mention what is being passed to this call-back so that the caller doesnt have to go look at the code to figure out what is being passed to the call-back it is setting.

The commit message isnt very accurate imo. OnMaxTries config lets the caller choose what happens when a query goes over the maximum allowed tries isnt really true - more so: OnMaxTries gives the caller access to the peer's address once a maxmum number of retries have been attempted" or something like that

it's worth linking the PR that makes use of this new config option in the PR description so that reviewers can understand the context a bit more

think we should instead use a stricter type here (net.Addr) and mention what is being passed to this call-back so that the caller doesnt have to go look at the code to figure out what is being passed to the call-back it is setting.

The commit message isnt very accurate imo. OnMaxTries config lets the caller choose what happens when a query goes over the maximum allowed tries isnt really true - more so: OnMaxTries gives the caller access to the peer's address once a maxmum number of retries have been attempted" or something like that

Will address both points.

ellemouton · 2025-05-19T12:38:10Z

query/worker.go

+				select {
+				// If the peer disconnects before giving us a valid
+				// answer, we'll also exit with an error.
+				case <-peer.OnDisconnect():
+					log.Debugf("Peer %v for worker disconnected, "+
+						"cancelling job %v", peer.Addr(),
+						job.Index())
+
+					jobErr = ErrPeerDisconnected
+					break Loop


i dont think this adds anything. This select is already covered in below and so if peer.OnDisconnect() triggers before the message is received on the msgChan, we will already catch the case below.

I think what you instead want is for job.HandleResp's function to know if it can exit or not.

The idiomatic way of doing this would be to pass it a ctx context.Context which gets cancelled when the peer disconnects. An intermediate step can just be to pass it a quit chan struct{} which you can derive from the peers OnDisconnect method and then listen on that in the HandleResp call-back

So the reason for adding this is because I saw that there are cases where case resp := <- msgChan: gets triggered first before the below case <-peer.OnDisconnect(): gets triggered.

I think what you instead want is for job.HandleResp's function to know if it can exit or not.

Yeah that would also solve the problem I was seeing.

An intermediate step can just be to pass it a quit chan struct{} which you can derive from the peers OnDisconnect method

That would be ok as well except now we force every implementation of HandleResp to handle the case where quit chan struct{} is passed.

Well, I'll also add that I don't really have a preference on how that's handled so I'll push changes so that HandleResp takes a quti chan struct{} as an argument.

lightninglabs-deploy · 2025-05-28T06:28:23Z

@kcalvinalvin, remember to re-request review from reviewers when ready

kcalvinalvin added 2 commits December 9, 2024 17:31

query: add onmaxtries config option

e6d7733

OnMaxTries config lets the caller choose what happens when a query goes over the maximum allowed tries. This config helps meet the needs of different callers as some may choose to immediately disconnect vs others that may choose to try again.

kcalvinalvin mentioned this pull request Feb 17, 2025

peer, main, netsync, blockchain: parallel block downloads btcsuite/btcd#2226

Open

kcalvinalvin marked this pull request as ready for review April 21, 2025 15:33

kcalvinalvin marked this pull request as draft April 21, 2025 16:14

kcalvinalvin marked this pull request as ready for review May 12, 2025 15:55

saubyk assigned kcalvinalvin May 13, 2025

saubyk requested a review from ellemouton May 16, 2025 15:27

ellemouton requested changes May 19, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

query: add OnMaxTries #308

query: add OnMaxTries #308

Uh oh!

kcalvinalvin commented Feb 17, 2025 •

edited

Loading

Uh oh!

saubyk commented May 13, 2025

Uh oh!

ellemouton May 19, 2025

Uh oh!

ellemouton May 19, 2025

Uh oh!

kcalvinalvin May 21, 2025

Uh oh!

ellemouton May 19, 2025

Uh oh!

kcalvinalvin May 21, 2025

Uh oh!

kcalvinalvin May 21, 2025

Uh oh!

lightninglabs-deploy commented May 28, 2025

Uh oh!

Uh oh!

query: add OnMaxTries #308

Are you sure you want to change the base?

query: add OnMaxTries #308

Uh oh!

Conversation

kcalvinalvin commented Feb 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

saubyk commented May 13, 2025

Uh oh!

ellemouton May 19, 2025

Choose a reason for hiding this comment

Uh oh!

ellemouton May 19, 2025

Choose a reason for hiding this comment

Uh oh!

kcalvinalvin May 21, 2025

Choose a reason for hiding this comment

Uh oh!

ellemouton May 19, 2025

Choose a reason for hiding this comment

Uh oh!

kcalvinalvin May 21, 2025

Choose a reason for hiding this comment

Uh oh!

kcalvinalvin May 21, 2025

Choose a reason for hiding this comment

Uh oh!

lightninglabs-deploy commented May 28, 2025

Uh oh!

Uh oh!

kcalvinalvin commented Feb 17, 2025 •

edited

Loading