Sorry, I forgot to reply to this thread. Hopefully people are still interested, because we’re about to get into the good stuff (finally).
Quick recap, Hero has decided to always raise the A, always call the K and always fold the Q. In response, villain will always fold to raise.
Hero can now start bluffing raising their Q. If villain didn’t adjust, Hero could bluff raise their Q 100% of the time, but obviously villain with have the A sometimes and so isn’t going to fold to the raise in that case. Once villain knows that hero is bluff raising with the Q, they need to start calling sometimes with the K.
It turns out that if Hero bluffs with a Q 20% of the time, Hero always makes $0.37 per hand. That’s true if villain calls with a K 100% of the time, or folds 100% of the time, or anywhere in between.
It may seem that villain is now indifferent when facing a raise and holding a K, but they’re actually not. If they fold 100%, hero can start deviating again and start bluffing the Q 100%. So villain can’t do any better than losing $0.37 per hand, but they can do worse. It turns out villain must call with a K 20% of the time as well. Now, Hero wins 37c if they always fold the Q, 37c if the always raise the Q, and everything in between.
We now have an equilibrium. If hero deviates from raising the Q 20% of the time, villain can reduce their average loss. If villain deviates from calling with the K 20% of the time, Hero can increase their average profit.
If you’re wondering why 20%, it’s because of the pot odds we’ve set up in this toy game. The interesting things is that you can get to that value through trial and error, without ever having to calculate anything, and that is in fact what a solver does. (It is actually what I did too - I couldn’t be bothered working out what the answer should be and just plugged numbers into the google sheet I posted earlier)
Here’s the important thing to note: If Hero bluffs their Q at 19% instead of 20%, villain should always fold their K. If villain calls with a K only 19% of the time, Hero should always bluff raise the Q.
That is, an equilibrium is really a very finely balanced seesaw (teeter totter for 'merican’s), and if someone even just farts on the other end, the balance shifts completely.
A solver is always on the upside of that shift, but might only be 20% away from the middle, where we can be on the very far end.
I’ll leave it at that for now, but I have more points to make, some of them not even flatulence related.