... I have 5 6V6 tubes and my tester tells me 2 are bad. It gives me values of 66, 52, 50, 30, and 23. I tried all of these tubes in my circuit and I'll be damned if I could hear much if any difference between any of the tubes, even the 23 and 30 functioned and sounded fine to me.
... maybe those low testing tubes can still be used in SE circuits?
Transconductance (Gm) tests aren't diagnostic for output tubes; if you look in the Radiotron Designer's Handbook (available in the Library of Information, linked at the bottom of this screen), a power output test is recommended. In this test, you use the tube as you have, apply a signal and measure output power. You could either measure output power for a fixed applied signal, or apply a signal which drives the tube to the onset of clipping and note the drive signal needed.
Idle Gm really only matches tubes at idle. It doesn't guarantee the tubes are matched at all times. And as you found out, it doesn't really show whether the tube will perform in every amp circuit.
So what about low-Gm tubes? I would expect a low-Gm tube to need more bias voltage to keep plate current at a specified level. That's because Gm is a measure of how effective the grid voltage is at controlling plate current. Say the tube is capable of passing just as much current as any point in its life; since the plate load impedance and power supply capabilities dictate the output power under this circumstance, the tube is capable of performing in the amp and making just as-loud a speaker output.
But the bigger bias voltage implies a bigger drive signal is required to push that low-Gm tube to the same plate current swings. So Same-Power-Output / More-Drive-Signal = Lower Power Sensitivity. In other words, an extreme case will result in your amp seeming weak ("I swear it used to get louder at '7' on the Volume control..."). However, as you're seeing, you may be perfectly happy with how the tube performs.
I didn't even get into the issues of whether the tube tester provides a fair test of the tube, has correct setup conditions vs specified reading (even in the old days, tester manufacturers sometimes got this wrong), how much your tester varies from every other tester out there, etc. You really have to get a feel for your individual tester, the readings it provides, and how that translates to reality in your experience.