Of course, almost all PDFs 'contain text' in the sense of having text that you can read, but I'm talking here about the difference between those in which that's just a bitmap that only gets interpreted as text by the brain of the human looking at the screen, versus those which also contain text as far as the computer is concerned.
This can be nonobvious in the case of a PDF scanned from paper. Sometimes, what you see on the screen looks like a blurry imperfect picture of text straight from the scan, but it turns out that the PDF has been through OCR, so even though you are being shown the original blurry bitmap, even though you are not gaining the benefit of the OCR while normally reading, the embedded text is still there, hidden in the file. Two ways in which it may manifest in a PDF reader:
Try to select text with the mouse.
Try searching for a word.
Of course it can also happen that some of the text has been OCR'd but not all.
In cases where both the above tests come up negative, is it then possible to say "no, this PDF does not contain embedded text," or can embedded text still be hidden in the file?
For example:
https://pdf.datasheetcatalog.com/datasheets/2300/45014_DS.pdf
As far as I can tell, the above PDF is all bitmaps, no embedded text. Is that correct, or am I still missing something?