• 0 Posts
  • 23 Comments
Joined 1 year ago
cake
Cake day: June 5th, 2023

help-circle
  • The device wouldn’t necessarily have to be constantly streaming the audio to a central server. If it’s capable of hearing wake up words like “Ok Google” it’s capable of listening for other phrases and having onboard processing to relay back the results much more compressed. Whether or not this is common practice is another matter, and yes the algorithms are scary good even without eavesdropping.





  • That’s fair. I think fundamentally a false positive/negative isn’t that much different. Pretty much all tests—especially those dealing with real world conditions—are heuristic, as are all LLMs by necessity of the design. Hallucination is a pretty specific term given to AI as an attempt to assign agency to a system that doesn’t actually have any (by implying it’s crazy and making stuff up instead of a black box with deterministic inputs and outputs spitting out something factually wrong but with a similar format to what is trained on). I feel like the nature of any tool where “you can’t trust this to be entirely accurate” should have an umbrella term that encompasses both types of providing inaccurate info under certain conditions.

    I suppose the difference is that AI is a lot more likely to randomly go off, whereas a blood test is likelier to provide repeated false positives for the same person with their unique biology? There’s also the fact that most medical tests represent a true/false dichotomy or lookup table, whereas an LLM is given the entire bounds of language.

    Would an AI clustering algorithm (say, K-means for instance) giving an inaccurate diagnosis be a false positive/negative or a hallucination? These models can be programmed on a sliding scale and I feel like there’s definitely an area where the line could get pretty blurry.


  • I mean, AI is used in fraud detection pretty often; when it hits a false positive (which happens frequently on a population-level basis), is that not a hallucination of some sort? Obviously LLMs can go off the rails much further because it’s readable text, but any machine learning model will occasionally spit out really bad guesses almost any person could have done better with. (To be fair, humans are highly capable of really bad guesses too).












  • Thanks for letting me know. I thumbed through the article just to make sure and noticed the number was different, but figured if it changed once it would change multiple times and didn’t want to play a pointless game of catch up when my issue was over the use of a term, not the specific number.

    I had hoped the WHO would use more scientifically precise language, especially since they’re supposed to be a trusted authority on this subject. I think organizations which muddy the waters on terms like this, intentionally or not, end up damaging to scientific literacy for the average person who might not know the difference. It makes things confusing, especially because -fold is used to mean powers of 2 in some contexts and a reader could end up being misinformed if they came across such a headline on said topic in the future.