HomeScienceHospitals use a transcription tool powered by a hallucination-prone OpenAI model

Hospitals use a transcription tool powered by a hallucination-prone OpenAI model

A number of months in the past, my physician confirmed off an AI transcription software he used to report and summarize his affected person conferences. In my case, the abstract was wonderful, however researchers cited by ABC Information have discovered that’s not at all times the case with OpenAI’s Whisper, which powers a software many hospitals use — generally it simply makes issues up completely.

Whisper is utilized by an organization known as Nabla for a medical transcription software that it estimates has transcribed 7 million medical conversations, in response to ABC Information. Greater than 30,000 clinicians and 40 well being techniques use it, the outlet writes. Nabla is reportedly conscious that Whisper can hallucinate, and is “addressing the issue.”

A gaggle of researchers from Cornell College, the College of Washington, and others present in a examine that Whisper hallucinated in about 1 % of transcriptions, making up total sentences with generally violent sentiments or nonsensical phrases throughout silences in recordings. The researchers, who gathered audio samples from TalkBank’s AphasiaBank as a part of the examine, be aware silence is especially widespread when somebody with a language dysfunction known as aphasia is talking.

One of many researchers, Allison Koenecke of Cornel College, posted examples just like the one under in a thread in regards to the examine.

The researchers discovered that hallucinations additionally included invented medical situations or phrases you may anticipate from a YouTube video, akin to “Thanks for watching!” (OpenAI reportedly used to transcribe over 1,000,000 hours of YouTube movies to coach GPT-4.)

The examine was introduced in June on the Affiliation for Computing Equipment FAccT convention in Brazil. It’s not clear if it has been peer-reviewed.

OpenAI spokesperson Taya Christianson emailed a press release to The Verge:

We take this situation critically and are regularly working to enhance, together with decreasing hallucinations. For Whisper use on our API platform, our utilization insurance policies prohibit use in sure high-stakes decision-making contexts, and our mannequin card for open-source use consists of suggestions in opposition to use in high-risk domains. We thank researchers for sharing their findings.

RELATED ARTICLES

Most Popular

Recent Comments