BETA
This is a BETA experience. You may opt-out by clicking here

More From Forbes

Edit Story

Google Says Its Med-Gemini AI Healthcare Models Beat GPT-4

Following

The race for tailored medical AI models is heating up. Google and DeepMind released a paper on Monday describing Med-Gemini, a group of advanced AI models targeting healthcare applications. The models are still in the research phase, but the authors claim that Med-Gemini is outperforming competing models such as OpenAI’s GPT-4. However, the latter is not lagging behind in the medical arena, recently expanding its collaboration with Moderna, a large pharmaceutical company.

Med-Gemini’s striking leap forward, if validated in real-world settings, is its ability to capture context and temporality, such as potentially to understands the background and setting of symptoms as well as the timing and sequence of their onset. This is a known pitfall in existing health-related AI models. It’s true that we physicians are notorious for our abbreviations and lack of uniformity in documentation. Nonetheless, the true challenge in training medical algorithms is not the textual complexity—but rather the contextual one.

A simple example is one any parent of a toddler knows well: having to visit a pediatrician for your youngster’s fever and rash. The doctor will always ask, which came first, the fever or the rash? Did it spread from the head down or the legs up? These simple characteristics can differentiate a mild and self-limiting disease, like Roseola, from a potentially life threatening one, such as meningococcal meningitis.

These seemingly straightforward questions, with their multidimensionality and time-series characteristics, can throw an AI model completely off with the slightest inaccuracy.

This exact contextuality seems to have been tackled by Med-Gemini through breaking away from the massive undertaking of building an all-encompassing general medical model. Instead, Google’s developers have adopted a vertical-by-vertical approach of related models, referred to as a “family” of models, each optimizing a specific medical domain or scenario, such as image analysis in the fields of radiology and pathology, signal interpretation such as deciphering electrocardiogram exams or long-context understanding such as reading lengthy medical records. This, according to researchers, has resulted in improved and nuanced accuracy, and a more transparent reasoning, providing some interpretable feedback, such as why a suggested diagnosis is the most likely one.

As doctors are expected to keep abreast of recent research, Google seems to hold Med-Gemini to the same standard. The new model also incorporates a significant additional layer—a web-based search of up-to-date information, allowing augmentation of data with external knowledge, integrating online results into the model.

Though Med-Gemini has leveraged diverse data sources, such as excerpts from health records, X-rays, photos of skin lesions, medical exam prep question and others, it is still important to remember what has yet to happen: a real-world validation on actual production-level data in an everyday clinic setting, or at least a prospective double blind randomized clinical trial.

Multimodal models have provided AI powered health advancements. Yet, the burden of proof is still to be demonstrated in real-life clinical settings.

Follow me on Twitter or LinkedIn

Join The Conversation

Comments 

One Community. Many Voices. Create a free account to share your thoughts. 

Read our community guidelines .

Forbes Community Guidelines

Our community is about connecting people through open and thoughtful conversations. We want our readers to share their views and exchange ideas and facts in a safe space.

In order to do so, please follow the posting rules in our site's Terms of Service.  We've summarized some of those key rules below. Simply put, keep it civil.

Your post will be rejected if we notice that it seems to contain:

  • False or intentionally out-of-context or misleading information
  • Spam
  • Insults, profanity, incoherent, obscene or inflammatory language or threats of any kind
  • Attacks on the identity of other commenters or the article's author
  • Content that otherwise violates our site's terms.

User accounts will be blocked if we notice or believe that users are engaged in:

  • Continuous attempts to re-post comments that have been previously moderated/rejected
  • Racist, sexist, homophobic or other discriminatory comments
  • Attempts or tactics that put the site security at risk
  • Actions that otherwise violate our site's terms.

So, how can you be a power user?

  • Stay on topic and share your insights
  • Feel free to be clear and thoughtful to get your point across
  • ‘Like’ or ‘Dislike’ to show your point of view.
  • Protect your community.
  • Use the report tool to alert us when someone breaks the rules.

Thanks for reading our community guidelines. Please read the full list of posting rules found in our site's Terms of Service.