Microsoft launches Phi-3 Mini, a tiny AI model that packs a punch

Phi-3 Mini was designed with smartphones in mind.
By Cecily Mauran
Image: a futuristic rendering of a brain on top of a computing chip. Bigger isn't necessarily better. Credit: da-kuk / Getty Images

Microsoft has released Phi-3 Mini, a new version of its lightweight AI model designed for specific tasks.

According to the research paper published earlier this week, Phi-3 Mini has 3.8 billion parameters, significantly fewer than other models like OpenAI's GPT-4, making it small enough to be deployed on a smartphone. OpenAI hasn't shared how many parameters GPT-4 has, but it's believed to have over one trillion, according to Semafor.
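To get a rough feel for why 3.8 billion parameters can fit on a phone, here is a minimal back-of-the-envelope sketch in Python. It only multiplies the parameter count by the storage size of each weight. The precision formats listed (fp32, fp16, int8, int4) are illustrative assumptions, not details from the article, and the estimate ignores everything a running model needs beyond its weights.

```python
# Rough weight-storage estimate for a model with a given parameter count.
# Ignores activations, caches, and runtime overhead, so real usage is higher.

PHI3_MINI_PARAMS = 3.8e9  # parameter count reported in the article

# Assumed storage formats (illustrative; not taken from the article).
BITS_PER_PARAM = {"fp32": 32, "fp16": 16, "int8": 8, "int4": 4}


def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Return the space needed for the weights alone, in decimal gigabytes."""
    return num_params * bits_per_param / 8 / 1e9


if __name__ == "__main__":
    for name, bits in BITS_PER_PARAM.items():
        print(f"{name}: {weight_memory_gb(PHI3_MINI_PARAMS, bits):.1f} GB")
```

At 4 bits per weight the figure comes to roughly 1.9 GB, which is within reach of a modern smartphone's memory, whereas a trillion-parameter model at the same precision would need about 500 GB for its weights alone.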

Traditional AI models require massive amounts of computing power, which is expensive and carries a huge carbon footprint. Companies like Microsoft and Google have been working on smaller, lightweight models that handle common tasks. That would make hosting their models more sustainable (in the operational sense) and more suitable for smartphones, which is where the industry is heavily leaning. Samsung is going all in on generative AI with a collection of features for its Galaxy devices, Google is adding generative AI features to its Pixel lineup, and even Apple is expected to make some big AI announcements for iOS 18.

Parameters relate to how well models can tackle complexity, so the more parameters a model has, the more capable it is of handling vast and nuanced requests. But for everyday tasks that the average user would need from an AI model, such as translating, drafting an email, or looking for local restaurants, a smaller, lightweight model is presumed to be sufficient.


Phi-3 Mini scored similarly to Meta's open-source model Llama 3 and OpenAI's GPT-3.5 on common benchmarks, with a few exceptions. It surpassed Llama 3 and scored just below GPT-3.5 in natural language understanding (MMLU) and commonsense reasoning (HellaSwag), and it beat both models on arithmetic reasoning (GSM8K). As the paper notes, it scored lower on trivia and "factual knowledge," but researchers believe "such weakness can be resolved by augmentation with a search engine," meaning that once the model is hooked up to the internet, this shouldn't be much of an issue.

Researchers trained Phi-3 Mini on a combination of "heavily filtered web data" that meets standards for high-quality educational information, as well as synthetic data, which challenges the idea that scraping everything from the web is the best way to train a model. The model was also trained on... bedtime stories, according to DailyAI, which actually makes a ton of sense for understanding the way human brains work. The idea is to opt for quality over quantity with curated data, so the model can run on fewer parameters while still retaining its potency.

Phi-3 Mini is now available on Hugging Face, Azure, and Ollama.
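For readers who want to try the model locally, the following is a minimal sketch of sending a prompt through Ollama's HTTP API from Python. It assumes that an Ollama server is already running on its default local port and that the model has been pulled under the tag `phi3`. The helper names `build_generate_request` and `generate` are hypothetical and exist only for this illustration.

```python
import json
import urllib.request

# Assumptions: a local Ollama server on its default port, and the model
# available under the tag "phi3". Adjust both for your own setup.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(prompt: str, model: str = "phi3") -> urllib.request.Request:
    """Build (but do not send) a non-streaming generate request."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = "phi3") -> str:
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_generate_request(prompt, model)) as resp:
        return json.loads(resp.read().decode("utf-8"))["response"]
```

Calling `generate("Draft a two-line email asking to reschedule a meeting.")` would return the model's reply as a string, provided the server is running. Nothing is sent over the network until `generate` is called.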

Cecily Mauran

Cecily is a tech reporter at Mashable who covers AI, Apple, and emerging tech trends. Before getting her master's degree at Columbia Journalism School, she spent several years working with startups and social impact businesses for Unreasonable Group and B Lab. Before that, she co-founded a startup consulting business for emerging entrepreneurial hubs in South America, Europe, and Asia. You can find her on Twitter at @cecily_mauran.

