Phi-2: Microsoft Launches Robust AI ‘Small Language Model’ for Researchers

The first model, the 1.3 billion parameter Phi-1 achieved state-of-the-art performance on Python coding among existing SLMs (specifically on the HumanEval and MBPP benchmarks).

Technology IANS| Dec 17, 2023 02:29 PM IST

A+

A-

New Delhi, December 17: Microsoft has released its newest compact “small language model” titled Phi-2 that continues to perform at par or better than certain larger open-source Llama 2 models with less than 13 billion parameters. Over the past few months, the Machine Learning Foundations team at Microsoft Research has released a suite of small language models (SLMs) called “Phi” that achieve remarkable performance on a variety of benchmarks.

The first model, the 1.3 billion parameter Phi-1 achieved state-of-the-art performance on Python coding among existing SLMs (specifically on the HumanEval and MBPP benchmarks). Microsoft AI Skills Initiative: US Tech Giant Launches New Programme To Help People Learn Generative AI Tools.

"We are now releasing Phi-2, a 2.7 billion-parameter language model that demonstrates outstanding reasoning and language understanding capabilities, showcasing state-of-the-art performance among base language models with less than 13 billion parameters,” the company said in an update.

Phi-2 is an ideal playground for researchers, including for exploration around mechanistic interpretability, safety improvements, or fine-tuning experimentation on a variety of tasks. “We have made Phi-2 available in the Azure AI Studio model catalog to foster research and development on language models,” said Microsoft. Microsoft Copilot AI Chatbot Announces New Features as It Observes First Anniversary, Details Here.

The massive increase in the size of language models to hundreds of billions of parameters has unlocked a host of emerging capabilities that have redefined the landscape of natural language processing.

However, a question remains whether such emergent abilities can be achieved at a smaller scale using strategic choices for training, e.g., data selection. “Our line of work with the Phi models aims to answer this question by training SLMs that achieve performance on par with models of much higher scale (yet still far from the frontier models),” said Microsoft.

The company has also performed extensive testing on commonly used prompts from the research community. “We observed a behaviour in accordance with the expectation we had given the benchmark results,” said the tech giant.

(The above story first appeared on LatestLY on Dec 17, 2023 02:29 PM IST. For more news and updates on politics, world, sports, entertainment and lifestyle, log on to our website latestly.com).

City	Petrol	Diesel
New Delhi	96.72	89.62
Kolkata	106.03	92.76
Mumbai	106.31	94.27
Chennai	102.74	94.33

City

Petrol

Diesel

New Delhi

96.72

89.62

Kolkata

106.03

92.76

Mumbai

106.31

94.27

Chennai

102.74

94.33

Sambhal Train Derailment: Train Engine Derails During Shunting at Chandausi Railway Station in Uttar Pradesh, Video Surfaces

‘Isse Achha Toh Akshay Kumar Hai’: Shah Rukh Khan Faces Online Backlash for His Radio Silence on Dr Manmohan Singh’s Demise

Xiaomi 16 Series To Feature Snapdragon 8 Elite Processor Offering Higher Performance, Periscope Telephoto Camera: Report

Chris Lynn Crowned 'Best Hair' in BBL 2024–25 Player Survey for Second Consecutive Big Bash League Season

ISL 2024–25 Points Table Updated Live: NorthEast United Rise To Fourth Spot In Standings After Thumping 3-0 Win Over Mumbai City FC

Angelina Jolie, Brad Pitt Reach Divorce Settlement

United Cup 2024–25: Katie Boulter and Charles Broom Seal Decider for Great Britain Against Argentina in Group F

Samsung Electronics To Become Largest Shareholder in Rainbow Robotics To Push Robotics Technologies Including Humanoid Robots

Nagaland State Lottery Sambad Result Today 1 PM Live: Dear Godavari Tuesday Lottery Result of December 31 2024 Declared Online, Watch Lucky Draw Winners List

Russell Westbrook Records 'Perfect Triple-Double' During Utah Jazz vs Denver Nuggets NBA 2024-25 Match, Becomes Only Second Player in League History to Achieve Remarkable Feat (Watch Video Highlights)

Phi-2: Microsoft Launches Robust AI ‘Small Language Model’ for Researchers

The first model, the 1.3 billion parameter Phi-1 achieved state-of-the-art performance on Python coding among existing SLMs (specifically on the HumanEval and MBPP benchmarks).

Year Ender 2024: From Prajwal Revanna Sex Scandal to Tirupati Laddu Row, List of Political Controversies That Rocked India

ISRO’s SpaDeX Mission Set for December 30 Launch To Achieve Historic Space Docking Feat: Minister of Science and Technology Dr Jitendra Singh

Dr Eldho Varghese, Dr T G Sumithra, Two CMFRI Scientists Earn Prestigious NAAS Recognition for Their Contribution to Fisheries and Agricultural Science

Apple To Become World’s Most Valuable Company Soon, Nears USD 4 Trillion Market Cap Amid AI Push and iPhone Supercycle: Reports

Sambhal Train Derailment: Train Engine Derails During Shunting at Chandausi Railway Station in Uttar Pradesh, Video Surfaces

‘Isse Achha Toh Akshay Kumar Hai’: Shah Rukh Khan Faces Online Backlash for His Radio Silence on Dr Manmohan Singh’s Demise

Xiaomi 16 Series To Feature Snapdragon 8 Elite Processor Offering Higher Performance, Periscope Telephoto Camera: Report

Chris Lynn Crowned 'Best Hair' in BBL 2024–25 Player Survey for Second Consecutive Big Bash League Season

ISL 2024–25 Points Table Updated Live: NorthEast United Rise To Fourth Spot In Standings After Thumping 3-0 Win Over Mumbai City FC

Angelina Jolie, Brad Pitt Reach Divorce Settlement

Manali, Solang Valley Witness Traffic Chaos Due to Heavy Snowfall; ‘Koi Bhi Mat Aana’, Warns Tourist As Over 2,000 Vehicles Stranded in Snow (Watch Video)

Stock Market Holiday: Is Share Market Open or Closed on January 1, 2025? Know if Trading Will Happen on NSE, BSE on First Day of New Year

Keeratpura: Rescuers Fail to Locate 3-Year-Old Chetna Trapped in Borewell for 9 Days (Watch Video)

OpenAI CEO Sam Altman Teases Plans for 2025: AGI, ChatGPT 4o Upgrade and More; Check Details

South Korea To Inspect All Boeing 737-800 Planes Operated by Domestic Airlines After Jeju Air Plane Crash

New York Shocker: Baby Girl Found Abandoned Outside Building in Bronx, Video of Masked Woman Leaving Child in Green Bag Surfaces

Short Videos

Editor's Choice

Gujarat Shocker: Man Fakes Death by Murdering Friend in Rajkot’s Mota Mahika Village, Sets Victim’s Body Ablaze With Own Belongings To Claim Life Insurance

Polar Vortex Warning: Arctic Blast to Cause Widespread Temperature Plunge, Snowstorms Across US During January 2025; Record-Breaking Cold Likely

Animal Cruelty in Mumbai: Father-Son Duo Shoot Stray Dog ‘Wolfy’ With Air Gun in Lokhandwala for Barking Continuously, Detained by Police

Stocks To Buy or Sell Today, December 31: Mazagon Dock Shipbuilders, ITC and EaseMyTrip Among Shares That May Remain in Focus on Thursday, Know Which Stocks to Buy or Sell on December 31

Trending Topics