Artificial Intelligence Race: After ChatGPT, Microsoft Introduces Kosmos-1, a New AI Model That Responds to Visual Cues

As the war over artificial intelligence (AI) chatbots heats up, Microsoft has unveiled Kosmos-1, a new AI model that can respond to visual cues or images as well as text prompts or messages.

Technology | IANS | Mar 03, 2023 08:00 PM IST

New Delhi, March 3: As the war over artificial intelligence (AI) chatbots heats up, Microsoft has unveiled Kosmos-1, a new AI model that can respond to visual cues or images in addition to text prompts or messages. The multimodal large language model (MLLM) can help with an array of new tasks, including image captioning, visual question answering and more.

Kosmos-1 can pave the way for the next stage beyond ChatGPT's text prompts. "A big convergence of language, multimodal perception, action, and world modeling is a key step toward artificial general intelligence. In this work, we introduce Kosmos-1, a Multimodal Large Language Model (MLLM) that can perceive general modalities, learn in context and follow instructions," said Microsoft's AI researchers in a paper.

The paper suggests that multimodal perception, or knowledge acquisition and "grounding" in the real world, is needed to move beyond ChatGPT-like capabilities to artificial general intelligence (AGI), reports ZDNet.

"More importantly, unlocking multimodal input greatly widens the applications of language models to more high-value areas, such as multimodal machine learning, document intelligence, and robotics," the paper read.

The goal is to align perception with large language models (LLMs), so that the models are able to see and talk. Experimental results showed that Kosmos-1 achieves impressive performance on language understanding and generation, and even on tasks where it is fed document images directly.

It also showed good results in perception-language tasks, including multimodal dialogue, image captioning, visual question answering, and vision tasks, such as image recognition with descriptions (specifying classification via text instructions).

"We also show that MLLMs can benefit from cross-modal transfer, i.e., transfer knowledge from language to multimodal, and from multimodal to language. In addition, we introduce a dataset of Raven IQ test, which diagnoses the nonverbal reasoning capability of MLLMs," said the team.

(The above story first appeared on LatestLY on Mar 03, 2023 08:00 PM IST. For more news and updates on politics, world, sports, entertainment and lifestyle, log on to our website latestly.com).


