Magzter GOLDで無制限に

Magzter GOLDで無制限に

10,000以上の雑誌、新聞、プレミアム記事に無制限にアクセスできます。

$149.99
 
$74.99/年
The Perfect Holiday Gift Gift Now

Large multimodal modelsAnother step towards AGI

PCQuest

|

september 2024

Large Multimodal Models LMMs) represent the next leap in Al, combining text, images, and audio into a single system that understands the world more like humans do. This advancement moves us closer to Al that can perform complex tasks across various domains, from healthcare to entertainment, and brings Us a step nearer to Artificial General Intelligence

- Amit Gupta

Large multimodal modelsAnother step towards AGI

The excitement surrounding large language models (LLMs) is rapidly increasing, with industries widely exploring diverse use cases. As a transformative technology, LLMs are being closely monitored for their potential to revolutionize and optimize everything from customer service to complex data analysis to advance health care. Bill Gates recently wrote a blog on how agents will be the next big thing in software. He further claimed that in the next 5 years, anyone who’s online will be able to have a personal assistant powered by artificial intelligence.

While the industries & user community are still embracing the euphoria of Large Language Models (LLMs), the Hi-Tech industry has already started to work on evolution of Large Multimodal Models (LMM) - a step towards extending the ‘emergent’ abilities of LLMs beyond text-only input/output models.

▾ Large Multimodal Models

We human beings are blessed with multiple sensory & cognitive capabilities and our intelligence is a collective intelligence derived from multiple sources. As we grow, we learn to use one or more of these ‘Modes of interactions’ to interact with the world around us. The future of AI will likely follow the same realm and will work on integrating multiple data modalities at input and/or output into AI models, leading to the development of LMMs. The input or output modes of interest could be text/language, images, video, audio, sensors data, actuator data, etc. Till recently, the focus was on unimodal models which could process only one data mode (such as text or speech or image) at a time.

By combining these different types of data, LMMs can achieve a more holistic understanding of the world, enabling them to perform complex tasks. For instance, an LMM could analyze a video, recognize objects, understand spoken language, and generate descriptive text all in one seamless iteration.

PCQuest からのその他のストーリー

PCQuest

PCQuest

The invisible intelligence powering healthcare and finance

What if your hospital's AI could think like a surgeon and your bank's software acted like a risk analyst? Inside Iksha Labs, machines aren't just smart, they're regulation-ready, real-time coworkers for the world's most demanding industries

time to read

5 mins

December 2025

PCQuest

PCQuest

How AI and cloud can optimize the performance and efficiency of edge devices

AI isn't just living in the cloud, it's getting its boots dirty at the edge. From oil rigs to warehouses, learn how smart tech is teaming up with cloud power to make machines faster, decisions sharper, and industries safer

time to read

2 mins

December 2025

PCQuest

PCQuest

Beyond automation: A shift in developer cognition

From modular code generation to knowledge-as-a-service, a new Al-human alliance is reshaping how enterprise software is built, tested, and governed. Welcome to the new age of intelligent development

time to read

5 mins

December 2025

PCQuest

PCQuest

Ubon SP-95

Budget Bluetooth speakers often try to pack in more than they can handle. The Ubon SP-95 takes a different route. It focuses on the basics and aims to execute them well. You get a 20W output, Bluetooth 5.3, USB and TF card playback, AUX input, FM radio, and a Type-C charging port. All of this comes at a price of Rs 1,499, which puts it in the sweet spot for students and young users who want something reliable without spending too much.

time to read

1 mins

December 2025

PCQuest

PCQuest

India's esports scene is about to go BOOM

India's gaming boom needs more than tournaments. It needs creators, infrastructure, pathways, and a long-term vision that treats esports as entertainment for all, not just the pro tier. JioBLAST wants to write that next chapter by blending fans, creators, and competitors into one connected ecosystem

time to read

6 mins

December 2025

PCQuest

PCQuest

AI's power shift begins at the edge

Cloud isn't king anymore. AI is moving home to your laptop, your office, and your private cloud. What's driving this silent shift from scale to sovereignty? The answer lies at the edge, where performance meets control

time to read

4 mins

December 2025

PCQuest

PCQuest

A quiet revolution under the hood

When hardware stops holding you back, imagination runs wild. From dorms to dev studios, Indian gamers are rewriting the rules not with hype, but with high frame rates, future-ready builds, and a hunger that's finally met its match.

time to read

3 mins

December 2025

PCQuest

PCQuest

The collaboration paradox

What if your workflow wasn't broken, but the tools were never built for your brain in the first place? A new creator-led rethink is turning chaotic feedback, endless loops, and scattered files into something surprisingly rare: peace

time to read

4 mins

December 2025

PCQuest

PCQuest

2025 inflection point Where hype met hard truth

2025 wasn't just another tech year. It was the year tech grew up, left behind the hype cycles, and got a real job. From autonomous AI to sovereign data bunkers, the industry finally started chasing outcomes, not headlines

time to read

4 mins

December 2025

PCQuest

PCQuest

The rise of Indian esports isn't luck; it's logistics

As esports in India finds mainstream momentum, a silent revolution is unfolding, shaped by smarter devices, deeper analytics, and disciplined creator ecosystems. The future isn't a bet. It's a build

time to read

4 mins

December 2025

Listen

Translate

Share

-
+

Change font size

Holiday offer front
Holiday offer back