Model Text - Search News

Meta’s Transfusion model handles text and images in a single architecture

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Multi-modal models that can process both ...

18d

Google DeepMind releases DiffusionGemma, a model that runs local AI 4x faster

Another day, another AI model from Google. This time, Google DeepMind has released a new member of the Gemma 4 open model ...

VentureBeat

Researchers say they trained a foundation model from scratch for about $1,500

Training a foundation LLM from scratch costs millions and requires internet-scale data — which is why most enterprises don't bother. Sapient thinks it has a cheaper path. To overcome this brute-force ...

Nature

A novel hybrid model for emotion detection in text through sequential and transformer-based approaches: LSTM enhanced RoBERTa (LER)

Text emotion detection is an essential task in Natural Language Processing (NLP), with applications in customer support automation, diagnosing mental health, and social media analysis. Yet, precise ...

1monon MSN

Google’s Gemini Omni turns images, audio, and text into video — and that’s just the start

Google's Gemini Omni is a new multimodal model that reasons across text, images, audio, and video to generate and edit videos through simple conversation — starting with Omni Flash.

Tech Times

Google’s DiffusionGemma Generates Text 4x Faster: Diffusion Replaces Token-by-Token Output

Google DeepMind released DiffusionGemma on June 10, 2026, an experimental open-weights model that writes text using discrete diffusion rather than the token-by-token method behind GPT-style systems ...

Nature

Multi-modal molecule structure–text model for text-based retrieval and editing

There is increasing adoption of artificial intelligence in drug discovery. However, existing studies use machine learning to mainly utilize the chemical structures of molecules but ignore the vast ...

Your Story

OpenAI releases text-to-video model Sora

OpenAI has launched Sora, it’s much awaited text-to-video generation model capable of creating videos from text prompts. The model allows users to convert written descriptions into videos, which can ...

TechCrunch

Largest text-to-speech AI model yet shows ’emergent abilities’

Researchers at Amazon have trained the largest ever text-to-speech model yet, which they claim exhibits “emergent” qualities improving its ability to speak even complex sentences naturally. The ...

MIT Technology Review

OpenAI teases an amazing new generative video model called Sora

The firm is sharing Sora with a small group of safety testers but the rest of us will have to wait to learn more. OpenAI has built a striking new generative video model called Sora that can take a ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results