Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Multi-modal models that can process both ...
Another day, another AI model from Google. This time, Google DeepMind has released a new member of the Gemma 4 open model ...
Training a foundation LLM from scratch costs millions and requires internet-scale data — which is why most enterprises don't bother. Sapient thinks it has a cheaper path. To overcome this brute-force ...
Text emotion detection is an essential task in Natural Language Processing (NLP), with applications in customer support automation, diagnosing mental health, and social media analysis. Yet, precise ...
Google's Gemini Omni is a new multimodal model that reasons across text, images, audio, and video to generate and edit videos through simple conversation — starting with Omni Flash.
Google DeepMind released DiffusionGemma on June 10, 2026, an experimental open-weights model that writes text using discrete diffusion rather than the token-by-token method behind GPT-style systems ...
There is increasing adoption of artificial intelligence in drug discovery. However, existing studies use machine learning to mainly utilize the chemical structures of molecules but ignore the vast ...
OpenAI has launched Sora, it’s much awaited text-to-video generation model capable of creating videos from text prompts. The model allows users to convert written descriptions into videos, which can ...
Researchers at Amazon have trained the largest ever text-to-speech model yet, which they claim exhibits “emergent” qualities improving its ability to speak even complex sentences naturally. The ...
The firm is sharing Sora with a small group of safety testers but the rest of us will have to wait to learn more. OpenAI has built a striking new generative video model called Sora that can take a ...