Apple’s Creator Studio Pro leverages AI to help creators with tedious tasks — like finding clips or building slides — without trying to do the work for them.
Q.ai is an Israeli startup specializing in imaging and machine learning, particularly technologies that enable devices to interpret whispered speech and enhance audio in noisy environments.
The ElevenLabs CEO argued at Web Summit Qatar that voice is the next interface for AI, as OpenAI, Google, and Apple push conversational systems into wearables, new hardware, and everyday interactions.
Alibaba released Qwen 3.5, a multimodal model capable of processing videos up to two hours in length. The model ships with open weights, so it can be deployed freely without per-token API costs. The release opens long-context video understanding to developers who previously relied on closed commercial APIs.
The feature is designed to work inside the website to understand its content and layout, allowing site owners to make changes with natural language commands. The WordPress AI assistant doesn't need precisely tailored prompts, either.
Seven frontier-class AI models launched within a 28-day window spanning late January to late February 2026, including Opus 4.6, GPT-5.3-Codex, Gemini 3 Deep Think, Sonnet 4.6, and Gemini 3.1 Pro. The release cadence is a marked acceleration over prior years, when comparable capability jumps arrived quarterly at most. The pace raises practical questions for engineering teams about how to structure model dependencies in production applications.
Anthropic launched Claude Memory on March 1, 2026, enabling persistent context across sessions for all users. The feature leverages the 1-million-token context windows available in Opus and Sonnet models. Integration with Microsoft PowerPoint and Excel is included at launch.
Luma introduced Luma Agents, powered by its new “Unified Intelligence” models, designed to coordinate multiple AI systems and generate end-to-end creative work across text, images, video and audio.
OpenAI launches GPT-5.4 with Pro and Thinking versions; Google releases Gemini 3.1 Flash Lite at one-eighth the cost of Pro; and where things stand with the Department of War and Anthropic.
The company's new product, called Gamma Imagine, will let users employ text prompts to create brand-specific assets like interactive charts and visualizations, marketing collateral, social graphics, and infographics.
Meta released Muse Spark, the first model from its Superintelligence Labs division led by Alexandr Wang, on April 8, 2026. The model, developed under the codename Avocado, scores 86.4% on figure understanding benchmarks but 42.5% on abstract reasoning tasks. The release follows Meta's $14 billion investment in Wang's team and highlights that perception and reasoning capabilities remain difficult to develop in tandem.
Liquid AI just released LFM2.5-VL-450M, an updated version of its earlier LFM2-VL-450M vision-language model. The new release introduces bounding box prediction, improved instruction following, expanded multilingual understanding, and function calling support, all within a 450M-parameter footprint.
In this tutorial, we walk through MolmoAct step by step and build a practical understanding of how action-reasoning models can reason in space from visual observations. We set up the environment, load the model, prepare multi-view image inputs, and explore how MolmoAct produces depth-aware reasoning.
According to Bloomberg reporter Mark Gurman, Apple is developing smart glasses that skip the display entirely and instead function as an AI wearable.
A single image becomes a talking character: LPM 1.0 generates real-time video with lip sync, facial expressions, and emotional reactions. For now, it remains a research project.
An internal OpenAI memo lays out five strategic priorities for the enterprise business - including a new model codenamed "Spud," a platform play for AI agents, and the blunt accusation that Anthropic is overstating its revenue by 8 billion dollars.