-
The new AI-powered Google Finance is expanding to Europe. by AI on May 11, 2026
This week, the new, AI-powered Google Finance is launching across Europe, with full local language support. This reimagined experience offers a suite of powerful capabil…
-
‘Your Career Starts at the Beginning of the AI Revolution,’ NVIDIA CEO Tells Graduates by Matthew Leib (NVIDIA Blog) on May 10, 2026
“You are entering the world at an extraordinary moment,” NVIDIA founder and CEO Jensen Huang told graduates as he delivered the keynote address at Carnegie Mellon University’s 128th commencement ceremony on Sunday. “A new industry is being born. A new era of science and discovery is beginning.” “No generation has entered the world with more
-
Building realistic electric transmission grid dataset at scale: a pipeline from open dataset by Andrea Britto Mattos Lima, Thiago Vallin Spina, Weiwei Yang, Spencer Fowers, Ruslan Nagimov, Baosen Zhang (Microsoft Research) on May 8, 2026
Microsoft Research is excited to release an open dataset of approximate transmission topology of the U.S. power grid derived from publicly available data. The ability to study transmission-level power grid behavior is essential for modern power systems research. Analyses of congestion, transmission expansion, demand growth, and system resilience all depend on network models with realistic […]
-
See what happens when creative legends use AI to make ads for small businesses. by AI on May 8, 2026
Today we're launching The Small Brief, an initiative bringing together three ad industry icons to champion local businesses they love. Their mission is to build breakt…
-
Halliburton enhances seismic workflow creation with Amazon Bedrock and Generative AI by Yuan Tian (Artificial Intelligence) on May 8, 2026
In this post, we'll explore how we built a proof-of-concept that converts natural language queries into executable seismic workflows while providing a question-answering capability for Halliburton's Seismic Engine tools and documentation. We'll cover the technical details of the solution, share evaluation results showing workflow acceleration of up to 95%, and discuss key learnings that can help other organizations enhance their complex technical workflows with generative AI.
-
Powering the Next American Century: US Energy Secretary Chris Wright and NVIDIA’s Ian Buck on the Genesis Mission by Brian Caulfield (NVIDIA Blog) on May 7, 2026
AI will help build the energy it needs. That’s the case U.S. Energy Secretary Chris Wright and NVIDIA Vice President of Hyperscale and High-Performance Computing Ian Buck made Thursday morning at the SCSP AI+ Expo. The 30-minute fireside chat, moderated by SCSP president Ylli Bajraktari, was called “Powering the Next American Century.” Their argument: American
-
Secure short-term GPU capacity for ML workloads with EC2 Capacity Blocks for ML and SageMaker training plans by Vanessa Ji (Artificial Intelligence) on May 7, 2026
In this post, you will learn how to secure reserved GPU capacity for short-term workloads using Amazon Elastic Compute Cloud (Amazon EC2) Capacity Blocks for ML and Amazon SageMaker training plans. These solutions can address GPU availability challenges when you need short-term capacity for load testing, model validation, time-bound workshops, or preparing inference capacity ahead of a release.
-
Overcoming reward signal challenges: Verifiable rewards-based reinforcement learning with GRPO on SageMaker AI by Surya Kari (Artificial Intelligence) on May 7, 2026
In this post, you will learn how to implement reinforcement learning with verifiable rewards (RLVR) to introduce verification and transparency into reward signals to improve training performance. This approach works best when outputs can be objectively verified for correctness, such as in mathematical reasoning, code generation, or symbolic manipulation tasks. You will also learn how to layer techniques like Group Relative Policy Optimization (GRPO) and few-shot examples to […]
-
Linked and Loaded: Gaijin Single Sign-On Now Available on GeForce NOW by GeForce NOW Community (NVIDIA Blog) on May 7, 2026
Less typing, more tanking. Faster logins mean more time in the gaming action — and this week provides GeForce NOW members with a smoother path straight into the battlefield. Cloud gaming is all about instant access to titles across devices, and the latest GeForce NOW update removes another layer for members jumping into their Gaijin
-
Agents that transact: Introducing Amazon Bedrock AgentCore payments, built with Coinbase and Stripe by Preethi C N (Artificial Intelligence) on May 7, 2026
Today, we're announcing a preview of Amazon Bedrock AgentCore Payments, a new set of features in Amazon Bedrock AgentCore that enables AI agents to instantly access and pay for what they use. AgentCore Payments was developed in partnership with Coinbase and Stripe.
-
5 gardening tips you can try right in Search by (AI) on May 6, 2026
We’ve rounded up the top ways you can use Google’s AI Mode, Search Live and Shopping to help your plants thrive.
-
Cost effective deployment of vision-language models for pet behavior detection on AWS Inferentia2 by Ray Wang (Artificial Intelligence) on May 6, 2026
Tomofun, the Taiwan-headquartered pet-tech startup behind the Furbo Pet Camera, is redefining how pet owners interact with their pets remotely. To reduce costs and maintain accuracy, Tomofun turned to EC2 Inf2 instances powered by AWS Inferentia2, Amazon's purpose-built AI chips. In this post, we walk through the following sections in detail.
-
NVIDIA Spectrum-X — the Open, AI-Native Ethernet Fabric — Sets the Standard for Gigascale AI, Now With MRC by Gilad Shainer (NVIDIA Blog) on May 6, 2026
The race to build the world’s most powerful AI factories demands networking that keeps pace with the ambitions of AI itself. NVIDIA Spectrum-X Ethernet scale-out infrastructure stands at the forefront of that race as the most advanced AI networking technology available today, deployed by industry leaders who can’t afford to compromise on performance, resilience or
-
AlphaEvolve: How our Gemini-powered coding agent is scaling impact across fields by Google DeepMind News on May 6, 2026
Explore how AlphaEvolve's Gemini-powered algorithms are driving impact across business, infrastructure, and science.
-
NVIDIA and ServiceNow Partner on New Autonomous AI Agents for Enterprises by Kari Briski (NVIDIA Blog) on May 5, 2026
Enterprise AI has learned to generate. It has learned to reason. Now companies are asking the next question: How should AI act? Early agent systems have shown what’s possible, moving beyond simple prompts to take on more complex tasks. The next step is bringing those capabilities into enterprise environments — where agents must operate with
-
How Hapag-Lloyd uses Amazon Bedrock to transform customer feedback into actionable insights by Aamna Najmi (Artificial Intelligence) on May 5, 2026
Hapag-Lloyd's Digital Customer Experience and Engineering team, distributed between Hamburg and Gdańsk, drives digital innovation by developing and maintaining customer-facing web and mobile products. In this post, we walk you through our generative AI–powered feedback analysis solution built using Amazon Bedrock, Elasticsearch, and open-source frameworks like LangChain and LangGraph
-
Streamlining generative AI development with MLflow v3.10 on Amazon SageMaker AI by Sandeep Raveesh-Babu (Artificial Intelligence) on May 5, 2026
Today, we’re excited to announce that Amazon SageMaker AI MLflow Apps now support MLflow version 3.10, bringing enhanced capabilities for generative AI development and streamlined experiment tracking to your generative AI workflows. Building on the foundations established with Amazon SageMaker AI MLflow Apps, this latest version introduces powerful new features for observability, evaluation, and generative
-
Introducing OS Level Actions in Amazon Bedrock AgentCore Browser by Evandro Franco (Artificial Intelligence) on May 5, 2026
We’re announcing OS Level Actions for AgentCore Browser. This new capability unblocks these scenarios by exposing direct OS control through the InvokeBrowser API, so agents can interact with content visible on the screen, not only what's accessible through the browser's web layer. By combining full-desktop screenshots with mouse and keyboard control at the OS level, agents can observe native UI, reason about it, and act on it within the same session. This post walks […]
-
Microsoft at NSDI 2026: Advances in large-scale networked systems by Sujata Banerjee (Microsoft Research) on May 5, 2026
Microsoft researchers share advances in building and operating large-scale distributed systems, spanning datacenters, networking, and the growing intersection with AI during NSDI ’26.
-
Google is partnering with XPRIZE and Range Media Partners on the $3.5 million Future Vision film competition. by AI on May 5, 2026
Google is partnering with XPRIZE and Range Media Partners on the $3.5 million Future Vision film competition.
-
Secure AI agents with Amazon Bedrock AgentCore Identity on Amazon ECS by Julian Grüber (Artificial Intelligence) on May 5, 2026
AI agents in production require secure access to external services. Amazon Bedrock AgentCore Identity, available as a standalone service, secures how your AI agents access external services whether they run on compute platforms like Amazon ECS, Amazon EKS, AWS Lambda, or on-premises. This post implements Authorization Code Grant (3-legged OAuth) on Amazon ECS with secure session binding and scoped tokens.
-
Intelligence-driven message defense and insights using Amazon Bedrock by Tyler Huehmer (Artificial Intelligence) on May 5, 2026
In this post, you will learn how you can use Amazon Nova Foundation Models in Amazon Bedrock to apply generative AI techniques for both business protection and enhancement. You can identify obvious and disguised attempts at direct contact while gaining valuable insights into customer sentiment and service improvement opportunities.
-
Beyond BI: How the Dataset Q&A feature of Amazon Quick powers the next generation of data decisions by Salim Khan (Artificial Intelligence) on May 4, 2026
Business leaders across industries rely on operational dashboards as the shared source of truth that their teams execute against daily. But dashboards are built to answer known questions. When teams need to explore further with ad hoc, multi-dimensional, or unforeseen questions, they hit a bottleneck. They wait hours or days for BI teams to build new views
-
Introducing agent quality optimization in AgentCore, now in preview by Bharathi Srinivasan (Artificial Intelligence) on May 4, 2026
Generate recommendations from production traces, validate them with batch evaluation and A/B testing, and ship with confidence. AI agents that perform well at launch don’t stay that way. As models evolve, user behavior shifts, and prompts get reused in new contexts they were never designed for, agent quality quietly degrades. In most teams, the improvement
-
Agent-guided workflows to accelerate model customization in Amazon SageMaker AI by Lauren Mullennex (Artificial Intelligence) on May 4, 2026
Amazon SageMaker AI now offers an agentic experience that changes this. Developers describe their use case using natural language, and the AI coding agent streamlines the entire journey, from use case definition and data preparation through technique selection, evaluation, and deployment. In this post, we walk you through the model customization lifecycle using SageMaker AI agent skills.
-
The latest AI news we announced in April 2026 by (AI) on May 4, 2026
Here are Google’s latest AI updates from April 2026
-
Generate dashboards from natural language prompts in Amazon Quick by Salim Khan (Artificial Intelligence) on May 4, 2026
Building meaningful dashboards demands hours of manual setup, even for experienced BI professionals. Amazon Quick now generates complete multi-sheet dashboards from natural language prompts, taking you from one or more datasets to a production-ready analysis in minutes. Data analysts building recurring operations reports, program managers preparing a leadership review, or engineers exploring a new dataset can
-
From data lake to AI-ready analytics: Introducing new data source with S3 Tables in Amazon Quick by Raji Sivasubramaniam (Artificial Intelligence) on May 4, 2026
Amazon Quick introduces Amazon S3 Tables (Apache Iceberg tables) as a new data source. With this feature, customers can directly query and visualize Apache Iceberg tables stored in an Amazon S3 table bucket without the need for intermediate data layers. In this post, we explore how Amazon Quick’s new Amazon S3 Tables data source enables near real-time analytics while streamlining modern data architectures.
-
Introducing Dataset Q&A: Expanding natural language querying for structured datasets in Amazon Quick by Surendran Raju (Artificial Intelligence) on May 4, 2026
In this post, you learn how to get started with Dataset Q&A, explore real-world use cases with hands-on examples, and discover advanced capabilities like auto-discovery across all your data assets and multi-dataset querying in a single conversation.
-
Capacity-aware inference: Automatic instance fallback for SageMaker AI endpoints by Kareem Syed-Mohammed (Artificial Intelligence) on May 4, 2026
Today, Amazon SageMaker AI introduces capacity-aware instance pools for new and existing inference endpoints. You define a prioritized list of instance types, and SageMaker AI automatically works through your list whenever capacity is constrained at creation, during scale-out, and during scale-in. Your endpoint provisions on available AI infrastructure without manual intervention. This capability is available for Single Model Endpoints, Inference Component-based endpoints, […]
-
Reduce friction and latency for long-running jobs with Webhooks in Gemini API by (AI) on May 4, 2026
Event-Driven Webhooks are a push-based notification system that eliminates the need for inefficient polling.
-
AWS Transform now automates BI migration to Amazon Quick in days by Anantha Choppalli, Ahil Gunasekaran, Taher Paratha (Artificial Intelligence) on May 1, 2026
In this post, we walk through the full journey, from setting up your migration workspace in AWS Transform to subscribing to partner agents through AWS Marketplace to unlocking Amazon Quick capabilities that change how your organization consumes data.
-
Red-teaming a network of agents: Understanding what breaks when AI agents interact at scale by Gagan Bansal, Shujaat Mirza, Keegan Hines, Will Epperson, Zachary Huang, Whitney Maxwell, Pete Bryan, Tyler Payne, Adam Fourney, Amanda Swearngin, Wenyue Hua, Tori Westerhoff, Amanda Minnich, Maya Murad, Ece Kamar, Ram Shankar Siva Kumar, Saleema Amershi (Microsoft Research) on April 30, 2026
Safe agents don’t guarantee a safe ecosystem of interconnected agents. Microsoft Research examines what breaks when AI agents interact and why network-level risks require new approaches.
-
Reinforcement fine-tuning with LLM-as-a-judge by Hemanth Kumar Jayakumar (Artificial Intelligence) on April 30, 2026
In this post, we take a deeper look at how RLAIF, or RL with LLM-as-a-judge, works effectively with Amazon Nova models.
-
Nemotron Labs: What OpenClaw Agents Mean for Every Organization by Justin Boitano (NVIDIA Blog) on April 30, 2026
By early 2026, the open source project OpenClaw had become a phenomenon. In January, its GitHub star count crossed 100,000 as developer interest surged.
-
AWS Generative AI Model Agility Solution: A comprehensive guide to migrating LLMs for generative AI production by Long Chen (Artificial Intelligence) on April 30, 2026
In this post, we introduce a systematic framework for LLM migration or upgrade in generative AI production, encompassing essential tools, methodologies, and best practices. The framework facilitates transitions between different LLMs by providing robust protocols for prompt conversion and optimization.
-
It’s Gonna Be May: 16 Games Hit the Cloud This Month, With More NVIDIA GeForce RTX 5080 Power by GeForce NOW Community (NVIDIA Blog) on April 30, 2026
[Editor’s note] The blog has been updated to note that GeForce RTX 5080-power expansion also extends to the Install-to-Play library. It’s gonna be May — and the cloud’s in full festival mode. 16 games are joining GeForce NOW this month, including new AAA titles arriving on launch day from Steam, Xbox, PC Game Pass and
-
Enabling a new model for healthcare with AI co-clinician by Google DeepMind News on April 30, 2026
Researching the path to AI-augmented care and development of an AI co-clinician.
-
NVIDIA Launches Nemotron 3 Nano Omni Model, Unifying Vision, Audio and Language for up to 9x More Efficient AI Agents by Kari Briski (NVIDIA Blog) on April 28, 2026
AI agent systems today juggle separate models for vision, speech and language — losing time and context as they pass data from one model to the other. Unveiled today, NVIDIA Nemotron 3 Nano Omni is an open multimodal model that brings these capabilities together into one system, enabling agents to deliver faster, smarter responses with
-
Celebrating 20 years of Google Translate: Fun facts, tips and new features to try by (AI) on April 28, 2026
Google’s sharing 20 fun facts to celebrate Google Translate turning 20, from its roots as a 2006 AI experiment to supporting almost 250 languages today.
-
Into the Omniverse: Manufacturing’s Simulation-First Era Has Arrived by Bhoomi Gadhia (NVIDIA Blog) on April 28, 2026
Manufacturing’s traditional design-build-test cycle rested on a single assumption: Real-world testing was the only reliable test environment.
-
Join the new AI Agents Vibe Coding Course from Google and Kaggle by (AI) on April 27, 2026
Google is bringing back its 5-Day AI Agents Intensive Course with Kaggle and registration is open.
-
Announcing our partnership with the Republic of Korea by Google DeepMind News on April 27, 2026
Google DeepMind and Korea partner to accelerate scientific breakthroughs using frontier AI models
-
8 Gemini tips for organizing your space (and life) by (AI) on April 24, 2026
Organize your home and digital space with Gemini. Use AI-powered tips for cleaning schedules, inbox decluttering, and seasonal chores.
-
OpenAI’s New GPT-5.5 Powers Codex on NVIDIA Infrastructure — and NVIDIA Is Already Putting It to Work by Justin Boitano (NVIDIA Blog) on April 23, 2026
AI agents have revolutionized developer workflows, and their next frontier is knowledge work: processing information, solving complex problems, coming up with new ideas and driving innovation. Codex, OpenAI’s agentic coding application, is enabling this new frontier. It’s now powered by GPT-5.5, OpenAI’s latest frontier model, which runs on NVIDIA GB200 NVL72 rack-scale systems. Over 10,000
-
Tag, You’re It: GeForce NOW Levels Up Game Discovery With Xbox Game Pass and Ubisoft+ Labels by GeForce NOW Community (NVIDIA Blog) on April 23, 2026
GeForce NOW is doubling down on what matters most: gamers. This week’s upgrades bring smarter libraries, making it easier than ever for gamers to turn a PC collection into a cloud-powered flex. It starts with giving existing libraries time to shine. Gamers can bring the games they love to the cloud, stream them with high
-
Making Sense of the Early Universe by Brian Caulfield (NVIDIA Blog) on April 23, 2026
This Spring Astronomy Day, here’s a look at how AI and GPUs are helping astronomers work through unprecedented volumes of cosmic data.
-
Here’s how our TPUs power increasingly demanding AI workloads. by AI on April 23, 2026
Learn how Google’s TPUs power increasingly demanding AI workloads with this new video.
-
Elevating Austria: Google invests in its first data center in the Alps. by AI on April 23, 2026
Google has been a proud part of Austria’s landscape for years, and today, we’re announcing our first data center in Kronstorf, generating 100 direct jobs. This facility …
-
AutoAdapt: Automated domain adaptation for large language models by Sidharth Sinha, Anson Bastos, Xuchao Zhang, Akshay Nambi, Rujia Wang, Chetan Bansal (Microsoft Research) on April 22, 2026
Deploying large language models (LLMs) in real-world, high-stakes settings is harder than it should be. In high-stakes settings like law, medicine, and cloud incident response, performance and reliability can quickly break down because adapting models to domain-specific requirements is a slow and manual process that is difficult to reproduce. The core challenge is domain adaptation, […]
-
From Rainforests to Recycling Plants: 5 Ways NVIDIA AI Is Protecting the Planet by NVIDIA Writers (NVIDIA Blog) on April 22, 2026
Across climate, conservation, disaster monitoring and recycling, NVIDIA AI is enabling applications protecting the planet.
-
NVIDIA and Google Cloud Collaborate to Advance Agentic and Physical AI by Ian Buck (NVIDIA Blog) on April 22, 2026
NVIDIA and Google Cloud have collaborated for more than a decade, co‑engineering a full‑stack AI platform that spans every technology layer — from performance‑optimized libraries and frameworks to enterprise‑grade cloud services. This foundation enables developers, startups and enterprises to push agentic and physical AI out of the lab and into production — from agents that
-
We're launching two specialized TPUs for the agentic era. by AI on April 22, 2026
The eighth generation of Google’s TPU includes two specialized chips that will power the future of AI.
-
Partnering with industry leaders to accelerate AI transformation by Google DeepMind News on April 21, 2026
Google DeepMind partners with global consultancies to bring the power of frontier AI to organizations around the world.
-
3 new ways Ads Advisor is making Google Ads safer and faster by (AI) on April 21, 2026
Three new agentic safety and policy features integrated into Ads Advisor will help protect and streamline your Google Ads account.
-
Can we AI our way to a more sustainable world? by Doug Burger, Amy Luers, Ishai Menache (Microsoft Research) on April 20, 2026
Doug Burger, sustainability expert Amy Luers, and optimization researcher Ishai Menache examine the global emissions implications of datacenter operations, efficiency gains, and AI's potential across electrification, materials, and food systems.
-
Autonomous AI at Scale: Adobe Agents Unlock Breakthrough Creative Intelligence With NVIDIA and WPP by Richard Kerris (NVIDIA Blog) on April 20, 2026
AI agents are transforming how work gets done across all industries, accelerating everything from content creation to decision-making. NVIDIA’s expanded strategic collaborations with Adobe and WPP are bringing agentic AI to the center of enterprise marketing operations across creative production and customer experience orchestration. As demand for personalized customer experiences surges, brands require intelligent systems
-
NVIDIA and Partners Showcase the Future of AI-Driven Manufacturing at Hannover Messe 2026 by James McKenna (NVIDIA Blog) on April 20, 2026
Manufacturing is at an inflection point. Across every major industrial economy, the pressure to do more with less — due to faster design cycles, leaner operations and strain on skilled labor pools — is accelerating the shift to AI-driven production. The question is no longer whether to adopt AI, but how fast and at what
-
7 ways to travel smarter this summer, with help from Google by (AI) on April 17, 2026
The latest tools from Google can help you plan trips, find a great deal and explore your next destination.
-
A new way to explore the web with AI Mode in Chrome by (AI) on April 16, 2026
Today’s upgrades for AI Mode in Chrome transform how you interact with the web
-
New ways to create personalized images in the Gemini app by (AI) on April 16, 2026
Nano Banana 2 now uses your personal context and Google Photos to create images that reflect your unique life.
-
No Need for Space Gear — Capcom’s ‘PRAGMATA’ Joins GeForce NOW on Launch Day by GeForce NOW Community (NVIDIA Blog) on April 16, 2026
Head straight for orbit with GeForce NOW — no space helmet required. PRAGMATA, Capcom’s long-awaited sci-fi action adventure, touches down on GeForce NOW the same day it launches worldwide. The futuristic journey through a cold lunar station in the near future can be streamed instantly from the cloud to almost any device, no console or
-
Gemini 3.1 Flash TTS: the next generation of expressive AI speech by Google DeepMind News on April 15, 2026
Our newest audio model introduces granular audio tags that give you precise control to direct AI speech for expressive audio generation.
-
Rethinking AI TCO: Why Cost per Token Is the Only Metric That Matters by Shruti Koparkar (NVIDIA Blog) on April 15, 2026
Traditional data centers only stored, retrieved and processed data. In the generative and agentic AI era, these facilities have evolved into AI token factories. With AI inference becoming their primary workload, their primary output is intelligence manufactured in the form of tokens. This transformation demands a corresponding shift in how the economics of AI infrastructure,
-
Gemini 3.1 Flash TTS: the next generation of expressive AI speech by (AI) on April 15, 2026
Gemini 3.1 Flash TTS is now available across Google products.
-
Turn your best AI prompts into one-click tools in Chrome by (AI) on April 14, 2026
Skills in Chrome let you discover, save and remix AI workflows — and repeat them instantly.
-
Bringing people together at AI for the Economy Forum by (AI) on April 14, 2026
Google is bringing people together in Washington D.C. at our AI for the Economy Forum.
-
Gemini Robotics-ER 1.6: Powering real-world robotics tasks through enhanced embodied reasoning by Google DeepMind News on April 13, 2026
Gemini Robotics ER 1.6: Enhancing spatial reasoning and multi-view understanding for autonomous robotics.
-
New Future of Work: AI is driving rapid change, uneven benefits by Jaime Teevan, Sonia Jaffe, Rebecca Janssen, Nancy Baym, Siân Lindley, Bahar Sarrafzadeh, Brent Hecht, Jenna Butler, Jake Hofman, Sean Rintel (Microsoft Research) on April 9, 2026
For the past five years, the New Future of Work report has captured how work is changing. This year, the shift feels especially sharp. Previous editions have focused on technology’s role in increasing productivity by automating tasks, accelerating communication, and expanding access to information, as well as the rise of remote work. Today, generative AI
-
Ideas: Steering AI toward the work future we want by Jaime Teevan, Jenna Butler, Jake Hofman, Rebecca Janssen (Microsoft Research) on April 9, 2026
Microsoft Chief Scientist Jaime Teevan and researchers Jenna Butler, Jake Hofman, and Rebecca Janssen unpack the New Future of Work Report 2025 and explore the ideal AI-driven working world. Plus, is AI a tool or a collaborator? And why the answer matters.
-
Gemma 4: Byte for byte, the most capable open models by Google DeepMind News on April 2, 2026
Gemma 4: Our most intelligent open models to date, purpose-built for advanced reasoning and agentic workflows.
-
New ways to balance cost and reliability in the Gemini API by (AI) on April 2, 2026
Google is introducing two new inference tiers to the Gemini API, Flex and Priority, to balance cost and latency.
-
ADeLe: Predicting and explaining AI performance across tasks by Lexin Zhou, Xing Xie (Microsoft Research) on April 1, 2026
AI benchmarks report how large language models (LLMs) perform on specific tasks but provide little insight into the underlying capabilities that drive their performance. They do not explain failures or reliably predict outcomes on new tasks. To address this, Microsoft researchers in collaboration with Princeton University and Universitat Politècnica de València introduce ADeLe (AI […]
-
AsgardBench: A benchmark for visually grounded interactive planning by Andrea Tupini, Lars Liden, Reuben Tan, Yu Wang, Jianfeng Gao (Microsoft Research) on March 26, 2026
Imagine a robot tasked with cleaning a kitchen. It needs to observe its environment, decide what to do, and adjust when things don’t go as expected, for example, when the mug it was tasked to wash is already clean, or the sink is full of other items. This is the domain of embodied AI: systems
-
GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation by Sehun Jung, HyunJee Song, Dong-Hee Kim, Reuben Tan, Jianfeng Gao, Yong Jae Lee, Donghyun Kim (Microsoft Research) on March 26, 2026
Vision-language models (VLMs) use images and text to plan robot actions, but they still struggle to decide what actions to take and where to take them. Most systems split these decisions into two steps: a VLM generates a plan in natural language, and a separate model translates it into executable actions. This approach often breaks
-
Gemini 3.1 Flash Live: Making audio AI more natural and reliable by Google DeepMind News on March 26, 2026
Our latest voice model has improved precision and lower latency to make voice interactions more fluid, natural and precise.
-
Protecting people from harmful manipulation by Google DeepMind News on March 25, 2026
Google DeepMind researches AI's harmful manipulation risks across areas like finance and health, leading to new safety measures.
-
Lyria 3 Pro: Create longer tracks in more by Google DeepMind News on March 25, 2026
Introducing Lyria 3 Pro, which unlocks longer tracks with structural awareness. We’re also bringing Lyria to more Google products and surfaces.
-
Measuring progress toward AGI: A cognitive framework by Google DeepMind News on March 17, 2026
We’re introducing a framework to measure progress toward AGI, and launching a Kaggle hackathon to build the relevant evaluations.
-
From games to biology and beyond: 10 years of AlphaGo’s impact by Google DeepMind News on March 9, 2026
Ten years since AlphaGo, we explore how it is catalyzing scientific discovery and paving a path to AGI.
-
Gemini 3.1 Flash-Lite: Built for intelligence at scale by Google DeepMind News on March 3, 2026
Gemini 3.1 Flash-Lite is our fastest and most cost-efficient Gemini 3 series model yet.
-
Nano Banana 2: Combining Pro capabilities with lightning-fast speed by Google DeepMind News on February 26, 2026
Our latest image generation model offers advanced world knowledge, production ready specs, subject consistency and more, all at Flash speed.
-
Gemini 3.1 Pro: A smarter model for your most complex tasks by Google DeepMind News on February 19, 2026
3.1 Pro is designed for tasks where a simple answer isn’t enough.
-
A new way to express yourself: Gemini can now create music by Google DeepMind News on February 18, 2026
The Gemini app now features our most advanced music generation model Lyria 3, empowering anyone to make 30-second tracks using text or images.
-
Accelerating discovery in India through AI-powered science and education by Google DeepMind News on February 17, 2026
Google DeepMind brings National Partnerships for AI initiative to India, scaling AI for science and education
-
Gemini 3 Deep Think: Advancing science, research and engineeringby Google DeepMind News on February 12, 2026
Our most specialized reasoning mode is now updated to solve modern science, research and engineering challenges.
-
Accelerating Mathematical and Scientific Discovery with Gemini Deep Thinkby Google DeepMind News on February 9, 2026
Research papers point to the growing impact of Deep Think across fields
-
Project Genie: Experimenting with infinite, interactive worldsby Google DeepMind News on January 29, 2026
Google AI Ultra subscribers in the U.S. can try out Project Genie, an experimental research prototype that lets you create and explore worlds.
-
D4RT: Teaching AI to see the world in four dimensionsby Google DeepMind News on January 16, 2026
D4RT: Unified, efficient 4D reconstruction and tracking up to 300x faster than prior methods.
-
Veo 3.1 Ingredients to Video: More consistency, creativity and controlby Google DeepMind News on January 13, 2026
Our latest Veo update generates lively, dynamic clips that feel natural and engaging — and supports vertical video generation.
-
Google's year in review: 8 areas with research breakthroughs in 2025by Google DeepMind News on December 23, 2025
Google 2025 recap: Research breakthroughs of the year
-
Gemini 3 Flash: frontier intelligence built for speedby Google DeepMind News on December 17, 2025
Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.
-
Gemma Scope 2: helping the AI safety community deepen understanding of complex language model behaviorby Google DeepMind News on December 16, 2025
Open interpretability tools for language models are now available across the entire Gemma 3 family with the release of Gemma Scope 2.
-
Improved Gemini audio models for powerful voice experiencesby Google DeepMind News on December 12, 2025
-
Deepening our partnership with the UK AI Security Instituteby Google DeepMind News on December 11, 2025
Google DeepMind and UK AI Security Institute (AISI) strengthen collaboration on critical AI safety and security research
-
Strengthening our partnership with the UK government to support prosperity and security in the AI eraby Google DeepMind News on December 10, 2025
Deepening our partnership with the UK government to support prosperity and security in the AI era
-
FACTS Benchmark Suite: Systematically evaluating the factuality of large language modelsby Google DeepMind News on December 9, 2025
Systematically evaluating the factuality of large language models with the FACTS Benchmark Suite.
-
Engineering more resilient crops for a warming climateby Google DeepMind News on December 4, 2025
Scientists are using AlphaFold to strengthen a photosynthesis enzyme for resilient, heat-tolerant crops.
-
AlphaFold: Five years of impactby Google DeepMind News on November 25, 2025
Explore how AlphaFold has accelerated science and fueled a global wave of biological discovery.
-
Revealing a key protein behind heart diseaseby Google DeepMind News on November 25, 2025
AlphaFold has revealed the structure of a key protein behind heart disease
-
Google DeepMind supports U.S. Department of Energy on Genesis: a national mission to accelerate innovation and scientific discoveryby Google DeepMind News on November 24, 2025
Google DeepMind and the DOE partner on Genesis, a new effort to accelerate science with AI.
-
How we’re bringing AI image verification to the Gemini appby Google DeepMind News on November 20, 2025
-
Build with Nano Banana Pro, our Gemini 3 Pro Image modelby Google DeepMind News on November 20, 2025
-
Introducing Nano Banana Proby Google DeepMind News on November 20, 2025
-
Start building with Gemini 3by Google DeepMind News on November 18, 2025
-
We’re expanding our presence in Singapore to advance AI in the Asia-Pacific regionby Google DeepMind News on November 18, 2025
Google DeepMind opens a new Singapore research lab, accelerating AI progress in the Asia-Pacific region.
-
A new era of intelligence with Gemini 3by Google DeepMind News on November 18, 2025
-
Introducing Google Antigravityby Google DeepMind News on November 18, 2025
-
WeatherNext 2: Our most advanced weather forecasting modelby Google DeepMind News on November 17, 2025
The new AI model delivers more efficient, more accurate and higher-resolution global weather predictions.
-
SIMA 2: An Agent that Plays, Reasons, and Learns With You in Virtual 3D Worldsby Google DeepMind News on November 13, 2025
Introducing SIMA 2, a Gemini-powered AI agent that can think, understand, and take actions in interactive environments.
-
Teaching AI to see the world more like we doby Google DeepMind News on November 11, 2025
Our new paper analyzes the important ways AI systems organize the visual world differently from humans.
-
How AI is giving Northern Ireland teachers time backby Google DeepMind News on November 10, 2025
A six-month-long pilot program with the Northern Ireland Education Authority’s C2k initiative found that integrating Gemini and other generative AI tools saved participating teachers an average of 10 hours per week.
-
Mapping, modeling, and understanding nature with AIby Google DeepMind News on November 5, 2025
AI models can help map species, protect forests and listen to birds around the world
-
Accelerating discovery with the AI for Math Initiativeby Google DeepMind News on October 29, 2025
The initiative brings together some of the world's most prestigious research institutions to pioneer the use of AI in mathematical research.
-
T5Gemma: A new collection of encoder-decoder Gemma modelsby Google DeepMind News on October 25, 2025
Introducing T5Gemma, a new collection of encoder-decoder LLMs.
-
MedGemma: Our most capable open models for health AI developmentby Google DeepMind News on October 25, 2025
We’re announcing new multimodal models in the MedGemma collection, our most capable open models for health AI development.
-
Introducing Gemma 3n: The developer guideby Google DeepMind News on October 25, 2025
Gemma 3n is designed for the developer community that helped shape Gemma.
-
Gemini 2.5 Flash-Lite is now ready for scaled production useby Google DeepMind News on October 25, 2025
Gemini 2.5 Flash-Lite, previously in preview, is now stable and generally available. This cost-efficient model provides high quality in a small size, and includes 2.5 family features like a 1 million-token context window and multimodality.
-
Behind “ANCESTRA”: combining Veo with live-action filmmakingby Google DeepMind News on October 25, 2025
We partnered with Darren Aronofsky, Eliza McNitt and a team of more than 200 people to make a film using Veo and live-action filmmaking.
-
AlphaEarth Foundations helps map our planet in unprecedented detailby Google DeepMind News on October 24, 2025
New AI model integrates petabytes of Earth observation data to generate a unified data representation that revolutionizes global mapping and monitoring
-
Exploring the context of online images with Backstoryby Google DeepMind News on October 24, 2025
New experimental AI tool helps people explore the context and origin of images seen online.
-
Advanced version of Gemini with Deep Think officially achieves gold-medal standard at the International Mathematical Olympiadby Google DeepMind News on October 24, 2025
The International Mathematical Olympiad (“IMO”) is the world’s most prestigious competition for young mathematicians, and has been held annually since 1959. Each country taking part is represented by six elite, pre-university mathematicians who compete to solve six exceptionally difficult problems in algebra, combinatorics, geometry, and number theory.
-
Aeneas transforms how historians connect the pastby Google DeepMind News on October 24, 2025
Introducing the first model for contextualizing ancient inscriptions, designed to help historians better interpret, attribute and restore fragmentary texts.
-
Genie 3: A new frontier for world modelsby Google DeepMind News on October 24, 2025
Genie 3 can generate dynamic worlds that you can navigate in real time at 24 frames per second, retaining consistency for a few minutes at a resolution of 720p.
-
How AI is helping advance the science of bioacoustics to save endangered speciesby Google DeepMind News on October 24, 2025
Our new Perch model helps conservationists analyze audio faster to protect endangered species, from Hawaiian honeycreepers to coral reefs.
-
Using AI to perceive the universe in greater depthby Google DeepMind News on October 24, 2025
-
Gemini achieves gold-medal level at the International Collegiate Programming Contest World Finalsby Google DeepMind News on October 24, 2025
Gemini 2.5 Deep Think achieves breakthrough performance at the world’s most prestigious computer programming competition, demonstrating a profound leap in abstract problem solving.
-
Discovering new solutions to century-old problems in fluid dynamicsby Google DeepMind News on October 24, 2025
Our new method could help mathematicians leverage AI techniques to tackle long-standing challenges in mathematics, physics and engineering.
-
Strengthening our Frontier Safety Frameworkby Google DeepMind News on October 23, 2025
We’re strengthening the Frontier Safety Framework (FSF) to help identify and mitigate severe risks from advanced AI models.
-
Gemini Robotics 1.5 brings AI agents into the physical worldby Google DeepMind News on October 23, 2025
We’re powering an era of physical agents — enabling robots to perceive, plan, think, use tools and act to better solve complex, multi-step tasks.
-
Introducing CodeMender: an AI agent for code securityby Google DeepMind News on October 23, 2025
Using advanced AI to fix critical software vulnerabilities
-
Bringing AI to the next generation of fusion energyby Google DeepMind News on October 23, 2025
We’re partnering with Commonwealth Fusion Systems (CFS) to bring clean, safe, limitless fusion energy closer to reality.
-
Try Deep Think in the Gemini appby Google DeepMind News on October 23, 2025
We're rolling out Deep Think in the Gemini app for Google AI Ultra subscribers, and we're giving select mathematicians access to the full version of the Gemini 2.5 Deep Think model entered into the IMO competition.
-
Rethinking how we measure AI intelligenceby Google DeepMind News on October 23, 2025
Game Arena is a new, open-source platform for rigorous evaluation of AI models. It allows for head-to-head comparison of frontier systems in environments with clear winning conditions.
-
Introducing Gemma 3 270M: The compact model for hyper-efficient AIby Google DeepMind News on October 23, 2025
Today, we're adding a new, highly specialized tool to the Gemma 3 toolkit: Gemma 3 270M, a compact, 270-million parameter model.
-
Image editing in Gemini just got a major upgradeby Google DeepMind News on October 23, 2025
Transform images in amazing new ways with updated native image editing in the Gemini app.
-
VaultGemma: The world's most capable differentially private LLMby Google DeepMind News on October 23, 2025
We introduce VaultGemma, the most capable model trained from scratch with differential privacy.
-
Introducing the Gemini 2.5 Computer Use modelby Google DeepMind News on October 23, 2025
Available in preview via the API, our Computer Use model is a specialized model built on Gemini 2.5 Pro’s capabilities to power agents that can interact with user interfaces.
-
Introducing Veo 3.1 and advanced creative capabilitiesby Google DeepMind News on October 23, 2025
We’re rolling out significant updates to Veo that give people even more creative control.
-
How a Gemma model helped discover a new potential cancer therapy pathwayby Google DeepMind News on October 23, 2025
We’re launching a new 27 billion parameter foundation model for single-cell analysis built on the Gemma family of open models.
-
AlphaGenome: AI for better understanding the genomeby Google DeepMind News on June 25, 2025
Introducing a new, unifying DNA sequence model that advances regulatory variant-effect prediction and promises to shed new light on genome function — now available via API.
-
Gemini Robotics On-Device brings AI to local robotic devicesby Google DeepMind News on June 24, 2025
We’re introducing an efficient, on-device robotics model with general-purpose dexterity and fast task adaptation.
-
We’re expanding our Gemini 2.5 family of modelsby Google DeepMind News on June 17, 2025
Gemini 2.5 Flash and Pro are now generally available, and we’re introducing 2.5 Flash-Lite, our most cost-efficient and fastest 2.5 model yet.
-
Gemini 2.5: Updates to our family of thinking modelsby Google DeepMind News on June 17, 2025
Explore the latest Gemini 2.5 model updates with enhanced performance and accuracy: Gemini 2.5 Pro now stable, Flash generally available, and the new Flash-Lite in preview.
-
How we're supporting better tropical cyclone prediction with AIby Google DeepMind News on June 12, 2025
We’re launching Weather Lab, featuring our experimental cyclone predictions, and we’re partnering with the U.S. National Hurricane Center to support their forecasts and warnings this cyclone season.
-
Advanced audio dialog and generation with Gemini 2.5by Google DeepMind News on June 3, 2025
Gemini 2.5 has new capabilities in AI-powered audio dialog and generation.
-
Gemini 2.5: Our most intelligent models are getting even betterby Google DeepMind News on May 20, 2025
Gemini 2.5 Pro continues to be loved by developers as the best model for coding, and 2.5 Flash is getting even better with a new update. We’re bringing new capabilities to our models, including Deep Think, an experimental enhanced reasoning mode for 2.5 Pro.
-
Fuel your creativity with new generative media models and toolsby Google DeepMind News on May 20, 2025
Introducing Veo 3 and Imagen 4, and a new tool for filmmaking called Flow.
-
SynthID Detector — a new portal to help identify AI-generated contentby Google DeepMind News on May 20, 2025
Learn about the new SynthID Detector portal we announced at I/O to help people understand how the content they see online was generated.
-
Advancing Gemini's security safeguardsby Google DeepMind News on May 20, 2025
We’ve made Gemini 2.5 our most secure model family to date.
-
Our vision for building a universal AI assistantby Google DeepMind News on May 20, 2025
We’re extending Gemini to become a world model that can make plans and imagine new experiences by simulating aspects of the world.
-
Announcing Gemma 3n preview: Powerful, efficient, mobile-first AIby Google DeepMind News on May 20, 2025
Gemma 3n is a cutting-edge open model designed for fast, multimodal AI on devices, featuring optimized performance, unique flexibility with a 2-in-1 model, and expanded multimodal understanding with audio, empowering developers to build live, interactive applications and sophisticated audio-centric experiences.
-
AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithmsby Google DeepMind News on May 14, 2025
New AI agent evolves algorithms for math and practical applications in computing by combining the creativity of large language models with automated evaluators
-
Gemini 2.5 Pro Preview: even better coding performanceby Google DeepMind News on May 6, 2025
We’ve seen developers doing amazing things with Gemini 2.5 Pro, so we decided to release an updated version a couple of weeks early to get it into developers’ hands sooner.
-
Build rich, interactive web apps with an updated Gemini 2.5 Proby Google DeepMind News on May 6, 2025
Our updated version of Gemini 2.5 Pro Preview has improved capabilities for coding.
-
Music AI Sandbox, now with new features and broader accessby Google DeepMind News on April 24, 2025
Helping music professionals explore the potential of generative AI
-
Introducing Gemini 2.5 Flashby Google DeepMind News on April 17, 2025
Gemini 2.5 Flash is our first fully hybrid reasoning model, giving developers the ability to turn thinking on or off.
-
Generate videos in Gemini and Whisk with Veo 2by Google DeepMind News on April 15, 2025
Transform text-based prompts into high-resolution eight-second videos in Gemini Advanced and use Whisk Animate to turn images into eight-second animated clips.
-
DolphinGemma: How Google AI is helping decode dolphin communicationby Google DeepMind News on April 14, 2025
DolphinGemma, a large language model developed by Google, is helping scientists study how dolphins communicate — and hopefully find out what they're saying, too.
-
Taking a responsible path to AGIby Google DeepMind News on April 2, 2025
We’re exploring the frontiers of AGI, prioritizing technical safety, proactive risk assessment, and collaboration with the AI community.
-
Evaluating potential cybersecurity threats of advanced AIby Google DeepMind News on April 2, 2025
Our framework enables cybersecurity experts to identify which defenses are necessary—and how to prioritize them
-
Gemini 2.5: Our most intelligent AI modelby Google DeepMind News on March 25, 2025
Gemini 2.5 is our most intelligent AI model, now with thinking built in.
-
Gemini Robotics brings AI into the physical worldby Google DeepMind News on March 12, 2025
Introducing Gemini Robotics and Gemini Robotics-ER, AI models designed for robots to understand, act and react to the physical world.
-
Experiment with Gemini 2.0 Flash native image generationby Google DeepMind News on March 12, 2025
Native image output is available in Gemini 2.0 Flash for developers to experiment with in Google AI Studio and the Gemini API.
-
Introducing Gemma 3by Google DeepMind News on March 12, 2025
The most capable model you can run on a single GPU or TPU.
-
Start building with Gemini 2.0 Flash and Flash-Liteby Google DeepMind News on February 25, 2025
Gemini 2.0 Flash-Lite is now generally available in the Gemini API for production use in Google AI Studio and for enterprise customers on Vertex AI
-
Gemini 2.0 is now available to everyoneby Google DeepMind News on February 5, 2025
We’re announcing new updates to Gemini 2.0 Flash, plus introducing Gemini 2.0 Flash-Lite and Gemini 2.0 Pro Experimental.
-
How generational differences affect consumer attitudes towards adsby Meta Research on May 17, 2023
Our research study, in collaboration with CrowdDNA, aims to understand people's relationship with social media ads across different social media platforms.
-
Every tree countsby Meta Research on April 17, 2023
Meta set a goal to reach net zero emissions by 2030. We are developing technology to mitigate our carbon footprint and making it openly available.
-
How a non-traditional background led to cutting-edge XR techby Meta Research on April 14, 2023
-
A new, unique AI dataset for animating amateur drawingsby Meta Research on April 13, 2023
-
How the metaverse can transform educationby Meta Research on April 12, 2023
-
Build faster with Buck2: Our open source build systemby Meta Research on April 6, 2023
-
Announcing the 2023 Meta Research PhD Fellowship award winnersby Meta Research on April 5, 2023
-
Announcing the winners of the 2022 Foundational Integrity Research request for proposalsby Meta Research on March 27, 2023
In September, Meta launched the Foundational Integrity Research request for proposals. Today, we announce the winners of this award.
-
Two Meta sustainability grant and scholarship recipients share impactby Meta Research on March 24, 2023
