Microsoft gives Copilot a massive redesign with a voice and vision

Microsoft is rolling out a major overhaul of its Copilot experience, introducing new voice and vision capabilities aimed at transforming it into a more personalized AI assistant. These enhancements, which were first announced in May, mark a significant step forward in AI integration for Microsoft’s users. Copilot’s new features include a virtual news presenter mode to deliver headlines, a vision component that allows it to “see” what you’re looking at, and voice interaction that enables natural conversations similar to OpenAI’s Advanced Voice Mode.

Read: A new iPhone SE is coming with Face ID and Apple Intelligence

Across mobile, web, and the dedicated Windows app, Copilot is undergoing a visual redesign, adopting a card-based interface that closely resembles the Pi AI assistant created by Inflection AI. This shift in design comes under the direction of Mustafa Suleyman, Google DeepMind cofounder and now Microsoft AI’s CEO. Suleyman, who joined Microsoft earlier this year, is spearheading this redesign, aiming to make Copilot a more accessible and user-friendly companion. In an open letter, he stated, “At Microsoft AI, we are creating an AI companion for everyone,” expressing confidence that this new era of AI will be more supportive and helpful than ever before.

The updated interface reflects a warmer, more personalized experience compared to previous versions. A Copilot Discover page now tailors suggestions, searches, and tips based on individual users’ conversation history, offering more relevant information right from the homepage. This redesign is the result of lessons learned from Inflection AI’s focus on customer needs and feedback, according to Yusuf Mehdi, Microsoft’s executive vice president and consumer chief marketing officer.

One of the major upgrades is the voice interaction feature, which allows users to engage with Copilot in a conversational manner. Copilot offers four voice options and responds naturally to interruptions or follow-up questions, mimicking human-like interactions. This voice feature enhances the assistant’s versatility, providing a more seamless experience for users looking to multitask or engage with their devices hands-free. “We’re making a huge bet on voice,” Mehdi emphasized, highlighting its potential to create a more intuitive AI interaction.

Another significant addition is Copilot Vision, enabling the AI to analyze the content users are viewing on a webpage. Whether it’s text, images, or other content, Copilot can answer questions and provide insights in real time, making it a powerful tool for web browsing and online shopping. However, Copilot Vision is an opt-in feature, and Microsoft has taken steps to ensure user privacy by restricting the types of websites it works with and not storing any content for future use.

The overhaul also introduces Copilot Daily, a personalized audio summary of news and weather updates. This feature, initially available only in the US and the UK, offers users a brief, spoken digest of the day’s headlines, pulling content from authorized sources like Reuters and the Financial Times. Additionally, Copilot’s new “Think Deeper” feature allows the AI to take its time in providing detailed, step-by-step responses to complex questions, further expanding its utility for decision-making tasks.

To encourage user feedback, Microsoft is also launching Copilot Labs, a platform for testing experimental features like Copilot Vision. The updated experience is available now across mobile apps, the web, and Windows, with voice initially offered in English-speaking regions. Microsoft is clearly positioning Copilot as a dynamic, evolving tool that will become even more personalized over time. According to Suleyman, future versions will adapt to users’ preferences and assist with everything from taking notes at doctor appointments to helping plan family events, promising a highly integrated AI experience.