Elon Musk’s artificial intelligence venture, xAI, has added powerful new capabilities to its Grok AI chatbot, enhancing how users interact with the world through their devices. The most notable update, Grok Vision, allows the AI to interpret visuals in real time via a smartphone camera — similar to image-recognition features seen in OpenAI’s ChatGPT and Google’s Gemini.
Launched on April 22, Grok Vision is now available for iOS users through the Grok app, enabling them to point their camera at objects, signs, or documents and ask related questions instantly. Android users, however, will need to wait, as the feature has not yet been rolled out on that platform.
New Features for Grok Users
In addition to visual recognition, xAI has introduced multilingual voice capabilities and real-time web search in voice mode. These upgrades are available to subscribers of the SuperGrok plan, which provides access to Grok’s most advanced tools.
Another major enhancement is Grok’s memory feature, rolled out last week for the Grok 3 model. This function enables the chatbot to recall previous conversations, helping it offer more tailored responses. For example, if a user discusses their fitness regime, Grok can use that context later to suggest a suitable diet.
Unlike traditional chatbots, Grok emphasizes memory transparency — users can view what the AI remembers and even delete specific interactions. A “Forget” option for Android users is also in the works, aimed at giving them more control over their stored data.
Introducing Grok Studio
Earlier this month, xAI launched Grok Studio, a collaborative workspace that allows users to co-create documents, develop simple apps, and even make browser-based games. Both free and paid users on Grok.com can access this feature. When working within Studio, content opens in a new window, letting users collaborate with Grok directly.
These innovations reflect xAI’s goal of pushing AI beyond text-based interactions, making it more dynamic, visual, and user-centric.