xAI’s Grok chatbot can now ‘see’ the world around it

xAI’s Grok chatbot can now answer questions about what’s in view of your smartphone’s camera, similar to real-time vision features available for Google’s Gemini and ChatGPT.
On Tuesday, xAI announced the launch of Grok Vision, which lets users point their phone at objects like products, signs, and documents and ask questions about them. Grok Vision is accessible from the Grok app for iOS, but not the Grok Android app just yet.
GROK CAN SEE WHAT YOU SEE—LITERALLY
Grok’s voice mode comes with camera access, letting users point their phone at something and ask, “What am I looking at?”
The Vision feature on iOS allows the chatbot to analyze real-world objects, text, and environments through your… pic.twitter.com/N1b6pcYZOi
— Mario Nawfal (@MarioNawfal) April 20, 2025
Other new capabilities launching for Grok today include multilingual audio and real-time search in Grok’s voice mode. Grok users on Android can tap those, but only if they’re subscribed to xAI’s $30-per-month SuperGrok plan.
Introducing Grok Vision, multilingual audio, and realtime search in Voice Mode. Available now.
Grok habla español
Grok parle français
Grok Türkçe konuşuyor
グロクは日本語を話す
ग्रोक हिंदी बोलता है pic.twitter.com/lcaSyty2n5— Ebby Amir (@ebbyamir) April 22, 2025
Grok has been gaining new features at a steady clip. Earlier this month, xAI added a “memory” component to Grok that lets the bot pull on details from past conversations. Grok also got a canvas-like tool for creating docs and apps.