🚨 ChatGPT can now see, hear, and speak 🤯

and free productivity tools

Hello, AI buddies

Thank you for joining us in this edition of the AI Chronicles, your go-to source for all things AI.

This Week in AI

  • ChatGPT can now see, hear, and speak, giving users a more intuitive interface: they can talk to ChatGPT or show it what they mean.

  • Getty Images collaborates with Nvidia to unveil Generative AI by Getty Images, a tool that creates images using a model trained on Getty's vast photo library.

  • Spotify, in collaboration with OpenAI, plans to clone podcasters' voices and translate their episodes into multiple languages for seamless international content.

  • Amazon accelerates its AI ambitions with an investment of up to $4 billion in Anthropic, intensifying competition in the AI-driven cloud sector.

🤯 ChatGPT can now See, Hear, and Speak

ChatGPT can now process images and respond in a human-like voice, offering a multimodal experience alongside text.

  • Voice Interaction: Engage in interactive conversations with ChatGPT using voice, accessible through the New Features section in Settings, featuring a choice of five distinct voices. Explore voice samples for a preview.

  • Image Interaction: Upload or capture images and seek answers from ChatGPT, spanning various queries from troubleshooting appliances to interpreting graphs. Utilize the drawing tool to pinpoint specific image details and even upload documents containing text and images.

These enhancements will roll out to ChatGPT Plus and Enterprise users over the next two weeks, with future access planned for developers. Here’s a quick video on these updates.

OpenAI prioritizes gradual deployment to ensure the safety and continuous improvement of advanced models encompassing voice and vision technologies.

Voice Technology: The new voice technology can craft realistic synthetic voices and holds potential for creative and accessibility-focused applications. To address potential risks, OpenAI is deploying it specifically for voice chat and collaborating with trusted partners like Spotify on innovative applications such as Voice Translation in podcasts.

Vision-Based Models: Vision capabilities come with unique challenges, and thorough testing has been conducted to mitigate risks. Vision is designed to assist users in their daily lives, taking inspiration from OpenAI's work with Be My Eyes.

Transparency and Limitations: OpenAI is being transparent about model limitations, advising against higher-risk use cases and cautioning users that ChatGPT is less proficient in some non-English languages. Feedback from real-world usage will help refine safeguards while maintaining utility.

🦾 AI-Tools of the Week

Today we are bringing you a curated list of free AI-powered tools designed to supercharge your productivity.

Explore these innovative solutions that harness the power of artificial intelligence to streamline tasks, boost efficiency, and make your work smarter and more effective.

Share what you create with these tools with us on Twitter.

🐦 Meme of the Week

📍Get daily updates on LinkedIn.

🧠 AI Fun Fact

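One of the first AI expert systems, DENDRAL, was started in 1965 at Stanford by a team that included Edward Feigenbaum, Joshua Lederberg, Bruce Buchanan, and Carl Djerassi. It helped chemists identify unknown organic molecules from their mass spectra.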

Stay informed, stay curious, and together, let's unlock the full potential of AI for a better tomorrow!

See you next week 🥳