ImageToText Archives - myspace.com.mm

Integrating Image-To-Text And Text-To-Speech Models (Part 2) — Smashing Magazine

Written by News One August 30, 2024

In the second part of this series, Joas Pambou aims to build a more advanced version of the previous application that performs conversational analyses on images or videos, much like a chatbot assistant. This means you can ask and learn more about your input content. Joas also explores multimodal or any-to-any models that handle images, […]

UI / UX

Integrating Image-To-Text And Text-To-Speech Models (Part 1) — Smashing Magazine

Written by News One July 24, 2024

Joas Pambou built an app that integrates vision language models (VLMs) and text-to-speech (TTS) AI technologies to describe images audibly with speech. This audio description tool can be a big help for people with sight challenges to understand what’s in an image. But how this does it even work? Joas explains how these AI systems […]