Skip to content
De volta Flows
Modelo

AI agente bot que entende texto, áudio, imagem e documentos

Automação pronta a usar que podes lançar no teu próprio número de WhatsApp em minutos. Personaliza os passos conforme as tuas necessidades.

O que este modelo faz

Create your Custom Business AI Agent that speaks, sees, listens and replies to your customers.

🚀 What this workflow does

  1. Receives any inbound WhatsApp message via a Wassenger Trigger
  2. Detects the medium – text, voice note, image or document (PDF)
  3. Processes accordingly
    • Text → straight to the AI brain
    • Voice notes → download ➜ Whisper transcription
    • Images → download ➜ GPT-4o Vision analysis
    • PDFs only → download ➜ text extraction
  4. Feeds the cleaned input + short-term memory buffer (20 turns) to an OpenAI Chat Agent (GPT-4o-mini by default)
  5. Sends the answer back through Wassenger:
    • If the user sent audio, the bot replies in audio (OpenAI TTS ➜ saves mp3 to Google Drive ➜ returns the public link).
    • Otherwise, returns plain text.
  6. Gracefully rejects anything that isn’t text, image, audio or a PDF (“Sorry, you can only send …”)

Result: a polite, context-aware concierge that can read your contract, describe your cat photo, or summarize a 3-minute rant into a single line—without ever leaving WhatsApp.

🧩 Key components

Node Purpose
Wassenger Trigger / Wassenger Receive & send WhatsApp messages
Switch → “Input type” Routes to Text / Audio / Image / Document branches
HTTP Request Securely downloads media from Wassenger
OpenAI Whisper Turns voice notes into text
GPT-4o Vision Describes images in detail
Extract From File Converts PDFs to text
LangChain Agent Central brain with custom system prompt
Memory Buffer Window Keeps the last 20 turns per chat
OpenAI TTS (“Generate Audio Response”) Converts answers to speech (voice “nova”)
Google Drive (Upload + Delete) Stores the mp3, grabs a share link, cleans up

(Sticky notes in the canvas label the four media lanes so future-you won’t get lost.)

🛠️ Prerequisites

  • Wassenger device + API key
  • OpenAI API key (chat, whisper, TTS, vision)
  • Google Drive OAuth credentials (for audio replies)

💡 Ideas & extensions

  • Pipe extracted conversation data into HubSpot or Airtable.
  • Replace GPT-4o with your on-prem model ➜ just swap the Chat node.
  • Add a Sentiment node to auto-escalate angry customers.
  • Expand document branch to Word, PowerPoint or spreadsheets.

⚖️ Limits & best-practice nudges

  • Only PDFs are accepted for now; other file types trigger a polite rejection.
  • The workflow rate-limits itself by design (single execution per message), but you may want extra guards if you point it at a large audience.
  • Delete Google Drive files after sending (already included) to keep storage costs clean.
  • Remember WhatsApp’s 24-hour customer-initiated window.

🏁 Ready, set, automate!

Import → Hit Active. Your WhatsApp number just became a futuristic, multimodal AI agent. Enjoy the peace and quiet while it handles the chatter. 😉

Automação sem código

Automatiza tudo no WhatsApp.

Usa este modelo como ponto de partida ou cria o teu do zero. O construtor do Wassenger Flows traz mais de 400 integrações prontas a usar.

  • Editor arrastar e largar com mais de 400 nós pré-criados
  • Agentes de IA dentro do fluxo — Claude, GPT, Gemini
  • Webhooks para qualquer passo personalizado que imagines
Mais

Descubra mais modelos

Encontra o ponto de partida certo para o caso de uso da tua equipa.

Modelo

AI agente chatbot com armazenamento de memória

Ver modelo
Modelo

Atribua automaticamente chats do WhatsApp a departamentos e usuários com IA

Ver modelo
Modelo

AI WhatsApp Agent: Treinamento de dados e suporte inteligente ao cliente

Ver modelo
Modelo

Whatsapp Notabatments AI Agent com integração do Google Calendar

Ver modelo
Modelo

Agente da IA ​​com armazenamento de dados de supabase

Ver modelo
Modelo

Bot de suporte do agente de WhatsApp AI de uso geral do WhatsApp

Ver modelo
Modelo

Ai agente re -marcado coere

Ver modelo
Modelo

Como construir um moderador de grupo WhatsApp com IA

Ver modelo
Modelo

Whatsapp + HubSpot Automation (CRM)

Ver modelo
Modelo

Whatsapp + Automação Slack

Ver modelo
Modelo

Bot Whatsapp que entende texto, áudio, imagens e PDFs

Ver modelo
Modelo

Social Auto-Publisher (GitHub → Social Platforms)

Ver modelo
Modelo

WhatsApp Group AI Moderator

Ver modelo
Modelo

Whatsapp Shopify Integration

Ver modelo
Modelo

Publique os mais recentes vídeos do YouTube em um canal do WhatsApp

Ver modelo