Latenode

Effortlessly Generate Realistic Audio from Text using Google Cloud, Drive & Airtable

This automation template provides a complete solution for content creators, marketers, and developers to convert text into high-quality, realistic audio using Google's advanced Text-to-Speech (TTS) AI.

The workflow handles the end-to-end process, from capturing the input text, voice, and language selection through a form trigger, to generating the audio files, storing them in Google Drive, and tracking the metadata in Airtable. By automating this traditionally tedious task, users can save countless hours and effortlessly add professional-sounding voiceovers to their video, audio, and multimedia content.

Updated Apr 6, 2026Est. run: 26sEst. cost: $0.0703
How Latenode estimates time and cost

Latenode bills workflow runs in credits: 1 credit = 30 seconds of processing. Minimum charge per run depends on your plan. Plug-and-Play (PnP) AI nodes are billed separately—each PnP token is $1 USD, charged pay-as-you-go at vendor cost plus a small processing fee, with no API keys required.

Full pricing — how credits work →
Video, audio & media

Workflow preview

What this template does

  • Captures user input text, voice, and language preferences through a form trigger
  • Generates high-quality audio using Google's advanced Text-to-Speech AI
  • Stores the audio files in Google Drive for easy access and sharing
  • Tracks the metadata of the generated audio in an Airtable base
  • Eliminates the manual effort required for creating professional-sounding voiceovers

How it works

1
Trigger

Capture Text, Voice & Language

Users submit a form to provide the text they want converted to audio, select their desired voice, and choose the language for the text-to-speech conversion.

2
Action

Convert Text to Audio

The workflow uses Google Cloud Text-to-Speech to generate high-quality, realistic audio files based on the input text, voice, and language selected by the user.

3
Action

Save Audio to Google Drive

The generated audio files are automatically uploaded and stored in the user's Google Drive account for easy access and sharing.

4
Action

Track in Airtable

Metadata about the audio files, including the original text, voice, language, and file location, is recorded in an Airtable base to keep track of the user's text-to-speech conversions.

Setup guide

1

Add Google Cloud Text-to-Speech credentials

In the Latenode Credentials panel, add a new credential for Google Cloud Text-to-Speech. Provide the necessary API key or OAuth credentials to authenticate with the Google Cloud Text-to-Speech API.

2

Configure Google Drive integration

In the Latenode Credentials panel, add a new credential for Google Drive. Provide the necessary OAuth credentials to authenticate with the Google Drive API.

3

Set up Airtable integration

In the Latenode Credentials panel, add a new credential for Airtable. Provide the necessary API key or OAuth credentials to authenticate with the Airtable API.

4

Map input fields in the Form node

In the Latenode visual builder, add a Form node and map the input fields for text, voice, and language selection.

5

Configure the Google Text-to-Speech node

In the Latenode visual builder, add a Google Text-to-Speech node. Select the appropriate credentials, input text field, voice, and language settings.

Requirements

Google Cloud Text-to-Speech API key with access to the Text-to-Speech service
Google Drive API access to create and update files
Airtable base and API key to track audio file metadata
Form or trigger to capture the input text, voice, and language preferences

FAQ

Common questions about this template

Each run uses credits on your Latenode plan. We charge for processing time (1 credit = 30 seconds). Your actual cost depends on your plan and how long the run takes. See pricing plans for plans and how credits work.

More templates

You might also like

Browse all templates →
Video, audio & media

Automate YouTube video uploads from Google Drive with AI metadata

This automation streamlines the process of uploading videos to YouTube, including transcription, metadata generation, and automatic publishing. It monitors a Google Drive folder for new video files, uses AI tools to generate optimized titles, descriptions, and tags, and then uploads the videos to a YouTube channel with the generated metadata. This solution targets content creators, marketing teams, and channel managers who want to automate the repetitive and time-consuming task of manual video uploads and metadata creation.

26s$0.0703
Video, audio & media

Sync new Google Drive videos to your YouTube channel

This automation template is designed to help users who need to regularly upload video content to YouTube. By monitoring a designated Google Drive folder for new video files, the automation will automatically upload those files to the user's connected YouTube channel as soon as they are added. This streamlines the content publishing process, saving time and ensuring timely updates to the YouTube channel.

Ns$0.0703
Video, audio & media

Automatically sync YouTube videos to cloud storage and extract transcripts

This automation template enables users to automatically extract transcripts and metadata from newly uploaded YouTube videos. It integrates with the ParsePrompt service to parse the video content and save the structured data to a designated destination, such as a cloud storage service or a database. This solution is designed for content creators, video producers, and marketing teams who need to efficiently manage and analyze their video assets. By automating the transcript and metadata extraction process, it streamlines video production workflows and ensures that valuable insights from video content are readily available for further analysis and utilization.

26s$0.0703