
I Created a Real Human Ad - No Camera, No Crew, $12 Budget!


What if you could create a complete, polished advertisement - visuals, voiceover, and full video - all for under $12?


That's exactly what I set out to prove in this project. In this blog post, I'll walk you through:


  1. The tools I used and why
  2. A cost breakdown
  3. Comparisons with OpenAI Sora and WanAI
  4. How I ultimately built the ad using Vertex AI VEO


Let's get started.



🧠 Why AI-Powered Ads Matter​


AI-powered ads are revolutionizing how we create and scale marketing content. Here's why:


  1. ⚡ Speed: Full video ads can be created in minutes
  2. 💰 Affordability: Cost is a fraction of traditional production
  3. 🔄 Scalability: Generate multiple ad variants quickly
  4. 🎥 No actors or equipment needed

With just a text prompt, these tools handle:


  1. Script generation
  2. Voice narration
  3. Visual animation


🧪 Tools Compared: Sora, WanAI, and Vertex AI


To benchmark what works best, I tested three AI platforms:


🔷 1. OpenAI Sora


Sora delivered a high-quality 8-second cinematic video based on my prompt.


  1. ✅ Smooth transitions
  2. ✅ Excellent lighting and motion
  3. ❌ Limited access
  4. ❌ No built-in voice-over

Sora can be ideal for short cinematic storytelling, but it is not suitable for complete end-to-end ad creation unless you combine it with other tools.



🔶 2. WanAI


WanAI provided an easy-to-use interface, but I hit a limitation quickly.


  1. ❌ Video generation failed
  2. ❌ No output, even after retries
  3. ⚠️ Likely due to the free-tier restriction

If you're using WanAI's free version, it may not be reliable for actual ad production.


🌐 Accessing WanAI via Alibaba Model Studio​


WanAI is part of Alibaba Cloud's generative AI offerings and can be accessed through the Model Studio interface. While not as frictionless as some Western platforms, it's still worth exploring, especially for experimentation.


🪪 Step 1: Create an Alibaba Cloud Account

  1. Visit https://www.alibabacloud.com
  2. Register for a new account (email and phone verification required)
  3. You may be prompted to complete identity verification depending on your region

🧠 Step 2: Navigate to Model Studio​

  1. After logging in, search for Model Studio from the Alibaba Cloud Console or access it directly at: https://modelstudio.console.aliyun.com
  2. Agree to terms and enable the service for your account

πŸ–ΌοΈ Step 3: Find WanAI​

  1. Inside Model Studio, explore the generative AI models section
  2. Locate WanAI or its equivalent under video generation or multi-modal AI
  3. Note: the UI is partially in Chinese; use browser translation if needed

🎬 Step 4: Provide Prompt and Generate​

  1. Use a descriptive text prompt to generate your video
  2. Wait for processing (this may take 1-2 minutes)
  3. Note: in the free tier, results may be throttled or not generated at all

⚠️ In my case, WanAI did not produce a video under the free plan, likely due to quota restrictions or runtime limits.


If you're interested in evaluating WanAI for business use, consider upgrading to a paid Alibaba Cloud subscription to unlock full capabilities.



✅ 3. Vertex AI VEO


This is where the magic happened. Vertex AI VEO allowed me to:


  1. 🎞️ Generate a high-quality 24-30 second video
  2. 🗣️ Add professional voice narration
  3. 🧱 Use slide-based or text-prompt-based generation
  4. ✅ Fully control the visuals, timing, and tone

Best of all, the total cost stayed under $12 for the complete video with voice.



📊 AI Platform Comparison


Below is a detailed comparison of the AI tools used during this ad creation journey:


| Tool | Platform | Modalities | Public Access | Strengths | Weaknesses |
| --- | --- | --- | --- | --- | --- |
| Vertex AI (VEO) | Google Cloud | Image, Video (VLOGGER), Sound (AudioLM) | Requires GCP account | One-stop shop to create audio, image, and video; transparent pricing | Requires billing setup and project configuration |
| Sora | OpenAI (Experimental) | Video only (no sound) | No (Research preview) | Photorealism, physics-aware scenes | Not yet publicly available |
| Alibaba ModelScope | Alibaba Cloud | Image, Text2Video, Voice | Public (Hugging Face / GitHub) | Open-source, wide model support | Poor video quality, less polished UI, DIY integration |


🎬 How I Built the Ad in Vertex AI VEO​


🧰 Media Studio: A Unified Interface for Generative Creativity​


One of the most powerful components of Google's AI content ecosystem is the Media Studio: a clean, intuitive interface that brings together all major generative modalities.


From a single dashboard, you can:


  1. Imagen - Generate stunning images using natural language descriptions
  2. Chirp - Produce voice-over narration with lifelike clarity
  3. Lyria - Compose custom music tracks based on mood, tone, and genre
  4. Veo - Generate high-quality, dynamic video scenes

1️⃣ Start with a Prompt or Slide Structure​


In my case, I used a text prompt generated from the Model Studio prompt generator to define a narrative sequence of four emotional scenes:


Generate a series of four images depicting a business user, a middle-aged Asian man with short, dark hair, showcasing a range of emotions. In the first image, he is depicted feeling frustrated, his brow furrowed, and his lips pursed in a grimace. He is wearing a crisp, dark blue suit and a white dress shirt, conveying a sense of professionalism. The office is a typical corporate setting, with a large window behind him overlooking a bustling city landscape, with a cool color scheme. In the second image, he is intrigued, his eyes widened slightly as he leans forward in his chair, studying some documents on his desk. The office is the same, but a warm, muted orange color scheme is implemented, with soft light filtering in. In the third image, he is captured in a moment of excitement, his arms raised in a gesture of triumph and a wide grin on his face. The office is the same, with a high-contrast color scheme and dramatic lighting. In the fourth and final image, the user is portrayed as relieved and relaxed, his shoulders slumped in a comfortable posture, and a gentle smile playing on his lips. He is wearing the same suit as in the first three images, but is now sitting on a comfy couch in a modern, minimalist living room with a calm, pastel color scheme and soft, warm ambient lighting.

You can either:


  1. Enter a prompt similar to the one above, or
  2. Use Create Prompt from the Model Studio prompt generator

2️⃣ 🖼️ Create or Upload an Image


If you already have an image of your character or scene, you can upload it to enhance, animate, or extend it using AI tools inside Media Studio.


Alternatively, you can generate a new image using Imagen, Google's generative image model.


For example, try a prompt like:

Photorealistic depiction of a middle-aged Asian business man sitting in a modern office, looking frustrated, with a city skyline visible through the window behind him

This will help you visually establish the first emotional scene for your AI-generated ad.
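
If you prefer scripting this step over the Media Studio UI, here is a minimal sketch using the google-genai Python SDK against Vertex AI. The model ID, project ID, and output filename are illustrative assumptions; check the current Imagen model list and SDK docs for your environment.

```python
from google import genai
from google.genai import types

# Assumes Application Default Credentials and a GCP project with Vertex AI enabled.
client = genai.Client(vertexai=True, project="your-project-id", location="us-central1")

response = client.models.generate_images(
    model="imagen-3.0-generate-002",  # illustrative model ID; use one available to you
    prompt=(
        "Photorealistic depiction of a middle-aged Asian business man sitting in a "
        "modern office, looking frustrated, with a city skyline visible through the "
        "window behind him"
    ),
    config=types.GenerateImagesConfig(number_of_images=1, aspect_ratio="16:9"),
)

# Save the first result locally so it can later be uploaded to VEO as an input slide.
with open("scene_1_frustrated.png", "wb") as f:
    f.write(response.generated_images[0].image.image_bytes)
```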


With links to documentation and API references at the top, Media Studio is designed for both no-code users and developers, enabling fast experimentation and production across image, voice, music, and video workflows.


Whether you're creating an ad, explainer video, music-backed clip, or voice-over narration, Media Studio serves as your creative AI command center.


Sample images I created


3️⃣ Generate Video​


βš™οΈ VEO 2 Configuration and Output Setup​


To generate the final video outputs, I used VEO 2 β€” the latest iteration of Vertex AI's video generation model. Here's how I configured it for optimal results:


  1. Model: VEO 2
  2. Aspect Ratio: 16:9 (landscape, suitable for YouTube and web ads)
  3. Number of Results: 4 (to get multiple variations for creative flexibility)
  4. Video Length: 8 seconds per scene (ideal for short ad segments)

I also specified a Google Cloud Storage (GCS) output path for storing the generated videos:


gs://[your-bucket-name]/ads/


Storing outputs in GCS helped ensure persistence, easy access, and safe backup of all video assets during editing and review.


Finally, I enabled the Prompt Enhancement feature, which uses an LLM to automatically rewrite and enrich prompts for better video quality and fidelity. This dramatically improved the expressiveness and alignment of the output with my original creative intent.
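
Media Studio exposes all of these settings in the UI, but the same configuration can also be expressed in code. Below is a minimal sketch using the google-genai Python SDK; the model ID (veo-2.0-generate-001), project ID, and bucket name are assumptions, and exact config field names can vary between SDK versions, so treat it as illustrative rather than the precise workflow I used.

```python
import time

from google import genai
from google.genai import types

client = genai.Client(vertexai=True, project="your-project-id", location="us-central1")

# Mirror the Media Studio settings: 16:9, four variants, 8-second clips,
# GCS output, and prompt enhancement enabled.
operation = client.models.generate_videos(
    model="veo-2.0-generate-001",  # illustrative model ID
    prompt="A small business user is frustrated at the speed and availability of his website.",
    config=types.GenerateVideosConfig(
        aspect_ratio="16:9",
        number_of_videos=4,
        duration_seconds=8,
        output_gcs_uri="gs://your-bucket-name/ads/",  # replace with your bucket
        enhance_prompt=True,  # the Prompt Enhancement toggle
    ),
)

# Video generation is a long-running operation; poll until it finishes.
while not operation.done:
    time.sleep(20)
    operation = client.operations.get(operation)

for generated in operation.result.generated_videos:
    print(generated.video.uri)  # GCS URIs of the generated clips
```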


🧩 Prompt-to-Image: From Concept to VEO Input​


After generating the composite image showing the four emotional expressions using the earlier prompt, I downloaded it and used a basic image editor to split it into four separate images, each representing one distinct emotion:


  1. Frustrated
  2. Intrigued
  3. Excited
  4. Relieved

I uploaded these images one by one into Vertex AI VEO, using them as input slides. For each image, I crafted a specific narrative prompt to guide the video generation:


  1. Frustrated:
A small business user is frustrated at the speed and availability of his website, despite spending an exorbitant amount.
  2. Intrigued:
The same user is now intrigued after discovering a new cloud-based website platform that promises better speed, uptime, and affordability.
  3. Excited:
He is excited and overjoyed after seeing his website go live instantly, with blazing fast performance and zero downtime.
  4. Relieved:
Now relaxed and smiling, he's enjoying peace of mind, knowing that his online business is running smoothly and cost-effectively.

This scene-by-scene approach, pairing each image with its own targeted contextual prompt, allowed VEO to sequence the emotional narrative visually and thematically, producing a seamless, emotionally resonant ad: visual storytelling powered entirely by AI.


Choose:

  1. Aspect ratio: 16:9
  2. Duration: 24-30 seconds
  3. Resolution: 720p or 1080p

Within 1-2 minutes, VEO generated a fluid video matching my structure.


4️⃣ Add Voice-Over​


Now that the video was created, I needed to add a voice-over, so I used Vertex AI's built-in voice generation:

  1. Language: English or multilingual narration, depending on your ad
  2. Voice: Choose a male or female voice that matches your character
  3. Output: Clean, clear, and professional

You can review and re-generate multiple options, then export the audio.
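
The voice step above is entirely point-and-click in Media Studio. If you want a scriptable equivalent, Google Cloud's Text-to-Speech API covers the same ground; the sketch below is an assumption about how you might reproduce it (the narration line and voice name are placeholders, not what I used).

```python
from google.cloud import texttospeech

client = texttospeech.TextToSpeechClient()

response = client.synthesize_speech(
    input=texttospeech.SynthesisInput(
        text="Tired of slow, unreliable hosting? Launch your website in minutes."  # placeholder script
    ),
    voice=texttospeech.VoiceSelectionParams(
        language_code="en-US",
        name="en-US-Neural2-D",  # example voice; pick one that matches your character
    ),
    audio_config=texttospeech.AudioConfig(
        audio_encoding=texttospeech.AudioEncoding.MP3
    ),
)

# Save the narration so it can be imported into CapCut alongside the video clips.
with open("voiceover.mp3", "wb") as f:
    f.write(response.audio_content)
```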


5️⃣ 🎵 Background Music Generation (Optional)


To enhance the emotional tone of the ad, I also generated background music using AI.


I used the following prompt to create the audio:


A melodious soft music required for advertisement where a light music plays.

The result was a gentle, unobtrusive melody that blended perfectly with the voice-over and visuals, helping to elevate the overall ad experience without overpowering the narration.


🎞️ Editing the Final Ad in CapCut​


Once all the individual video clips were generated by VEO and downloaded, I moved to the final phase of production: editing and assembling the ad in CapCut.


CapCut is a free and user-friendly video editor that offers a wide range of tools to polish raw footage into a professional-quality video.


🪜 Step-by-Step: Editing AI-Generated Videos in CapCut


1️⃣ Launch CapCut and Start a New Project​


  1. Open CapCut on your desktop or mobile device
  2. Click “New Project”
  3. Drag and drop all your downloaded video clips into the timeline

2️⃣ Arrange the Video Sequence​


  1. Organize the clips in the intended narrative flow:
    1. Frustrated scene
    2. Intrigued scene
    3. Excited scene
    4. Relieved scene
  2. Trim any silent lead-in or fade-out to keep the pacing tight

3️⃣ Add Transitions​


  1. Insert subtle transitions (e.g., fade, slide, or zoom) between clips
  2. This creates smoother scene shifts and a more cohesive visual flow

4️⃣ Import and Sync Voice-Over​


  1. Import the AI-generated voice-over audio file
  2. Drag it to the audio track in the timeline
  3. Adjust timing and sync it with the visuals so that emotional cues align

5️⃣ Add Background Music​


  1. Import the AI-generated background music file, or pick a track from Canva's or CapCut's built-in audio assets
  2. Lower the background music volume (e.g., 30-40%) so it complements the voice-over
  3. Apply fade-in and fade-out to ensure the music blends smoothly

6️⃣ Overlay Text or Branding (Optional)​


  1. Add on-screen text, a brand logo, or a call-to-action (CTA) at the end:
    • “Visit now”
    • “Start your website in minutes”
    • “Powered by AI”

7️⃣ Export the Final Video​


  1. Set export resolution to 1080p for best quality
  2. Click Export
  3. Save your finished ad for distribution or upload to your preferred platform


Using CapCut gave me complete control over how the scenes, voice, and music came together, transforming AI-generated assets into a polished, emotionally compelling advertisement ready for publishing.



💵 Cost Estimation: How I Calculated the $12 Budget


One of the key takeaways from this experiment was demonstrating that a professional-quality ad can be produced for under $12, using only AI-powered tools: no actors, no cameras, no editing studio.


Here's how I calculated the total cost:


πŸ” Reference Sources​


To estimate costs accurately, I referred to:


  1. Yahoo Finance coverage of Google's VEO pricing
  2. Google Cloud's official pricing calculator

📊 Breakdown of Costs


| Item | Estimated Cost |
| --- | --- |
| VEO video generation (24-30 s at 720p-1080p) | ~$7.50-$8.00 |
| Voice-over generation (via Chirp or custom TTS) | ~$2.00-$2.50 |
| Background music (via Lyria) | ~$0.50 |
| GCS storage (temporary) | ~$0.10 |
| Editing (CapCut - Free Tier) | $0.00 |
| Total | ~$10-12 USD |

🧠 Note: Costs are based on prompt complexity, video duration, resolution, and number of generated outputs. For most short-form ads under 30 seconds, this range is a realistic budget using cloud-based generative AI.
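
As a quick sanity check, here is the same line-item math expressed in a few lines of Python, using the midpoints of the estimates in the table above (these are my rough figures, not official prices):

```python
# Midpoint estimates from the cost table above, in USD.
costs = {
    "VEO video generation (24-30 s)": 7.75,
    "Voice-over generation": 2.25,
    "Background music (Lyria)": 0.50,
    "GCS storage (temporary)": 0.10,
    "Editing (CapCut, free tier)": 0.00,
}

total = sum(costs.values())
print(f"Estimated total: ${total:.2f}")  # ~$10.60, comfortably under the $12 budget
```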



📈 Final Thoughts


Creating professional ads used to take:

  1. Studios
  2. Actors
  3. Editing software
  4. Weeks of effort

Now, with tools like Vertex AI VEO, you can create powerful, production-ready ads in minutes, for under $12.


Ready to create your own AI ad? Here's what to do:


  1. Sign up for Vertex AI

  2. Use my prompt templates above

  3. Experiment with different voices and music

  4. Share your results!


📎 Key Takeaways


  1. 🎯 Use Sora if you have access and need cinematic shots
  2. ⚠️ Avoid relying on WanAI's free tier for production work
  3. ✅ Vertex AI VEO is the best choice for full, affordable ad creation with voice-over and visuals

Ready to create your own ad? Let me know, and I'll share a downloadable template and workflow to get you started!




Call to Action​


Choosing the right platform depends on your organization's needs. Subscribe to our newsletter for insights on cloud computing, practical tips, and the latest trends in technology, or follow our video series on cloud comparisons.


Interested in creating ads without the high agency cost? Contact us and we'll be glad to help you not only build professional ads but also market them across all platforms.

Train and Deploy an AutoML Image Classification Model with Vertex AI


This tutorial guides you through the process of using Vertex AI to train and deploy an AutoML Image Classification model. You'll use a public flower image dataset, train the model using AutoML, evaluate its performance, deploy it to an endpoint, and send predictions, all from the Google Cloud Console.



🧱 Step 1: Set Up Your Google Cloud Project​


Select or Create a Project​


  1. Visit the Google Cloud Console project selector.
  2. Choose an existing project or create a new one for this tutorial.

💡 Tip: If you don't plan to retain the resources, create a new project so cleanup is easier.


Open Cloud Shell​


Click the Activate Cloud Shell button in the top-right of the console. Once ready, set your project ID:


gcloud config set project PROJECT_ID
export projectid=PROJECT_ID
echo $projectid

Replace PROJECT_ID with your actual project identifier.



🔧 Step 2: Enable Required APIs


Run the following command in Cloud Shell:


gcloud services enable \
  iam.googleapis.com \
  compute.googleapis.com \
  notebooks.googleapis.com \
  storage.googleapis.com \
  aiplatform.googleapis.com

πŸ” Step 3: Set IAM Permissions​


Grant the necessary IAM roles:


gcloud projects add-iam-policy-binding PROJECT_ID \
  --member="user:your-email@example.com" \
  --role="roles/aiplatform.user"

gcloud projects add-iam-policy-binding PROJECT_ID \
  --member="user:your-email@example.com" \
  --role="roles/storage.admin"

These permissions allow you to use Vertex AI and access Cloud Storage for dataset management.


πŸ—‚οΈ Step 4: Create and Import Image Dataset​


Open Vertex AI > Datasets​


  1. Click Create Dataset.
  2. Select Image as the data type.
  3. Choose Image Classification (Single-label).
  4. Set region to us-central1.
  5. Enter a name for the dataset (optional).
  6. Click Create.

Import Data from Cloud Storage​


Use this public CSV containing image URIs and labels:


gs://cloud-samples-data/ai-platform/flowers/flowers.csv

This CSV includes rows like:


gs://cloud-samples-data/ai-platform/flowers/daisy/10559679065_50d2b16f6d.jpg,daisy
gs://cloud-samples-data/ai-platform/flowers/dandelion/10828951106_c3cd47983f.jpg,dandelion

📌 You'll see a preview of your images and labels after import completes (takes a few minutes).
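
If you'd rather script this step than click through the Console, the google-cloud-aiplatform Python SDK can create the dataset and import the same CSV in one call. A minimal sketch, assuming the project and region match the tutorial:

```python
from google.cloud import aiplatform

aiplatform.init(project="PROJECT_ID", location="us-central1")

# Create the image dataset and import the public flowers CSV in a single call.
dataset = aiplatform.ImageDataset.create(
    display_name="flowers",
    gcs_source="gs://cloud-samples-data/ai-platform/flowers/flowers.csv",
    import_schema_uri=aiplatform.schema.dataset.ioformat.image.single_label_classification,
)

print(dataset.resource_name)  # projects/.../locations/us-central1/datasets/...
```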



🧠 Step 5: Train AutoML Model​


  1. Go to the Models section.
  2. Click Train new model.
  3. Select:
    1. Training method: AutoML
    2. Target dataset: Your image dataset
  4. (Optional) Name your model.
  5. Define training options:
    • Training budget: 8 node hours
    • Enable Incremental Training: Only if you have a base model
  6. Click Start training.

📬 You'll get an email notification when training finishes (can take several hours).
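
The Console flow above corresponds to an AutoMLImageTrainingJob in the Python SDK. A rough sketch, assuming the dataset from Step 4 (note that an 8 node-hour budget is expressed as 8,000 milli node hours):

```python
from google.cloud import aiplatform

aiplatform.init(project="PROJECT_ID", location="us-central1")

# Reference the image dataset created in Step 4 by its numeric ID.
dataset = aiplatform.ImageDataset("DATASET_ID")

job = aiplatform.AutoMLImageTrainingJob(
    display_name="flowers-automl",
    prediction_type="classification",
    multi_label=False,  # single-label classification, as in the Console walkthrough
)

# Blocks until training completes; this can take several hours.
model = job.run(
    dataset=dataset,
    model_display_name="flowers-classifier",
    budget_milli_node_hours=8000,  # 8 node hours
)
```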


📈 Step 6: Evaluate Model


Navigate to the Evaluate tab of your model:


  • View key metrics:
    • Precision
    • Recall
    • Confusion Matrix
  • Analyze false positives, false negatives, and true positives.
  • Use Review Similar Images to find label inconsistencies or outliers.

πŸ” Fix incorrect labels and re-train if needed to improve accuracy.


🚀 Step 7: Deploy Model to Endpoint


  1. Go to Deploy & Test tab.
  2. Click Deploy to Endpoint.
  3. Choose:
    • Create New Endpoint
    • Name: image-classification
    • Traffic Split: 100%
    • Compute Nodes: 1
  4. Click Deploy.

Deployment takes a few minutes.
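
Deployment can likewise be done from the SDK; here is a sketch that mirrors the Console choices (a new endpoint named image-classification, 100% traffic, one compute node):

```python
from google.cloud import aiplatform

aiplatform.init(project="PROJECT_ID", location="us-central1")

model = aiplatform.Model("MODEL_ID")

# Create the endpoint, then deploy the model onto it with a single node.
endpoint = aiplatform.Endpoint.create(display_name="image-classification")
model.deploy(
    endpoint=endpoint,
    traffic_percentage=100,
    min_replica_count=1,
    max_replica_count=1,
)

print(endpoint.resource_name)
```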


🧪 Step 8: Send a Prediction


After deployment:


  1. Go to Deploy & Test > Test your model.
  2. Click Upload Image.
  3. Choose a local image.
  4. View the predicted label and confidence.
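
Outside the Console, the deployed endpoint accepts base64-encoded image bytes. The instance and parameter shapes below follow the AutoML image classification prediction schema; treat this as a sketch and verify the schema for your model version.

```python
import base64

from google.cloud import aiplatform

aiplatform.init(project="PROJECT_ID", location="us-central1")

endpoint = aiplatform.Endpoint("ENDPOINT_ID")  # the image-classification endpoint

# AutoML image models expect base64-encoded image bytes in a "content" field.
with open("my_flower.jpg", "rb") as f:
    encoded_image = base64.b64encode(f.read()).decode("utf-8")

prediction = endpoint.predict(
    instances=[{"content": encoded_image}],
    parameters={"confidenceThreshold": 0.5, "maxPredictions": 5},
)

print(prediction.predictions)  # predicted labels with confidence scores
```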

🧹 Step 9: Clean Up Resources​


To avoid charges, delete the resources:

Undeploy Model​


Vertex AI > Models > Deploy & Test > Undeploy

Delete Endpoint​


Vertex AI > Endpoints > Delete image-classification

Delete Model and Dataset


Vertex AI > Models > Delete
Vertex AI > Datasets > Delete

Delete Cloud Storage Bucket​


Go to Cloud Storage and delete your bucket.
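
The Vertex AI resources can also be cleaned up from the SDK (the Cloud Storage bucket still needs to be removed separately); a minimal sketch:

```python
from google.cloud import aiplatform

aiplatform.init(project="PROJECT_ID", location="us-central1")

# Undeploy the model and delete the endpoint.
endpoint = aiplatform.Endpoint("ENDPOINT_ID")
endpoint.undeploy_all()
endpoint.delete()

# Delete the model and the dataset.
aiplatform.Model("MODEL_ID").delete()
aiplatform.ImageDataset("DATASET_ID").delete()
```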


📚 Summary


  • ✅ Created dataset with labeled images
  • ✅ Trained an AutoML image classification model
  • ✅ Evaluated model performance
  • ✅ Deployed to a live endpoint
  • ✅ Sent predictions and cleaned up

Vertex AI simplifies machine learning workflows with a UI-based AutoML approach. Stay tuned for more advanced tutorials on custom training, multi-class classification, and model explainability.


πŸ’‘ "Teaching AI has never been easier. With Vertex AI, all you need is your data."