I Built an AI Agent to Grow My Social Media — Free Guide

The Big Picture

What This Agent Actually Does

I named my agent Chloe. She has one job: help me grow on social media. Here’s what she does every single day without me touching anything:

Every night: Chloe scrapes top AI creators on Instagram and TikTok using Apify. She pulls their newest posts, views, engagement, and captions. She knows what topics are trending, what hooks are getting views, and what formats are working right now.

Every night: She also pulls the latest AI and tech news from Anthropic, OpenAI, Google, TechCrunch, The Verge, Reddit, and more — so my content is always relevant to what’s happening that day.

Every night: She pulls MY Instagram and TikTok data. She uses Whisper to transcribe my videos and review my scripts and hooks. She uses GPT-4o Vision to analyze my thumbnails. And she tracks every single video over time — day-over-day views, likes, comments, shares, saves — so she knows which videos kept growing weeks later and which ones died after day one.

Every morning: I wake up to a Telegram message from Chloe with 3-5 content ideas. Each one has a hook, the best format to film it in, and a confidence score based on what’s actually performing on my account. If an idea works best as a carousel, she builds the whole thing — every slide, caption, and design direction.

Every Sunday: She sends me a full weekly report. What performed, what flopped, which hooks worked, what patterns are emerging, and exactly what to change.

Heads Up

This is a more technical build than my usual guides. You’ll need an OpenClaw account, an AWS EC2 instance (or any Linux server), API keys for Apify and OpenAI, and a Telegram account. If you’ve never set up a server before, this is doable but expect a learning curve. I’ll walk you through every step.

Here’s every tool involved and what it does:

• OpenClaw — The agent framework. This is where your agent lives, thinks, and runs. It manages the agent’s identity, memory, skills, and scheduled tasks (crons). Think of it as the brain and the operating system.

• AWS EC2 — A cloud server (Ubuntu Linux) where OpenClaw runs 24/7. Your agent needs a computer that’s always on. An EC2 instance costs ~$5-15/month for a small one. You can also use DigitalOcean, Linode, or any VPS — anything that runs Ubuntu.

• Apify — A web scraping platform with pre-built scrapers (“actors”) for Instagram and TikTok. Your agent calls the Apify API to pull public post data — views, likes, comments, captions, timestamps. Free tier gives you enough to start.

• OpenAI Whisper — Transcribes your video audio into text. This is how Chloe reads your scripts and extracts your hooks. Costs about $0.006 per minute of audio — pennies.

• OpenAI GPT-4o Vision — Analyzes your video thumbnails. Scores them on scroll-stopping power, text overlay effectiveness, composition, and energy. About $0.01-0.03 per image.

• Telegram — How your agent talks to you. Chloe sends daily ideas, weekly reports, and on-demand responses as Telegram DMs. You can message her back anytime to brainstorm, analyze, or adjust.

• Claude (Anthropic) — The actual AI powering the thinking. OpenClaw uses Claude under the hood for all the analysis, idea generation, and conversation.

What This Costs

AWS EC2: ~$5-15/month. Apify: Free tier for basic scraping, ~$49/month for heavier use. OpenAI (Whisper + Vision): ~$2-5/month depending on how many videos you analyze. OpenClaw: Check openclaw.com for current pricing. Telegram: Free. Total: Roughly $15-40/month for a fully automated social media AI agent running 24/7. That’s less than one month of most social media management tools.

Get an AWS EC2 Instance Running

1. Go to aws.amazon.com → Create an account (or sign in).
2. Go to EC2 → Launch Instance.
3. Choose Ubuntu Server 24.04 LTS (free tier eligible for t2.micro — enough to start).
4. Instance type: t2.small or t3.small recommended (~$8-15/month). t2.micro works but is tight on memory.
5. Create a key pair (download the .pem file — you’ll need this to SSH in). Store it somewhere safe.
6. Security group: Allow SSH (port 22) from your IP. Allow outbound traffic on all ports (your agent needs to reach APIs).
7. Launch the instance. Note the public IP address.

SSH into your server:

ssh -i your-key.pem ubuntu@YOUR-EC2-IP

Install OpenClaw

Once you’re SSH’d into your server, follow the OpenClaw installation instructions at openclaw.com. OpenClaw manages your agent’s identity, memory, skills, and cron jobs. After installation, you’ll have a workspace directory where your agent lives — usually at /home/ubuntu/.openclaw/agents/.

Set Up Telegram

1. Open Telegram → search for @BotFather → send /newbot.
2. Name your bot (e.g., “Chloe”) and choose a username.
3. BotFather gives you a bot token — save this. You’ll configure it in OpenClaw.
4. Message your new bot on Telegram to start a conversation. Note your chat ID (OpenClaw docs explain how to find this).
5. Configure the bot token and chat ID in OpenClaw’s settings so your agent can send you messages.

Now your agent can DM you directly on Telegram — and you can message it back anytime for on-demand analysis, ideas, or questions.

Store Your API Keys

Create a centralized credentials file on your server. You’ll need:

• APIFY_API_TOKEN — From apify.com → Settings → Integrations → API Token
• OPENAI_API_KEY — From platform.openai.com → API Keys
• ANTHROPIC_API_KEY — Configured through OpenClaw

Store these in your OpenClaw credentials directory so your scripts and agent can access them securely.

This is the engine. Three scripts that run every night to collect and analyze your data — automatically, for pennies.

Script 1: Pull Social Data (Apify)

This bash script calls the Apify API to pull your latest Instagram and TikTok post data. It uses two pre-built Apify actors:

• Instagram: apify/instagram-profile-scraper
• TikTok: clockworks/free-tiktok-scraper

The script pulls up to 30 posts per platform per run (hard limit to keep costs at zero on Apify’s free tier). It saves the raw JSON to a data/raw/ folder with the date in the filename.

Then it runs a Python script that parses the raw JSON and updates a master content tracker — a single JSON file that stores every video with daily metric snapshots. This is how your agent knows which videos are still growing vs. which ones flatlined.

Why Daily Snapshots Matter

Most analytics tools show you a video’s total views. But total views don’t tell you if a video is still growing or if it peaked on day one. By capturing a snapshot every day, your agent sees the growth curve. A video with 5,000 views that gained 3,000 of them in the last 3 days is performing completely differently than one that got 5,000 views in the first hour and hasn’t moved since. This changes your strategy.

Script 2: Analyze New Videos (Whisper + Vision)

For every NEW video that hasn’t been analyzed yet, this script does two things:

1. Transcribes the audio with OpenAI Whisper. It downloads the video temporarily, sends the audio to Whisper, gets back a full transcript with timestamps, and extracts the hook (first 5 seconds of speech). This is how your agent knows exactly what you said and how you opened every video. Cost: ~$0.006 per minute.

2. Analyzes the thumbnail with GPT-4o Vision. It sends the thumbnail URL to Vision and gets back a 1-10 score on five dimensions: scroll-stopping power, text overlay effectiveness, creator energy/expression, composition quality, and overall impact. It also gives specific notes on what’s working and what could be improved. Cost: ~$0.01-0.03 per image.

Transcripts are saved in data/transcripts/ and vision analyses in data/vision/, indexed by video ID. The content tracker gets updated with analyzed: true so each video is only processed once.

Script 3: The Content Tracker (Python)

This is the source of truth. A Python script that parses Instagram and TikTok data (they have different JSON formats), maintains a master JSON file with every video, and appends daily metric snapshots to each entry. The structure per video looks like this:

• Video ID, platform, URL, posted date, caption
• Hook text (extracted from Whisper transcript)
• Thumbnail analysis scores (from GPT-4o Vision)
• Transcript summary
• Array of daily snapshots: date, views, likes, comments, shares, saves

This means your agent can answer questions like: “Which of my videos are still gaining views after 7 days?” “Which hooks got the most saves?” “What thumbnails scored highest on scroll-stopping power?”

Cost Optimization

The scripts themselves are bash and Python — they cost $0 in LLM tokens. The only costs are Apify API calls (free tier), Whisper ($0.006/min — only on new videos), and Vision ($0.01-0.03/image — only on new thumbnails). Your agent’s daily ideas and weekly report are the only cron jobs that use agent tokens. Total daily cost: well under $1.

Cron jobs are scheduled tasks that run automatically at set times. OpenClaw has its own cron system — you use openclaw cron create to set them up (NOT system crontab).

Here are the three crons that power everything:

Cron 1: Daily Data Pull — 7:00 AM UTC

Type: Bash script
What it runs: Your data collection script + analysis script back-to-back
What happens: Apify pulls your latest Instagram/TikTok data. The tracker updates with new snapshots. Whisper transcribes any new videos. Vision analyzes any new thumbnails. All data is ready before your agent wakes up.
Cost: ~$0 (bash) + pennies for Whisper/Vision on new videos only

Cron 2: Daily Content Ideas — 7:30 AM UTC

Type: Agent turn (this one uses Claude)
What happens: Your agent reads the freshly pulled data, checks what’s trending in AI news, reviews your recent performance, avoids repeating ideas from the past 7 days, and generates 3-5 content ideas. Each idea includes: the concept, a hook (specific opening line), why it matters to your audience, the best format (talking head, tutorial, carousel, reaction), and a confidence score. Delivered to you via Telegram DM.
Cost: ~$0.05 per run (one Claude agent turn)

Cron 3: Weekly Deep Report — Sunday 8:00 AM UTC

Type: Agent turn
What happens: Your agent reads ALL your video data, analyzes growth curves, identifies your top performers and worst performers, finds patterns in hooks/formats/topics/posting times, and compiles a comprehensive weekly report. It also updates a content playbook — a knowledge file that compounds your learnings over time. This means your agent gets smarter every single week.
Cost: ~$0.10 per run

The Compounding Playbook

This is the secret weapon. Every Sunday, your agent updates a markdown file called the content playbook with everything it’s learned: which hook patterns get the most saves, which formats have the longest shelf life, which topics peak fast vs. which ones grow slowly, which posting times work best for your account. After 3 months, this file is an incredibly detailed instruction manual for exactly what works for YOU — not generic social media advice, but data-backed intelligence from your own account.

An OpenClaw agent is defined by a set of markdown files in its workspace. These files tell the agent who it is, what it knows, how it should behave, and what tools it has access to. Here are the key files you need to create:

• AGENTS.md — The master operating instructions. This is the most important file. It defines: the agent’s role, its cron schedule, what tools and scripts it has access to, cost controls, formatting rules, and behavioral guidelines. Think of it as the employee handbook.

• SOUL.md — The agent’s personality and communication style. How it talks to you, what tone it uses, whether it’s casual or formal. I made Chloe direct, data-driven, and a little bit sassy.

• USER.md — Everything about YOU. Your niche, your audience, your content pillars, your brand voice, your goals. This is how the agent knows to filter everything through your specific lens.

• MEMORY.md — Long-term learnings that compound over time. Your content pillars, proven hook patterns, audience insights, monetization context. The agent updates this as it learns.

• TOOLS.md — Technical configuration: API keys, server environment, file paths. The agent references this to know where its data lives and what APIs it can call.

Below is a starter template for the most critical file — your agent’s operating instructions. Copy and customize this for your own niche and accounts.

Template — Agent Operating Instructions (AGENTS.md)

AGENT NAME: [YOUR AGENT NAME — e.g. "Chloe"] ROLE: Social Media Content Strategist OWNER: [YOUR NAME] PLATFORMS: Instagram (@[YOUR HANDLE]) and TikTok (@[YOUR HANDLE]) ═══ YOUR JOB ═══ You are [OWNER]'s social media strategist. Your ONE job is to help them grow their social media accounts by: 1. Researching what's working in their niche RIGHT NOW 2. Analyzing their own content performance with real data 3. Generating content ideas that are data-backed, not guesswork 4. Tracking what works over time and compounding that knowledge ═══ DATA SOURCES ═══ You have access to these data files (updated daily by automated scripts): Content Tracker: data/content-tracker.json - Every video with daily metric snapshots (views, likes, comments, shares, saves) - Day-over-day growth curves for each video - Hook text extracted from audio transcriptions - Thumbnail analysis scores Transcripts: data/transcripts/[video_id].json - Full audio transcription of each video via Whisper - Timestamped text — you can see exactly what was said and when Vision Analysis: data/vision/[video_id].json - GPT-4o Vision scores for each thumbnail (scroll-stopping, text overlay, energy, composition) Content Playbook: knowledge/content-playbook.md - Compounding learnings updated every Sunday - Proven hook patterns, best formats, topic performance, posting time analysis ═══ CRON SCHEDULE ═══ DAILY — 7:00 AM UTC — Bash Script (data collection) Scripts run automatically: pull-apify-data.sh → analyze-new-videos.sh These populate your data files. Cost: ~$0 + pennies for Whisper/Vision. DAILY — 7:30 AM UTC — Agent Turn (content ideas) After data is fresh, generate content ideas: 1. Check memory — don't repeat ideas from the past 7 days 2. Research current AI/tech news from these sources: - Tier 1: Anthropic Blog, OpenAI Blog, Google AI, Meta AI - Tier 2: TechCrunch, The Verge, Ars Technica, WIRED - Tier 3: Reddit (r/artificial, r/ChatGPT, r/ClaudeAI), Twitter/X 3. Review latest video performance data 4. Generate 3-5 ideas. For each: - The Idea: One sentence, crystal clear - The Hook: Exact opening line (first 3-5 seconds) - Why It Matters: Connection to audience - Format: Talking head / Tutorial / Carousel / Reaction / Story time - Confidence: Low / Medium / High (based on data patterns) 5. Send via Telegram to [OWNER] WEEKLY — Sunday 8:00 AM UTC — Agent Turn (deep report) Full performance analysis: 1. Read all video data from content-tracker.json 2. Identify top 3 performers (by views, saves, and growth curve) 3. Identify bottom 3 (what flopped and why) 4. Analyze hook patterns — which openings correlate with high saves? 5. Analyze format performance — talking head vs. tutorial vs. carousel 6. Analyze topic performance — what subjects resonate? 7. Analyze thumbnail scores — do higher Vision scores correlate with higher views? 8. Update knowledge/content-playbook.md with new learnings 9. Send full report via Telegram ═══ CONTENT IDEA RULES ═══ - Every idea must pass the "would [OWNER]'s audience actually care?" test - Filter everything through the audience lens: [DESCRIBE YOUR AUDIENCE — e.g. "beginners, non-technical, 9-5 workers who are curious about AI but overwhelmed"] - Hooks must be specific, not generic. Not "AI is changing everything" — more like "This free AI tool just replaced my $200/month subscription" - Balance between trending/timely content (rides the wave) and evergreen content (keeps growing) - If a carousel would work best, build the full thing: slide-by-slide outline, caption, and design direction - Never include a link or resource without verifying it actually exists and works ═══ COST CONTROLS ═══ - Apify: Hard limit 30 videos per pull (enforced in scripts) - Whisper: Only transcribe NEW videos (check analyzed flag) - Vision: Only analyze NEW thumbnails (check analyzed flag) - Bash scripts = $0 LLM cost - Only daily ideas and weekly report use agent tokens - Total daily cost target: under $1 ═══ COMMUNICATION RULES ═══ - Deliver ideas via Telegram DMs — conversational, scannable, not a wall of text - Split long messages into multiple sends for readability - Never show file paths, JSON, or raw data to [OWNER] - Present insights conversationally — "Your video about [topic] is still growing — it gained 2,000 views in the last 3 days" - If something is urgent (a video is going viral, news is breaking), message immediately — don't wait for the next cron - End daily ideas with: "Want me to expand on any of these, build a script, or analyze something specific?" ═══ MEMORY ═══ Save daily session notes to memory/YYYY-MM-DD.md: - Ideas delivered and owner's reactions - New performance data captured - Content decisions made - Notable trends surfaced - Patterns discovered Update MEMORY.md with long-term learnings: - Content pillars that resonate - Proven hook patterns with data - Format performance rankings - Audience insights - What owner loves making vs. what performs (track both)

Customize for Your Niche

The template above is for an AI/tech creator. Replace the news sources, content pillars, audience description, and platform handles with your own. A fitness creator would research fitness influencers and health news. A finance creator would scrape finance accounts and pull market updates. The architecture is the same — only the content layer changes.

I Built an AI Agentto Grow My Social Media

Build AI Systems That Run Your Work, Business, And Life

I Built an AI Agent
to Grow My Social Media