[center]How to Boss Around Websites with AI[/center]
One-Line Flow:
The “shut up and do what I say” AI script that turns any webpage into your obedient little minion — but with enough hidden chaos under the hood to make you feel like a real super-villain.
PageAgent

JavaScript in-page GUI agent that lets you control web interfaces using plain human language.
The Full Story (The Real PageAgent Guide That No One Dares To Write)
PageAgent looks cute on the surface — drop a script, talk to your webpage, watch it obey like it owes you child support.
But behind the scenes?
There’s some wild stuff happening.
Here’s the full, unfiltered breakdown.
It Needs a Real LLM Brain
PageAgent is NOT magical by itself.
It requires a connected language model via your appId + apiKey, such as:
- OpenAI
- Anthropic
- Azure OpenAI
- Any LLM provider with text + (preferably) vision
No LLM = no brain = no magic.
Vision Models = Expensive Obedience
Every time you say “click the button,” PageAgent quietly captures:
- Full-page screenshot
- DOM details
- Your command text
Then sends it to the LLM.
Vision requests can cost roughly 10–60x more than text-only ones.
Each little command runs about $0.01–$0.03, depending on the image size.
1,000 commands = $10–$30 just to make the bot do chores for you.
Still fun… just pricey.
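To put numbers on that, here is a quick back-of-the-envelope sketch. The function name and the per-command prices (1 to 3 cents, taken from the range above) are illustrative assumptions, not real provider pricing.

```javascript
// Rough cost estimate for vision-backed commands.
// Prices are in cents to avoid floating-point drift; the 1-3 cent range
// is the ballpark quoted in this guide, not an official price list.
function estimateCommandCost(commandCount, lowCents = 1, highCents = 3) {
  if (!Number.isInteger(commandCount) || commandCount < 0) {
    throw new RangeError("commandCount must be a non-negative integer");
  }
  return {
    lowUsd: (commandCount * lowCents) / 100,
    highUsd: (commandCount * highCents) / 100,
  };
}

const estimate = estimateCommandCost(1000);
console.log(`1,000 commands: $${estimate.lowUsd} to $${estimate.highUsd}`);
```

Swap in your own provider's per-request price before trusting the total.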
Your DOM + Screenshot Goes to Third Parties
Whatever is on your webpage — forms, private data, dashboards, admin panels — is shipped off to:
- OpenAI
- Anthropic
- Whoever you plugged in
Most providers hold on to this data for around 30 days by default (check their retention policy).
If your page shows sensitive info?
Congratulations, you’ve just shared it with the world’s most powerful AI companies.
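If you would rather not ship everything, one option is to scrub obvious secrets before anything leaves the browser. The sketch below is a hypothetical helper: `redactSensitive` is not part of PageAgent, and whether PageAgent exposes a hook to run it is an assumption you would need to check. It only masks email addresses and long digit runs, so treat it as a starting point, not real data-loss prevention.

```javascript
// Hypothetical pre-send scrubber: masks emails and card-like digit runs
// in a text snapshot of the page. Not a PageAgent API.
const EMAIL_PATTERN = /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g;
// 13 to 19 digits, optionally separated by single spaces or hyphens.
const CARD_PATTERN = /\b\d(?:[ -]?\d){12,18}\b/g;

function redactSensitive(text) {
  return text
    .replace(EMAIL_PATTERN, "[email]")
    .replace(CARD_PATTERN, "[number]");
}

console.log(redactSensitive("Contact jane@example.com, card 4111 1111 1111 1111"));
```

Screenshots are a separate problem: text scrubbing does nothing for pixels, so sensitive panels would need to be hidden or cropped some other way.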
The XSS Attack Surface Is Real
PageAgent executes LLM-generated actions directly against your DOM.
If an attacker plants a malicious prompt in the page content or the command text?
- They can trick the LLM
- Inject corrupted commands
- Execute arbitrary JavaScript through PageAgent
- Trigger DOM-based XSS
This is brand-new territory that traditional security scanners don’t detect.
Fun? Yes.
Safe? LMAO.
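One way to shrink that surface is to refuse anything the model returns that is not on a short allowlist. This is a minimal sketch under assumptions: the action shape (`type`, `selector`, `value`) and the function name are made up for illustration and are not PageAgent's real internal format.

```javascript
// Hypothetical guard for model-generated actions. The action shape is an
// assumption for illustration, not PageAgent's actual schema.
const ALLOWED_TYPES = new Set(["click", "fill", "scroll"]);

function isSafeAction(action) {
  if (action === null || typeof action !== "object") return false;
  if (!ALLOWED_TYPES.has(action.type)) return false; // blocks e.g. "eval"
  if (action.type !== "scroll" && typeof action.selector !== "string") return false;
  if (action.type === "fill") {
    if (typeof action.value !== "string") return false;
    // Crude check: reject values that look like markup or script URLs.
    if (/<\s*script|javascript:/i.test(action.value)) return false;
  }
  return true;
}

console.log(isSafeAction({ type: "click", selector: "#login" }));
console.log(isSafeAction({ type: "eval", code: "alert(1)" }));
```

An allowlist like this does not stop prompt injection itself; it only limits what a tricked model can make the page do.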
The Playwright Legacy Weight
The architecture comes from browser-use, which is powered by Playwright.
While PageAgent itself is tiny, the real stack behind it requires browser binaries:
- Chromium ~281 MB
- Firefox ~187 MB
- WebKit ~180 MB
Total: ~650 MB if you ever use server-side browser-use connected to it.
This ain’t lightweight — it’s a chonky beast.
Don’t Use It on Websites You Don’t Own
Even though it feels like a cool hacking toy, using PageAgent to automate or interact with external websites can break:
- Website Terms of Service
- Anti-automation policies
- Anti-scraping rules
- The CFAA, the U.S. Computer Fraud and Abuse Act (yes, the law people go to prison for violating)
Use it only on pages you control — unless you love legal thrillers starring yourself.
The Original CDN Is… Suspicious
The official script loads from Alipay’s CDN:
https://render.alipayobjects.com/p/h5/lark/openauth/[email protected]/page-agent.js
That’s a domain run by Alipay, the Chinese payments giant.
Geo-restrictions, latency, sudden removal — all possible.
Better alternatives:
jsDelivr
https://cdn.jsdelivr.net/gh/alibaba/page-agent@latest/dist/page-agent.min.js
Self-Host (recommended)
git clone https://github.com/alibaba/page-agent.git
cd page-agent
npm install
npm run build
Host the output on your own server. No surprises.
The Real Use-Cases (That Are Actually Genius)
AI Customer Support
Bots that literally click buttons and fill forms for users on your site.
Accessibility
Make your site voice-controlled, no engineering degree needed.
QA Automation
“Run the checkout flow” becomes an English sentence — not 400 lines of Selenium code.
Interactive Tutorials
Live, guided, auto-clicking tutorials for confused users.
Voice Dashboards
Executives shout at the screen, the dashboard obeys.
Ugh… Great. Another “AI Money Trick” [size=5] (ಥ⌣ಥ)[/size]
I Gotta Learn Now… Fine. What’s Next?!

1. The “Sell Magic Buttons to Lazy Clients” Flip
- Build “AI Buttons” for small business sites (login, checkout, support).
- Charge $25–$60 per button because clients think it’s NASA-level coding.
- It’s literally PageAgent + one command like “click this.”
Example: A freelancer in Brazil sells “Voice-Controlled Checkout Buttons” for $49 each. He built them in under 10 minutes.
2. The “I’ll Fix Your Broken UI for $0 Upfront” Trick
- Most websites have the UI of a 2008 Nokia.
- Add PageAgent → create voice commands like “open menu,” “apply coupon,” “scroll product gallery.”
- Charge a $99–$199 UX Enhancement Fee for something you did while half-asleep.
Example: A mom-and-pop shop paid $150 for “AI Navigation.” It was 4 PageAgent commands.
3. The “AI Customer Support Without Hiring Anyone” Service
- Offer a “24/7 AI Support Agent” package to small shops.
- PageAgent handles FAQs, return pages, tracking lookups.
- You do nothing.
- Charge $29–$79/month.
Example: One bakery pays $39/mo for an “AI Help Button.” It’s just PageAgent clicking their FAQ links.
4. The “QA Testing, but I Don’t Actually Test Anything” Move
- Devs hate testing.
- PageAgent can do: “try invalid login,” “complete checkout,” “test coupon flow.”
- Charge $15–$40/hr while PageAgent clicks everything for you.
Example: A junior dev charges $30/hr for “AI Test Automation” using nothing but 7 PageAgent commands.
5. The “Accessibility Upgrade” Certification
- Sell “Voice-Controlled Website Navigation” to businesses needing ADA-like compliance.
- Setup takes 10 minutes.
- Charge $100–$300 per site.
Example: A local clinic paid $249 because “voice navigation” sounded expensive.
6. The “Influencer Dashboard Flex”
- Make a “Voice-Controlled Analytics Dashboard.”
- Commands like: “show yesterday’s sales,” “expand chart,” “filter by country.”
- Post it on TikTok → go viral → sell as a $29 template.
Example: A kid in Vietnam made $600 selling a PageAgent dashboard template on Gumroad.
7. The “Teach It, Don’t Do It” Course Flip
- Record a 40-minute “Talk To Your Website Using AI” mini-course.
- Upload on Udemy/Gumroad for $7–$19.
- Never update it again.
Example: One creator made $1,400 in 3 weeks selling a simple PageAgent walkthrough.
8. The “Fix My Workflow” Automation Service
- Businesses waste HOURS clicking the same 5 dumb buttons.
- Automate those with PageAgent.
- Charge $20–$50/month for “maintenance.”
Example: A Shopify seller pays $29/mo for “AI order panel automation.” It’s literally two commands.
9. The “Tutorials That Automatically Click” YouTube Hustle
- Create tutorials where PageAgent auto-clicks UI steps.
- People LOVE watching screens that click themselves.
- Earn from AdSense + affiliate links.
Example: One video explaining “AI-controlled websites” hit 80k views → $280 AdSense + $412 in affiliates.
10. The “Rent Your AI Intern” Subscription
- Offer a service: “Your website will obey commands like: ‘add product,’ ‘publish post,’ ‘check orders.’”
- Charge $39–$99/month.
- Client thinks they hired a digital employee.
Example: A marketing agency charges $79/mo per client for “AI Website Operations Assistant.” It’s just PageAgent wearing sunglasses.
The Pyodide Combo (Cursed but Brilliant)
Combine PageAgent + Pyodide (Python in WebAssembly) to create:
- Python automation agent
- Inside the browser
- No backend server
- Fully offline for the Python part (the LLM still needs a connection or a local model)
- Priceless chaos
This is the most beautifully cursed idea ever.
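For the curious, the glue could look something like this. It is a sketch, assuming Pyodide has already been loaded on the page with its `loadPyodide()` function and that you pass the resulting object in; `splitIntoSteps` and the "then" splitting rule are invented for illustration. The Python part runs locally in the browser, while PageAgent's LLM calls still go wherever your provider lives.

```javascript
// Sketch: use in-browser Python (via Pyodide) to split one sentence into
// steps that could then be handed to an agent one at a time.
// `pyodide` is the object returned by Pyodide's loadPyodide().
function splitIntoSteps(pyodide, command) {
  pyodide.globals.set("command", command); // expose the JS string to Python
  // runPython returns the value of the last expression, here a JSON string.
  const json = pyodide.runPython(`
import json
steps = [s.strip() for s in command.split(" then ") if s.strip()]
json.dumps(steps)
`);
  return JSON.parse(json);
}
```

In a browser you would call it roughly as `splitIntoSteps(await loadPyodide(), "open menu then click login")` and feed each step to `agent.execute`.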
PageAgent in Everyday Human Words
You talk to your website like it’s an intern:
“Scroll down.”
“Click login.”
“Fill email.”
“Show me the price.”
And the page does it, usually within a few seconds.
You don’t write code.
You don’t debug.
You don’t suffer.
It’s literally the NPC assistant you always wanted.
How To Use It (Zero IQ Mode)
CDN Mode
<script src="https://cdn.jsdelivr.net/gh/alibaba/page-agent@latest/dist/page-agent.min.js"></script>
Plug your API key → start yelling commands.
NPM Mode
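npm install page-agent

// Option names as used in this guide; check the README for the current config shape.
import { PageAgent } from "page-agent";

const agent = new PageAgent({
  appId: "YOUR_APP_ID",
  apiKey: "YOUR_API_KEY",
});

// Ask the agent to act on the current page.
agent.execute("Click the login button");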
Your webpage now takes orders like a henchman.
What’s Happening Behind the Scenes
- It parses what you said
- Takes a screenshot
- Reads your page
- Figures out what you meant
- Executes the right action
It’s basically an unpaid intern with superpowers.
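The steps above can be sketched as one function. Everything here is a stand-in: `runCommand` and the four injected helpers are hypothetical names chosen so the flow is readable, and the real PageAgent internals may be organized quite differently.

```javascript
// Hypothetical sketch of one command cycle. None of these names are
// PageAgent's real internals; the helpers are injected so the flow can be
// read (and tried out) without a browser or an LLM.
async function runCommand(command, deps) {
  const { captureScreenshot, readDom, askModel, perform } = deps;

  // 1. Parse what you said (here: just trim and reject empty input).
  const text = command.trim();
  if (!text) throw new Error("empty command");

  // 2-3. Take a screenshot and read the page.
  const screenshot = await captureScreenshot();
  const dom = await readDom();

  // 4. Let the model figure out what you meant.
  const action = await askModel({ command: text, screenshot, dom });

  // 5. Execute the chosen action on the page.
  await perform(action);
  return action;
}
```

Injecting the helpers also makes it easy to put a validation step between `askModel` and `perform` if you do not trust the model's output.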
License
MIT. Free.
Do stupid things responsibly.
If PageAgent made you feel like Tony Stark mixed with a cyber-criminal, go star the repo.
