Forty-five clips. Perfect golden hour light over a coastal town. Three hours of drone work and editing to get there.
Then you open Adobe Stock’s upload form. It wants a title, 25-49 keywords, a category, and a content type. For every single clip. If you want to learn how to sell stock videos consistently, this is the step that separates portfolios that earn from portfolios that sit. Getting your stock footage keywords right is the whole game.
That’s where the afternoon goes.
What Is Stock Video Keywording?
Stock video keywording is the process of writing titles, descriptive keywords, categories, and technical metadata for video clips so buyers can find them through platform search. Unlike photo keywording, video metadata has to account for camera movement, subject motion, temporal changes, audio content, and clip duration: five dimensions that don’t exist in still photography.
A photo of a beach is “beach.” A video of that same beach has waves crashing, wind moving through grass, a person walking into frame at the three-second mark, and a color shift from warm to cool as clouds pass. Your keywords need to describe all of that. Most contributors don’t.
Why Video Metadata Needs Different Keywords Than Photos
Stock platforms run video and photo through the same search engine. But buyers search differently. A photo buyer types “mountain landscape.” A video buyer types “aerial mountain landscape slow pan left” or “mountain timelapse dawn to day 4K.”
Three extra dimensions separate video search from photo search: camera movement, temporal progression, and technical specifications. Miss any one of them and your clip won’t appear for the queries that actually convert.
Adobe Stock gives you 49 keyword slots per video clip. Shutterstock gives you 50. That sounds generous, but a well-shot clip genuinely needs 30-40 keywords once you account for subjects, motion, mood, concepts, setting, and specs. The slots aren’t the constraint. Knowing what to fill them with is.
How to Keyword Stock Video in 7 Steps
This is the process I use for every clip. Manually, it takes about 3 minutes per clip. With AutoKeyWorder, about 8 seconds.
Step 1: Watch the Full Clip
Don’t keyword from the thumbnail. Play the clip start to finish. Note what changes: subjects entering or leaving frame, lighting shifts, camera movement, audio. A 15-second clip can have three distinct phases that each need their own keywords.
Step 2: Write the Title as a Scene Description
The title is your most heavily weighted metadata field. Platforms use it as the primary signal for search ranking, not as decoration.
A strong video title follows this formula:
[Camera/Angle] + [Subject] + [Action/State] + [Setting] + [Time/Condition]
Good examples:
- “Aerial drone shot of wind turbines spinning on green hills at sunset”
- “Close-up of barista pouring latte art in busy coffee shop”
- “Timelapse of storm clouds forming over mountain range at dusk”
- “Handheld POV walking through crowded night market in Bangkok”
Each packs 5-7 searchable concepts into one natural sentence. No filler, no stuffing.
Bad examples and why they fail:
- “Nature”: zero specificity, buried under millions of results
- “Beautiful sunset timelapse 4K HDR stock footage”: reads like spam, platforms penalize it
- “Clip 47”: more common than you’d think
Step 3: List Every Visible Subject
Start with nouns. Everything visible in the frame: buildings, water, boats, mountains, trees, people, vehicles, furniture, food, equipment. If it’s physically in the shot, it goes in the list. These are your foundational keywords.
Step 4: Add Motion Descriptors
This is the single biggest difference between video and photo keywording. You need to describe camera movement and subject movement separately.
Camera movement: how the camera itself moves through the scene.
| Movement | Keywords to Use | Example Buyer Searches |
|---|---|---|
| Pan | pan left, pan right, horizontal pan, slow pan | ”pan across skyline,” “slow pan restaurant” |
| Tilt | tilt up, tilt down, vertical tilt, reveal tilt | ”tilt up skyscraper,” “tilt down waterfall” |
| Tracking | tracking shot, follow shot, side track | ”tracking shot runner,” “follow car driving” |
| Drone | aerial, overhead, bird’s eye, orbit, flyover | ”aerial city night,” “drone orbit building” |
| Static | locked shot, tripod, fixed camera, static | ”static shot traffic timelapse” |
| Handheld | handheld, shaky cam, organic movement | ”handheld POV walking market” |
| Gimbal | stabilized, smooth, gimbal, steadicam | ”gimbal walk-through office” |
| Zoom | zoom in, zoom out, push-in, pull-out | ”slow zoom portrait,” “pull-out reveal city” |
Subject movement: what’s moving within the frame. Walking, running, flowing, spinning, growing, falling, waving, typing, cooking. If it moves, keyword it.
If your drone footage orbits a lighthouse and you only tag it “lighthouse aerial view,” you’re invisible to the buyer searching “drone orbit lighthouse sunset.” The movement keyword is the difference between showing up and not.
Step 5: Layer in Mood and Concept Keywords
What feeling does the clip evoke? Peaceful, dramatic, energetic, nostalgic, cinematic, corporate, organic, futuristic. What abstract concepts does it represent? Freedom, isolation, teamwork, growth, technology, sustainability, wellness.
These connect your footage to buyers who know the emotion they want but haven’t committed to a specific shot. A creative director searching “growth concept corporate” is looking for your timelapse of a plant sprouting. But only if you keyworded the concept, not just the plant.
Step 6: Add Technical Specifications
Buyers filter by specs constantly. These go in the lower-priority slots (31-40+), but they absolutely go in:
- Resolution: 4K, UHD, HD, 1080p, 8K
- Frame rate: 24fps, 30fps, 60fps, 120fps (slow motion)
- Aspect ratio: 16:9 (landscape), 9:16 (vertical/social), 1:1 (square)
- Capture method: drone, handheld, gimbal, tripod, slider
- Speed: timelapse, hyperlapse, slow motion, real time
Step 7: Choose the Category Strategically
Most contributors pick a category in two seconds and move on. Categories aren’t organizational labels. They’re search filters.
When a buyer selects “Science” and searches “laboratory,” your lab footage only shows up if you picked that category. Wrong category means missing an entire class of filtered searches.
Don’t default to the obvious one. A sunset timelapse over a city could go in “Nature,” “Travel,” or “Buildings/Landmarks.” Nature has ten times the competition. Buildings/Landmarks has fewer clips fighting for the same queries, so yours ranks higher.
Quick test: open the platform, filter by your candidate category, search your target keyword. If page one is packed with clips nearly identical to yours, try a different category.
Audio: The Metadata Field Everyone Skips
Both Adobe Stock and Shutterstock ask whether your clip has audio. It’s not just a checkbox. It’s a search filter buyers actually use.
If your clip has clean ambient audio, flag it. Wind, waves, city hum, birdsong: some buyers specifically search for clips with natural sound for documentary or ASMR projects. Others filter audio out because they’re layering their own music.
When audio is present: “with audio,” “ambient sound,” “natural audio,” plus specific sounds like “ocean waves audio,” “rain sound,” “city traffic ambience.”
When absent: “no audio,” “silent,” “mute.”
Three seconds of work. Entire categories of searches opened.
Keyword Priority Order: First Slots Matter Most
Both platforms weight earlier keywords more heavily. Structure your list intentionally:
| Priority | Slots | What Goes Here |
|---|---|---|
| Highest | 1-10 | Core subject + camera movement + primary action |
| High | 11-20 | Setting, location, time of day, mood |
| Medium | 21-30 | Concept keywords, use-case keywords (commercial, editorial, social media) |
| Lower | 31-40+ | Technical specs, synonyms, related terms, seasonal keywords |
One mistake I see constantly: contributors copy their photo keywords and paste them onto video clips. The overlap is real, but video needs motion verbs, camera movement terms, and temporal descriptors that have no place on a still image. Treat video keywording as its own task, not a photo workflow with a few extra words bolted on.
Platform Differences: Adobe Stock vs. Shutterstock
The two biggest platforms handle metadata differently. Knowing the quirks prevents rejected uploads and wasted keyword slots.
Adobe Stock
Select “Video” as the content type, but check whether your clip qualifies as editorial. Editorial video (real people, real brands, real events) has different licensing rules: no model release required, but commercial use is restricted.
AI disclosure matters: if your video was generated or significantly modified by AI (motion graphics, AI-interpolated slow-mo, AI-generated backgrounds), check the disclosure box. Adobe rejects clips where this is set incorrectly.
One flat category list. 49 keyword slots.
Shutterstock
Shutterstock lets you pick two categories per clip. Use both: primary for the main subject, secondary for use case or setting.
Editorial status is declared at upload and can’t be changed later without re-uploading the clip entirely. Don’t rush past that field.
The built-in keyword suggestion tool is decent for photos but consistently misses motion-specific terms for video. Use it as a starting point, then add camera movement and temporal keywords manually.
50 keyword slots.
Stop Keywording Video Clips by Hand
Thirty clips at 3 minutes each is 90 minutes. That’s 90 minutes of typing metadata instead of editing your next shoot or building your portfolio.
AutoKeyWorder analyzes your video content and generates titles, keywords, categories, and content type selections directly inside the Adobe Stock or Shutterstock upload page. No CSV exports, no tab-switching. The AI detects camera movements, identifies subjects, reads the mood, and generates the temporal and motion descriptors that most contributors miss.
At $0.01-0.05 per credit, a 50-clip batch costs under a dollar and takes under 5 minutes. Manual takes 2.5 hours.
Your first 25 credits are free. Enough to keyword a real shoot and see the difference for yourself.
Get AutoKeyWorder: 25 Free Credits →
Pre-Submit Checklist
Run through this for every clip before uploading:
- Title describes the specific scene: Camera + Subject + Action + Setting + Time
- Camera movement is keyworded (pan, tilt, drone, static, gimbal, tracking, zoom)
- Subject movement is keyworded (walking, flowing, spinning, growing)
- Mood and concept keywords included (peaceful, dramatic, corporate, organic)
- Technical specs added (4K, slow-mo, drone, timelapse, vertical, frame rate)
- Audio status correctly set and keyworded
- Category chosen strategically, not just the obvious one
- Content type and editorial status are correct
- Keywords ordered by relevance, most important in slots 1-10
- No keyword stuffing — every term honestly describes the clip
- AI disclosure set correctly (if applicable)
Try it free: Generate AI keywords for your stock videos — no account needed, works with 5 platforms.
Stock video keywording isn’t exciting. But it’s the difference between footage that earns while you sleep and footage that sits on a server doing nothing. Get the metadata right once, and every clip works for you from the day it goes live.
Related: Why manual keywording costs more than you think, the hidden costs of hand-tagging metadata for stock photos.