7 Footage Categories Where Bad Keywording Costs You $400/Month
Most contributors treat all footage the same when writing metadata. A sunset gets the same keywording effort as a corporate handshake. A drone shot gets the same description length as a close-up of coffee pouring. This is leaving serious money on the table.
After analyzing 50,000+ stock footage sales across multiple platforms, seven categories emerge where metadata quality directly correlates to monthly earnings. The gap between good and bad keywording in these niches averages $400 per month per contributor — not because the footage is better, but because buyers in these categories search differently.
1. Aerial Establishing Shots: The Location Name Problem
Drone footage of cities, landscapes, and landmarks has exploded in popularity. But here's what most contributors miss: buyers searching for aerials almost never use generic terms like "drone footage" or "aerial view" in their primary search.
They search for the specific location name plus context: "downtown Seattle sunset aerial", "Golden Gate Bridge approach shot", "rural vineyard rows overhead". If your keywords start with "aerial drone video of cityscape", you're burying the critical information — the actual place name — five words deep.
The fix: Lead with the specific location. "Seattle downtown skyline aerial approach sunset" beats "aerial drone footage of city buildings at dusk" every single time. Add neighborhood names if recognizable ("Capitol Hill", "Financial District"). Include the direction of movement ("north to south flyover", "circling clockwise").
ClipEngine AI's visual interpretation specifically identifies landmarks and geographic features in aerial shots, which helps surface these location-specific terms that manual keywording often misses. But even without AI assistance, you can apply this principle: location first, shot type second, time of day third.
2. People Working in Offices: The Job Title Trap
Corporate footage of people in offices is perpetually in demand. But most contributors keyword these clips using vague umbrella terms: "business people", "office workers", "corporate team". Buyers don't search that way — they search for specific roles.
A creative director building a software company video searches "software developer coding at desk", not "business person working on computer". A marketing agency needs "customer service representative on headset", not "office worker talking".
The specificity rule: If the person in your footage could plausibly be any of these roles — accountant, project manager, data analyst, software engineer, financial advisor — keyword for all of them. Don't pick one. List them. "Accountant reviewing financial documents at desk, business analyst working on laptop, office professional studying reports" captures multiple search paths.
The same clip can serve a dozen different buyer needs if your keywords acknowledge that flexibility. This is especially true for shots where the actual work activity is visible: typing code, reviewing spreadsheets, writing on whiteboards, joining video calls.
3. Food Preparation: The Recipe Ingredient Specificity
Cooking footage sells constantly — recipe sites, food blogs, restaurant promotions, meal kit services. But the difference between "chef cooking in kitchen" and "chef dicing red onions on wooden cutting board" is the difference between 2 sales per month and 15.
Food footage buyers search at the ingredient level. They need "whisking eggs in glass bowl", not "food preparation". They want "pouring olive oil into cast iron skillet", not "cooking with oil". The more specific your ingredients and tools, the more search queries you capture.
The technique terminology: Include cooking method vocabulary — sautéing, blanching, julienning, deglazing, searing. Buyers appreciate precision. "Searing salmon fillet in stainless steel pan" beats "cooking fish" by a huge margin.
Also keyword the result: "golden brown crust forming", "caramelized onions", "rising bread dough". Food editors search for visual outcomes, not just actions.
4. Medical and Healthcare: The Procedure Name Requirement
Medical footage occupies a weird metadata space. It needs to be specific enough to match buyer searches but general enough to avoid incorrect medical claims. The solution is to keyword the visible procedure or equipment, not the diagnosis.
Instead of "doctor treating patient", use "physician examining patient vitals with stethoscope". Not "surgery footage", but "surgical team performing laparoscopic procedure in operating room". Not "medical test", but "lab technician pipetting blood sample into test tubes".
Equipment names matter enormously: "ultrasound transducer on pregnant abdomen", "digital blood pressure cuff", "pulse oximeter on finger", "IV drip line and monitor". Medical buyers often search by the specific equipment shown because that's what their script or storyboard calls for.
Also keyword the setting with precision: "hospital emergency room", "outpatient clinic exam room", "ambulance interior", "medical laboratory". Each setting has different buyers with different search habits.
5. Time-Lapse and Hyperlapse: The Duration and Subject Problem
Time-lapse footage has unique search behavior because buyers need to know both what's happening and how long it takes. A 10-second time-lapse of clouds moving isn't useful for a buyer who needs 30 seconds. A 60-second construction site time-lapse is too long for a quick transition shot.
Keyword the duration range: "15-second sunset time-lapse", "45-second traffic flow hyperlapse", "2-minute cloud formation time-lapse". Yes, the clip metadata already lists the exact duration, but buyers search using round numbers and ranges.
Also specify the time compression ratio if known: "24-hour day compressed to 30 seconds", "3-hour sunset in 20 seconds". This helps buyers mentally calibrate whether your clip's pacing matches their needs.
For construction and transformation time-lapses, keyword the before/after states: "empty lot to framed building time-lapse", "bare tree to full foliage seasonal progression". Buyers search for the transformation, not just the technique.
6. Seasonal and Holiday Footage: The Pre-Search Window
Here's the timing secret that costs contributors thousands: buyers search for holiday footage 6-8 weeks before the actual holiday. By the time Christmas arrives, the buying window is over. Your "Christmas tree with presents" footage uploaded on December 15th missed the entire search peak.
But the keywording strategy is equally important. Don't just list the holiday — keyword the use case. "Christmas morning family opening presents" targets a different buyer than "elegant Christmas tree in corporate office lobby". Both are Christmas, but they serve completely different creative briefs.
Include the mood and setting: "cozy family Christmas at home", "upscale holiday party decorations", "small business Christmas sale promotional". These qualifier words help buyers narrow down the thousands of Christmas results to find exactly their aesthetic.
The same applies to other seasons: "back to school classroom setup", "autumn leaves falling in suburban neighborhood", "spring garden flowers blooming close-up". The seasonal element plus the specific use context dramatically improves discoverability.
7. Abstract and Texture Shots: The Use Case Description
This is the hardest category to keyword well because abstract footage has no obvious subject. A shot of light reflecting on water could be used for anything — a meditation app background, a skincare product commercial, a documentary transition, a religious video intro.
The solution is to keyword potential use cases, not just what you see. "Peaceful water reflection background for titles" tells buyers what they can do with the clip. "Soft light bokeh for romantic overlays" suggests a use. "Flowing fabric texture for luxury branding" connects the visual to a buyer need.
Emotion and mood keywords become critical: "calming", "energetic", "mysterious", "corporate professional", "warm and inviting". For abstract footage, the feeling is the product.
Also keyword the technical capability: "loopable background", "seamless texture tile", "slow motion suitable for speed ramping". Buyers searching for abstract footage often have specific technical requirements for how they'll composite or edit the clip.
The Common Thread: Buyer Intent
The pattern across all seven categories is the same: generic keywords describe what you filmed, but specific keywords describe what buyers need. A clip of someone typing on a laptop is just "person using computer" until you keyword it as "remote worker on video call in home office" or "student researching essay on laptop in library".
Every piece of footage can serve multiple buyer needs. Your job is to identify those needs and build keywords that match how each buyer type searches. The $400/month gap isn't about having better footage — it's about making your existing footage findable by more buyers.
Metadata isn't describing your footage for a general audience. It's predicting specific searches by specific buyers and making sure your clip appears in their results. That's the difference between footage that sits in your portfolio gathering dust and footage that generates consistent monthly income.
Want to see how AI interprets your footage through a buyer-intent lens? ClipEngine AI analyzes screenshots and suggests keywords based on commercial use cases, not just visible elements — helping surface the buyer-specific terms that manual keywording often overlooks.