playbook/antigravity-awesome-skills/skills/web-media-getter/SKILL.md

120 lines
5.6 KiB
Markdown

---
name: web-media-getter
description: "One query across free image / video / GIF APIs (stock + historical/archival + GIF engines), returning normalized, license-tagged results with optional top-K download + attribution sidecar. The retrieval peer to local semantic search and generative media."
risk: safe
source: community
source_type: community
source_repo: connerkward/web-media-getter-skill
date_added: "2026-06-16"
author: Conner K Ward
license: MIT
tags:
- media
- images
- video
- gif
- stock
- archival
- attribution
tools:
- claude-code
- antigravity
- cursor
- gemini-cli
- codex-cli
---
## When to Use
Use when a task needs a REAL or ARCHIVAL photo / clip (hero, texture, reference, historical footage) or a reaction / animated GIF, rather than a generated one — fan out across free image/video/GIF sources in one query and download license-tagged results.
_Source: [connerkward/web-media-getter-skill](https://github.com/connerkward/web-media-getter-skill) (MIT)._
# web-media
Query many free image/video sources in one fan-out, get a normalized result list,
optionally download top-K with an attribution sidecar. Zero-dep stdlib script.
**Script:** `webmedia.py` (in this dir). **Keys:** `PEXELS_API_KEY`, `PIXABAY_API_KEY`
in `central/.env` (optional — the 5 no-key sources work without them).
## Sources
| Source | Key? | Best for | Media |
|--------|------|----------|-------|
| openverse | none | CC web images (Flickr, museums) | image |
| wikimedia | none | factual / historical / landmark photos | image |
| internetarchive | none | **historical/archival** images + films | image, video |
| loc | none | historical US prints/photos | image |
| nasa | none | space imagery + video | image, video |
| pexels | free key | modern stock photos + **short video clips** | image, video |
| pixabay | free key | modern photos/illustrations + **short clips** | image, video |
| klipy | free key | **GIFs** — recommended (free, unlimited, Tenor drop-in) | gif |
| giphy | free key | **GIFs** — biggest library (prod key needs approval) | gif |
GIF sources fire only with `--type gif`. Keys: `KLIPY_API_KEY`, `GIPHY_API_KEY`
in `central/.env`. (tenor adapter removed — Google EOL'd the API 2026-06-30.)
**klipy** is the one to get (free + unlimited);
its adapter is **unverified — assumes Tenor-compatible** request/response;
verify against docs.klipy.com when you key it. `webmedia.py "shrug" --type gif --count 6 --json`
## Usage
```bash
webmedia.py "1950s street scene" --type image --count 8 --json
webmedia.py "rocket launch" --type video --source nasa,internetarchive
webmedia.py "car factory 1930s" --source all --download --out /tmp/cars
```
- `--source all` (default) | `nokey` (no-key only) | comma list (`wikimedia,pexels`)
- `--type image|video` · `--count N` · `--json` · `--download --out DIR`
- `--download` fetches each result's direct media URL and writes `attribution.json`
(source, author, license, url, page_url) alongside the files.
## Record schema
`{source, title, url, thumb, dl, page_url, author, license, w, h, type}`
`dl` is the directly-downloadable media URL (None when only a page exists).
## The video caveat (important)
Archival sources (Internet Archive, Europeana, LoC) host **whole films/documentaries**,
not single shots. So:
- **Modern single clip** → `pexels` / `pixabay` (born as short clips, direct MP4). Done.
- **Historical single shot** → retrieve the IA film here, then extract the shot:
- **Twelve Labs** Marengo search (free 600 min) — pass the IA public MP4 URL, get a
timestamped moment for "car on assembly line", clip with ffmpeg. Semantic, cheap.
- or **PySceneDetect** (free, local) to cut the film into shots, then rank keyframes
with CLIP via the `muser` skill. Fully offline.
## Audio: freesound + audio QA
`webmedia.py` is image/video. For **sound effects** (real, CC-licensed) and for
**judging audio** (since Claude can't hear), two sibling scripts live in
`central/scripts/`:
- **`freesound-fetch.py "<query>" [count] [max_sec] [out_dir]`** — searches freesound.org
and downloads short hq-mp3 previews. Prints one JSON line per file with
`license`/`user` for attribution. Key: `FREESOUND_API_KEY` in `central/.env`
(token-based read; full originals would need OAuth — previews suffice for SFX).
- **`audio-judge.py <file> "<target>"`** — sends the clip to OpenAI `gpt-audio`
(audio-native) and returns JSON `{heard, score, matches, suggestion}`, enabling a
generate/fetch → judge → iterate loop. Auto-sources a real `sk-` `OPENAI_API_KEY`
from `.env` (ignores a local `lm-studio` stub env var). Pads sub-2s clips so the
speech-tuned model doesn't refuse. **Caveat:** it reliably *describes* audio and
filters obvious mismatches, but it is NOT a trustworthy judge of subjective qualities
like "grating" — it labels nearly any beep "sharp/high-pitched". Use it to cull, not
to make the final aesthetic call; confirm by ear.
## Where this fits
This is the **internet-retrieval** capability — peer to `muser` (local semantic search)
and `fal` (generate). A future `media` router would fan out across all three and rank
candidates by relevance (CLIP), handing aesthetic spreads to `lookdev`. Don't build that
router until the model demonstrably mis-routes without it.
## Limitations
- Results depend on third-party API availability, quotas, credentials, and license metadata quality.
- License tags and attribution fields must still be reviewed before commercial or public use.
- Relevance ranking can find plausible assets, but final aesthetic fit, brand safety, and audio suitability require human inspection.