Accusys
|
b5e3adf5de
|
feat: add generate_seed_embeddings.py for TMDb profile extraction
Implements:
- get_tmdb_identities(): Query PG for TMDb identities with profile photos
- download_tmdb_image(): Download profile image from TMDb (handles full URL or path)
- extract_face_embedding(): CoreML FaceNet 512D embedding extraction
- generate_seed_embeddings(): Full flow: download → extract → push to _seeds
TMDb image handling:
- Supports both full URL (https://...) and path (/xxx.jpg)
- Uses 'original' size for better quality (replaces /w185)
Usage:
python generate_seed_embeddings.py # All TMDb identities
python generate_seed_embeddings.py --limit 10 # Limit to 10
python generate_seed_embeddings.py --dry-run # Don't push to Qdrant
Tested: 3 seeds successfully pushed (Cary Grant, Audrey Hepburn, Walter Matthau)
|
2026-06-25 01:45:48 +08:00 |
|