refactor: remove face embedding architecture - single Qdrant _faces collection

- Delete FaceEmbeddingDb module (face_embedding_db.rs)
- Stub match_faces_iterative, generate_seed_embeddings, tmdb_match_handler
- Remove sync_trace_embeddings, populate_face_embeddings_to_qdrant
- Remove embedding from face.json output (face_processor.py)
- Remove embedding from PG UPDATE (store_traced_faces.py)
- Remove workspace traces staging (checkin.rs, qdrant_workspace.rs)
- Fix tests: add pose_angle to Face, hand_nodes to TkgResult

Disabled functions (need reimplement with _faces):
- match_faces_iterative (identity agent)
- generate_seed_embeddings (TMDb seeds)
- tmdb_match_handler (TMDb matching)
- cluster_face_embeddings, search_similar_faces
- merge_traces_within_cuts
This commit is contained in:
Accusys
2026-06-24 22:27:09 +08:00
parent 360cb991e1
commit 074cdcdbed
60 changed files with 657 additions and 9454 deletions

View File

@@ -119,12 +119,12 @@ curl<span class="w"> </span>-s<span class="w"> </span>-X<span class="w"> </span>
<tr>
<td><code>status</code></td>
<td>string</td>
<td><code>"processing"</code></td>
<td><code>"queued"</code> — file enters the FIFO queue</td>
</tr>
<tr>
<td><code>pids</code></td>
<td>integer[]</td>
<td>Process IDs of started processors</td>
<td>Process IDs of started processors (empty for queued)</td>
</tr>
<tr>
<td><code>message</code></td>
@@ -507,6 +507,239 @@ curl<span class="w"> </span>-s<span class="w"> </span>-X<span class="w"> </span>
</tr>
</tbody>
</table>
<h3><code>GET /api/v1/job/:uuid</code></h3>
<p><strong>Auth</strong>: Required
<strong>Scope</strong>: file-level</p>
<p>Get detailed information about a specific processing job, including its queue position.</p>
<h4>Response (200)</h4>
<div class="codehilite"><pre><span></span><code><span class="p">{</span>
<span class="w"> </span><span class="nt">&quot;id&quot;</span><span class="p">:</span><span class="w"> </span><span class="mi">51</span><span class="p">,</span>
<span class="w"> </span><span class="nt">&quot;uuid&quot;</span><span class="p">:</span><span class="w"> </span><span class="s2">&quot;c36f35685177c981aa139b66bbbccc5b&quot;</span><span class="p">,</span>
<span class="w"> </span><span class="nt">&quot;status&quot;</span><span class="p">:</span><span class="w"> </span><span class="s2">&quot;queued&quot;</span><span class="p">,</span>
<span class="w"> </span><span class="nt">&quot;current_processor&quot;</span><span class="p">:</span><span class="w"> </span><span class="kc">null</span><span class="p">,</span>
<span class="w"> </span><span class="nt">&quot;progress_current&quot;</span><span class="p">:</span><span class="w"> </span><span class="mi">0</span><span class="p">,</span>
<span class="w"> </span><span class="nt">&quot;progress_total&quot;</span><span class="p">:</span><span class="w"> </span><span class="mi">0</span><span class="p">,</span>
<span class="w"> </span><span class="nt">&quot;processors&quot;</span><span class="p">:</span><span class="w"> </span><span class="p">[],</span>
<span class="w"> </span><span class="nt">&quot;created_at&quot;</span><span class="p">:</span><span class="w"> </span><span class="s2">&quot;2026-06-22 23:08:48.497018&quot;</span><span class="p">,</span>
<span class="w"> </span><span class="nt">&quot;started_at&quot;</span><span class="p">:</span><span class="w"> </span><span class="kc">null</span><span class="p">,</span>
<span class="w"> </span><span class="nt">&quot;updated_at&quot;</span><span class="p">:</span><span class="w"> </span><span class="kc">null</span><span class="p">,</span>
<span class="w"> </span><span class="nt">&quot;queue_position&quot;</span><span class="p">:</span><span class="w"> </span><span class="mi">3</span>
<span class="p">}</span>
</code></pre></div>
<table class="table">
<thead>
<tr>
<th>Field</th>
<th>Type</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>id</code></td>
<td>integer</td>
<td>Monitor job ID</td>
</tr>
<tr>
<td><code>uuid</code></td>
<td>string</td>
<td>File UUID</td>
</tr>
<tr>
<td><code>status</code></td>
<td>string</td>
<td><code>"pending"</code>, <code>"queued"</code>, <code>"running"</code>, <code>"completed"</code>, <code>"failed"</code></td>
</tr>
<tr>
<td><code>current_processor</code></td>
<td>string</td>
<td>Currently active processor, or null</td>
</tr>
<tr>
<td><code>progress_current</code></td>
<td>integer</td>
<td>Current progress count</td>
</tr>
<tr>
<td><code>progress_total</code></td>
<td>integer</td>
<td>Total progress count</td>
</tr>
<tr>
<td><code>processors</code></td>
<td>array</td>
<td>Processor list</td>
</tr>
<tr>
<td><code>created_at</code></td>
<td>string</td>
<td>Job creation timestamp</td>
</tr>
<tr>
<td><code>started_at</code></td>
<td>string</td>
<td>Processing start timestamp, or null</td>
</tr>
<tr>
<td><code>updated_at</code></td>
<td>string</td>
<td>Last update timestamp, or null</td>
</tr>
<tr>
<td><code>queue_position</code></td>
<td>integer</td>
<td>Position in FIFO queue (null if not pending/queued)</td>
</tr>
</tbody>
</table>
<hr />
<h3>Status Lifecycle</h3>
<div class="codehilite"><pre><span></span><code><span class="n">register</span><span class="w"> </span><span class="err">──→</span><span class="w"> </span><span class="n">pending</span>
<span class="w"> </span><span class="err"></span>
<span class="w"> </span><span class="n">trigger</span><span class="w"> </span><span class="p">(</span><span class="n">POST</span><span class="w"> </span><span class="o">/</span><span class="n">process</span><span class="p">)</span>
<span class="w"> </span><span class="err"></span>
<span class="w"> </span><span class="n">queued</span><span class="w"> </span><span class="err">←──</span><span class="w"> </span><span class="n">queue_position</span><span class="w"> </span><span class="n">counts</span><span class="w"> </span><span class="n">jobs</span><span class="w"> </span><span class="n">ahead</span>
<span class="w"> </span><span class="err"></span>
<span class="w"> </span><span class="n">worker</span><span class="w"> </span><span class="n">picks</span><span class="w"> </span><span class="n">up</span>
<span class="w"> </span><span class="err"></span>
<span class="w"> </span><span class="n">processing</span>
<span class="w"> </span><span class="err"></span>
<span class="w"> </span><span class="err">┌────────┴────────┐</span>
<span class="w"> </span><span class="err"></span><span class="w"> </span><span class="err"></span>
<span class="w"> </span><span class="n">completed</span><span class="w"> </span><span class="n">failed</span>
<span class="w"> </span><span class="err"></span>
<span class="w"> </span><span class="n">checkin</span><span class="w"> </span><span class="err">──→</span><span class="w"> </span><span class="n">indexed</span>
<span class="w"> </span><span class="n">checkout</span><span class="w"> </span><span class="err">──→</span><span class="w"> </span><span class="n">checked_out</span>
</code></pre></div>
<table class="table">
<thead>
<tr>
<th>Status</th>
<th>Meaning</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>pending</code></td>
<td>File registered, not yet triggered</td>
</tr>
<tr>
<td><code>queued</code></td>
<td>Triggered, waiting for worker in FIFO queue</td>
</tr>
<tr>
<td><code>processing</code></td>
<td>Worker actively processing</td>
</tr>
<tr>
<td><code>completed</code></td>
<td>All processors finished successfully</td>
</tr>
<tr>
<td><code>failed</code></td>
<td>One or more essential processors failed</td>
</tr>
<tr>
<td><code>indexed</code></td>
<td>Post-processing checkin complete</td>
</tr>
<tr>
<td><code>checked_out</code></td>
<td>User checked out the file</td>
</tr>
</tbody>
</table>
<p>Queue order is FIFO (<code>created_at ASC</code>). The <code>GET /api/v1/job/:uuid</code> endpoint returns <code>queue_position</code> showing how many jobs are ahead.</p>
<h3>Frontend Status Mapping</h3>
<p>When displaying file status in the frontend list (e.g. after <code>GET /api/v1/files/scan</code>), map the <code>status</code> field as follows:</p>
<table class="table">
<thead>
<tr>
<th>DB Status</th>
<th>Status Label</th>
<th>Filter: 待處理</th>
<th>Filter: 處理中</th>
<th>Count: pendingCount</th>
<th>Count: processingCount</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>unregistered</code></td>
<td>未註冊</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr>
<td><code>registered</code></td>
<td>待處理</td>
<td><strong>Yes</strong></td>
<td>No</td>
<td><strong>Yes</strong></td>
<td>No</td>
</tr>
<tr>
<td><code>pending</code></td>
<td>待處理</td>
<td><strong>Yes</strong></td>
<td>No</td>
<td><strong>Yes</strong></td>
<td>No</td>
</tr>
<tr>
<td><code>queued</code></td>
<td>排隊中</td>
<td><strong>Yes</strong></td>
<td><strong>Yes</strong></td>
<td><strong>Yes</strong></td>
<td><strong>Yes</strong></td>
</tr>
<tr>
<td><code>processing</code></td>
<td>處理中</td>
<td>No</td>
<td><strong>Yes</strong></td>
<td>No</td>
<td><strong>Yes</strong></td>
</tr>
<tr>
<td><code>completed</code></td>
<td>已完成</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr>
<td><code>failed</code></td>
<td>處理失敗</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr>
<td><code>indexed</code></td>
<td>已入庫</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
</tbody>
</table>
<p><strong><code>queued</code> 的特殊處理</strong>
- <code>statusLabel</code> → 顯示「排隊中」,加 <code>ms-badge-warn</code> 樣式(黃色)
- <code>filterPending</code> → 應包含 <code>queued</code>讓它在「待處理」filter 可見
- <code>pendingCount</code> + <code>processingCount</code> → 兩者都應包含 <code>queued</code>,因它既是「待處理」也是「正在排隊」
- 在 <code>refreshAllStatus</code> / <code>loadFiles</code> 中,如果檔案狀態是 <code>queued</code>,應顯示簡單的排隊訊息(無需 polling progress
- 當 worker pickup 後,狀態會變為 <code>processing</code>,此時 <code>refreshAllStatus</code> 會自動偵測到並開始 polling progress
- 也可以提供一個「queue_position」顯示呼叫 <code>GET /api/v1/job/:uuid</code> 取得排在第幾位</p>
<hr />
<h3><code>GET /api/v1/file/:file_uuid/processor-counts</code></h3>
<p><strong>Auth</strong>: Required
<strong>Scope</strong>: file-level</p>
@@ -652,7 +885,7 @@ curl<span class="w"> </span>-s<span class="w"> </span>-X<span class="w"> </span>
<p>Phase 1 (<code>/phase1</code>) combines store-asrx + rule1 + vectorize into one call.</p>
<hr />
<p><em>Updated: 2026-06-20 12:00:00</em></p>
<p><em>Updated: 2026-06-23 — Added queued status, FIFO queue order, queue_position in job detail, frontend status mapping table</em></p>
</div>
</body>
</html>