更新

2026-02-10 13:31:29 +08:00 · 2026-02-09 14:47:19 +08:00 · 2026-02-08 19:54:11 +08:00 · 2026-02-08 16:23:39 +08:00 · 2026-02-08 10:46:08 +08:00 · 2026-02-07 14:29:57 +08:00
121 changed files with 12280 additions and 4178 deletions
--- a/.gitignore
+++ b/.gitignore
@@ -20,11 +20,14 @@ node_modules/
 out/
 .turbo/

-# ============ IDE ============
+# ============ IDE / AI 工具 ============
 .vscode/
 .idea/
 *.swp
 *.swo
+.agents/
+.opencode/
+.claude/

 # ============ 系统文件 ============
 .DS_Store
@@ -35,11 +38,21 @@ desktop.ini
 backend/outputs/
 backend/uploads/
 backend/cookies/
+backend/user_data/
+backend/debug_screenshots/
 *_cookies.json

-# ============ MuseTalk ============
+# ============ 模型权重 ============
+models/*/checkpoints/
 models/MuseTalk/models/
 models/MuseTalk/results/
+models/LatentSync/temp/
+
+# ============ Remotion 构建 ============
+remotion/dist/
+
+# ============ 临时文件 ============
+Temp/

 # ============ 日志 ============
 *.log
--- a/Docs/BACKEND_DEV.md
+++ b/Docs/BACKEND_DEV.md
@@ -19,7 +19,6 @@
 - **repositories/**：数据读写（Supabase），不包含业务逻辑。
 - **services/**：外部依赖与基础能力（TTS、Storage、Remotion 等）。
 - **core/**：配置、安全、依赖注入、统一响应。
- **api/**：仅做 router 透传，保持 `/api/*` 路由稳定。

 ---

@@ -28,18 +27,30 @@
 ```
 backend/
 ├── app/
-│   ├── api/                 # 兼容路由入口，透传到 modules
 │   ├── core/                # config、deps、security、response
-│   ├── modules/             # 业务模块
-│   │   ├── videos/
-│   │   ├── materials/
-│   │   ├── publish/
-│   │   ├── auth/
-│   │   └── ...
+│   ├── modules/             # 业务模块（路由 + 逻辑）
+│   │   ├── videos/          # 视频生成任务（router/schemas/service/workflow）
+│   │   ├── materials/       # 素材管理（router/schemas/service）
+│   │   ├── publish/         # 多平台发布
+│   │   ├── auth/            # 认证与会话
+│   │   ├── ai/              # AI 功能（标题标签生成、多语言翻译）
+│   │   ├── assets/          # 静态资源（字体/样式/BGM）
+│   │   ├── ref_audios/      # 声音克隆参考音频（router/schemas/service）
+│   │   ├── generated_audios/ # 预生成配音管理（router/schemas/service）
+│   │   ├── login_helper/    # 扫码登录辅助
+│   │   ├── tools/           # 工具接口（router/schemas/service）
+│   │   └── admin/           # 管理员功能
 │   ├── repositories/        # Supabase 数据访问
 │   ├── services/            # 外部服务集成
+│   │   ├── uploader/        # 平台发布器（douyin/weixin）
+│   │   ├── qr_login_service.py
+│   │   ├── publish_service.py
+│   │   ├── remotion_service.py
+│   │   ├── storage.py
+│   │   └── ...
 │   └── tests/
 ├── assets/                  # 字体 / 样式 / bgm
+├── user_data/               # 用户隔离数据（Cookie 等）
 ├── scripts/
 └── requirements.txt
 ```
@@ -86,6 +97,21 @@ backend/
 - 所有文件上传/下载/删除/移动通过 `services/storage.py`。
 - 需要重命名时使用 `move_file`，避免直接读写 Storage。

+### Cookie 存储（用户隔离）
+
+多平台扫码登录产生的 Cookie 按用户隔离存储：
+
+```
+backend/user_data/{user_uuid}/cookies/
+├── douyin_cookies.json
+├── weixin_cookies.json
+└── ...
+```
+
+- `publish_service.py` 中通过 `_get_cookies_dir(user_id)` / `_get_cookie_path(user_id, platform)` 定位
+- 会话 key 格式：`"{user_id}_{platform}"`，确保多用户并发登录互不干扰
+- 登录成功后 Cookie 自动保存到对应路径，发布时自动加载
+
 ---

 ## 7. 代码约定
@@ -99,10 +125,13 @@ backend/

 ## 8. 开发流程建议

- **新增功能**：先建模块，再写 router/service/workflow。
- **修复 Bug**：顺手把涉及的逻辑抽到对应 service/workflow。
+- **新增功能**：先建模块，**必须**包含 `router.py + schemas.py + service.py`，不允许 router-only。
+- **修复 Bug**：顺手把涉及的逻辑抽到对应 service/workflow（渐进式改造）。
+- **改旧模块**：改动哪部分就拆哪部分，不要求一次重构整个文件。
 - **核心流程变更**：必跑冒烟（登录/生成/发布）。

+> **渐进原则**：新代码高标准，旧代码逐步改。不做大规模一次性重构，避免引入回归风险。
+
 ---

 ## 9. 常用环境变量
@@ -112,10 +141,35 @@ backend/
 - `REDIS_URL`
 - `GLM_API_KEY`
 - `LATENTSYNC_*`
+- `CORS_ORIGINS` (CORS 白名单，默认 *)
+
+### 微信视频号
+- `WEIXIN_HEADLESS_MODE` (headful/headless-new)
+- `WEIXIN_CHROME_PATH` / `WEIXIN_BROWSER_CHANNEL`
+- `WEIXIN_USER_AGENT` / `WEIXIN_LOCALE` / `WEIXIN_TIMEZONE_ID`
+- `WEIXIN_FORCE_SWIFTSHADER`
+- `WEIXIN_TRANSCODE_MODE` (reencode/faststart/off)
+
+### 抖音
+- `DOUYIN_HEADLESS_MODE` (headful/headless-new，默认 headless-new)
+- `DOUYIN_CHROME_PATH` / `DOUYIN_BROWSER_CHANNEL`
+- `DOUYIN_USER_AGENT` (默认 Chrome/144)
+- `DOUYIN_LOCALE` / `DOUYIN_TIMEZONE_ID`
+- `DOUYIN_FORCE_SWIFTSHADER`
+- `DOUYIN_DEBUG_ARTIFACTS` / `DOUYIN_RECORD_VIDEO` / `DOUYIN_KEEP_SUCCESS_VIDEO`
+- `DOUYIN_COOKIE` (抖音视频下载 Cookie)

 ---

-## 10. 最小新增模块示例
+## 10. Playwright 发布调试
+
+- 诊断日志落盘：`backend/app/debug_screenshots/weixin_network.log` / `douyin_network.log`
+- 关键失败截图：`backend/app/debug_screenshots/weixin_*.png` / `douyin_*.png`
+- 视频号建议使用 headful + xvfb-run（避免 headless 解码/指纹问题）
+
+---
+
+## 11. 最小新增模块示例

 ```
 app/modules/foo/
--- a/Docs/BACKEND_README.md
+++ b/Docs/BACKEND_README.md
@@ -13,14 +13,25 @@
 ```
 backend/
 ├── app/
-│   ├── api/              # 兼容路由入口 (透传到 modules)
 │   ├── core/             # 核心配置 (config.py, security.py, response.py)
 │   ├── modules/          # 业务模块 (router/service/workflow/schemas)
+│   │   ├── videos/       # 视频生成任务（router/schemas/service/workflow）
+│   │   ├── materials/    # 素材管理（router/schemas/service）
+│   │   ├── publish/      # 多平台发布
+│   │   ├── auth/         # 认证与会话
+│   │   ├── ai/              # AI 功能（标题标签生成、多语言翻译）
+│   │   ├── assets/          # 静态资源（字体/样式/BGM）
+│   │   ├── ref_audios/      # 声音克隆参考音频（router/schemas/service）
+│   │   ├── generated_audios/ # 预生成配音管理（router/schemas/service）
+│   │   ├── login_helper/    # 扫码登录辅助
+│   │   ├── tools/           # 工具接口（router/schemas/service）
+│   │   └── admin/           # 管理员功能
 │   ├── repositories/     # Supabase 数据访问
-│   ├── services/         # 外部服务集成 (TTS/Remotion/Storage 等)
+│   ├── services/         # 外部服务集成 (TTS/Remotion/Storage/Uploader 等)
 │   └── tests/            # 单元测试与集成测试
 ├── scripts/              # 运维脚本 (watchdog.py, init_db.py)
 ├── assets/               # 资源库 (fonts, bgm, styles)
+├── user_data/            # 用户隔离数据 (Cookie 等)
 └── requirements.txt      # 依赖清单
 ```

@@ -42,25 +53,53 @@ backend/

 2.  **视频生成 (Videos)**
    *   `POST /api/videos/generate`: 提交生成任务
-    *   `GET /api/videos/tasks/{task_id}`: 查询任务状态
+    *   `GET /api/videos/tasks/{task_id}`: 查询单个任务状态
+    *   `GET /api/videos/tasks`: 获取用户所有任务列表
    *   `GET /api/videos/generated`: 获取历史视频列表
    *   `DELETE /api/videos/generated/{video_id}`: 删除历史视频

-> **修正 (16:20)**：任务查询与历史列表接口已更新为 `/api/videos/tasks/{task_id}` 与 `/api/videos/generated`。
-
 3.  **素材管理 (Materials)**
    *   `POST /api/materials`: 上传素材
    *   `GET /api/materials`: 获取素材列表
    *   `PUT /api/materials/{material_id}`: 重命名素材

 4.  **社交发布 (Publish)**
-    *   `POST /api/publish`: 发布视频到 B站/抖音/小红书
+    *   `POST /api/publish`: 发布视频到 抖音/微信视频号/B站/小红书
+    *   `POST /api/publish/login`: 扫码登录平台
+    *   `GET /api/publish/login/status`: 查询登录状态（含刷脸验证二维码）
+    *   `GET /api/publish/accounts`: 获取已登录账号列表
+
+> 提示：视频号/抖音发布建议使用 headful + xvfb-run 运行后端。

 5.  **资源库 (Assets)**
    *   `GET /api/assets/subtitle-styles`: 字幕样式列表
    *   `GET /api/assets/title-styles`: 标题样式列表
    *   `GET /api/assets/bgm`: 背景音乐列表

+6.  **声音克隆 (Ref Audios)**
+    *   `POST /api/ref-audios`: 上传参考音频 (multipart/form-data)
+    *   `GET /api/ref-audios`: 获取参考音频列表
+    *   `PUT /api/ref-audios/{id}`: 重命名参考音频
+    *   `DELETE /api/ref-audios/{id}`: 删除参考音频
+
+7.  **AI 功能 (AI)**
+    *   `POST /api/ai/generate-meta`: AI 生成标题和标签
+    *   `POST /api/ai/translate`: AI 多语言翻译（支持 9 种目标语言）
+
+8.  **预生成配音 (Generated Audios)**
+    *   `POST /api/generated-audios/generate`: 异步生成配音（返回 task_id）
+    *   `GET /api/generated-audios/tasks/{task_id}`: 轮询生成进度
+    *   `GET /api/generated-audios`: 列出用户所有配音
+    *   `DELETE /api/generated-audios/{audio_id}`: 删除配音
+    *   `PUT /api/generated-audios/{audio_id}`: 重命名配音
+
+9.  **工具 (Tools)**
+    *   `POST /api/tools/extract-script`: 从视频链接提取文案
+
+10. **健康检查**
+    *   `GET /api/lipsync/health`: LatentSync 服务健康状态
+    *   `GET /api/voiceclone/health`: Qwen3-TTS 服务健康状态
+
 ### 统一响应结构

 ```json
@@ -78,10 +117,22 @@ backend/

 `POST /api/videos/generate` 支持以下可选字段：

+- `material_path`: 视频素材路径（单素材模式）
+- `material_paths`: 多素材路径数组（多机位模式，≥2 个素材时按句子自动切换）
+- `tts_mode`: TTS 模式 (`edgetts` / `voiceclone`)
+- `voice`: EdgeTTS 音色 ID（edgetts 模式）
+- `ref_audio_id` / `ref_text`: 参考音频 ID 与文本（voiceclone 模式）
+- `generated_audio_id`: 预生成配音 ID（存在时跳过内联 TTS，使用已生成的配音文件）
+- `custom_assignments`: 自定义素材分配数组（每项含 `material_path` / `start` / `end` / `source_start`），存在时跳过 Whisper 均分
+- `language`: TTS 语言（默认自动检测，声音克隆时透传给 Qwen3-TTS）
+- `title`: 片头标题文字
 - `subtitle_style_id`: 字幕样式 ID
 - `title_style_id`: 标题样式 ID
 - `subtitle_font_size`: 字幕字号（覆盖样式默认值）
 - `title_font_size`: 标题字号（覆盖样式默认值）
+- `title_top_margin`: 标题距顶部像素
+- `subtitle_bottom_margin`: 字幕距底部像素
+- `enable_subtitles`: 是否启用字幕
 - `bgm_id`: 背景音乐 ID
 - `bgm_volume`: 背景音乐音量（0-1，默认 0.2）

@@ -146,7 +197,7 @@ uvicorn app.main:app --host 0.0.0.0 --port 8006 --reload
 1.  在 `app/services/` 下创建新的 Service 类 (如 `NewTTSService`)。
 2.  实现 `generate` 方法，可以使用 subprocess 调用，也可以是 HTTP 请求。
 3.  **重要**: 如果模型占用 GPU，请务必使用 `asyncio.Lock` 进行并发控制，防止 OOM。
-4.  在 `app/api/` 中添加对应的路由调用。
+4.  在 `app/modules/` 下创建对应模块，添加 router/service/schemas，并在 `main.py` 注册路由。

 ### 添加定时任务

--- a/Docs/DEPLOY_MANUAL.md
+++ b/Docs/DEPLOY_MANUAL.md
@@ -28,11 +28,17 @@ node --version
 # 检查 FFmpeg
 ffmpeg -version

-# 检查 pm2 (用于服务管理)
-pm2 --version
-
-# 检查 Redis (任务状态存储，推荐)
-redis-server --version
+# 检查 Chrome (视频号发布)
+google-chrome --version
+
+# 检查 Xvfb
+xvfb-run --help
+
+# 检查 pm2 (用于服务管理)
+pm2 --version
+
+# 检查 Redis (任务状态存储，推荐)
+redis-server --version
 ```

 如果缺少依赖:
@@ -40,8 +46,17 @@ redis-server --version
 sudo apt update
 sudo apt install ffmpeg

+# 安装 Xvfb (视频号发布)
+sudo apt install xvfb
+
 # 安装 pm2
 npm install -g pm2
+
+# 安装 Chrome (视频号发布)
+wget -q -O - https://dl.google.com/linux/linux_signing_key.pub | sudo gpg --dearmor -o /usr/share/keyrings/google-linux-signing-keyring.gpg
+printf "deb [arch=amd64 signed-by=/usr/share/keyrings/google-linux-signing-keyring.gpg] http://dl.google.com/linux/chrome/deb/ stable main\n" | sudo tee /etc/apt/sources.list.d/google-chrome.list > /dev/null
+sudo apt update
+sudo apt install -y google-chrome-stable
 ```

 ---
@@ -99,6 +114,18 @@ pip install -r requirements.txt
 playwright install chromium
 ```

+> 提示：视频号发布建议使用系统 Chrome + xvfb-run（避免 headless 解码失败）。
+> 抖音发布同样建议 headful 模式 (`DOUYIN_HEADLESS_MODE=headful`)。
+
+### 扫码登录注意事项
+
+- **Cookie 按用户隔离**：每个用户的 Cookie 存储在 `backend/user_data/{uuid}/cookies/` 目录下，多用户并发登录互不干扰。
+- **抖音 QR 登录关键教训**：
+  - 扫码后绝对**不能重新加载 QR 页面**，否则会销毁会话 token
+  - 使用**新标签页**检测登录完成状态（检查 URL 包含 `creator-micro` + session cookies 存在）
+  - 抖音可能弹出**刷脸验证**，后端会自动提取验证二维码返回给前端展示
+- **微信视频号发布**：标题、描述、标签统一写入"视频描述"字段
+
 ---

 ### 可选：AI 标题/标签生成
@@ -138,6 +165,8 @@ playwright install chromium
    CREATE POLICY "Allow public read" ON storage.objects FOR SELECT TO anon USING (bucket_id = 'materials' OR bucket_id = 'outputs');
    EOF
    ```
+
+> **注意**：后端启动时会自动创建额外的存储桶（`ref-audios`、`generated-audios`），无需手动创建。
 
 ---
 
@@ -161,9 +190,29 @@ cp .env.example .env
 | `LATENTSYNC_GPU_ID` | 1 | GPU 选择 (0 或 1) |
 | `LATENTSYNC_USE_SERVER` | false | 设为 true 以启用常驻服务加速 |
 | `LATENTSYNC_INFERENCE_STEPS` | 20 | 推理步数 (20-50) |
-| `LATENTSYNC_GUIDANCE_SCALE` | 1.5 | 引导系数 (1.0-3.0) |
-| `DEBUG` | true | 生产环境改为 false |
-| `REDIS_URL` | `redis://localhost:6379/0` | 任务状态存储（不可用时回退内存） |
+| `LATENTSYNC_GUIDANCE_SCALE` | 1.5 | 引导系数 (1.0-3.0) |
+| `DEBUG` | true | 生产环境改为 false |
+| `REDIS_URL` | `redis://localhost:6379/0` | 任务状态存储（不可用时回退内存） |
+| `WEIXIN_HEADLESS_MODE` | headless-new | 视频号 Playwright 模式 (headful/headless-new) |
+| `WEIXIN_CHROME_PATH` | `/usr/bin/google-chrome` | 系统 Chrome 路径 |
+| `WEIXIN_BROWSER_CHANNEL` |  | Chromium 通道 (可选) |
+| `WEIXIN_USER_AGENT` | Chrome 120 UA | 视频号浏览器指纹 UA |
+| `WEIXIN_LOCALE` | zh-CN | 视频号语言环境 |
+| `WEIXIN_TIMEZONE_ID` | Asia/Shanghai | 视频号时区 |
+| `WEIXIN_FORCE_SWIFTSHADER` | true | 强制软件 WebGL，避免 context lost |
+| `WEIXIN_TRANSCODE_MODE` | reencode | 上传前转码 (reencode/faststart/off) |
+| `DOUYIN_HEADLESS_MODE` | headless-new | 抖音 Playwright 模式 (headful/headless-new) |
+| `DOUYIN_CHROME_PATH` | `/usr/bin/google-chrome` | 抖音 Chrome 路径 |
+| `DOUYIN_BROWSER_CHANNEL` |  | 抖音 Chromium 通道 (可选) |
+| `DOUYIN_USER_AGENT` | Chrome/144 UA | 抖音浏览器指纹 UA |
+| `DOUYIN_LOCALE` | zh-CN | 抖音语言环境 |
+| `DOUYIN_TIMEZONE_ID` | Asia/Shanghai | 抖音时区 |
+| `DOUYIN_FORCE_SWIFTSHADER` | true | 强制软件 WebGL |
+| `DOUYIN_DEBUG_ARTIFACTS` | false | 保留调试截图 |
+| `DOUYIN_RECORD_VIDEO` | false | 录制浏览器操作视频 |
+| `DOUYIN_KEEP_SUCCESS_VIDEO` | false | 成功后保留录屏 |
+| `CORS_ORIGINS` | `*` | CORS 允许源 (生产环境建议白名单) |
+| `DOUYIN_COOKIE` | 空 | 抖音视频下载 Cookie (文案提取功能) |

 ---

@@ -193,6 +242,12 @@ source venv/bin/activate
 uvicorn app.main:app --host 0.0.0.0 --port 8006
 ```

+推荐使用项目脚本启动后端（已内置 xvfb + headful 发布环境）：
+```bash
+cd /home/rongye/ProgramFiles/ViGent2
+./run_backend.sh  # 默认 8006，可用 PORT 覆盖
+```
+
 ### 启动前端 (终端 2)

 ```bash
@@ -227,9 +282,19 @@ python -m scripts.server
 1. 创建启动脚本 `run_backend.sh`:
 ```bash
 cat > run_backend.sh << 'EOF'
-#!/bin/bash
-cd /home/rongye/ProgramFiles/ViGent2/backend
-./venv/bin/uvicorn app.main:app --host 0.0.0.0 --port 8006
+#!/usr/bin/env bash
+set -e
+BASE_DIR="$(cd "$(dirname "$0")" && pwd)"
+export WEIXIN_HEADLESS_MODE=headful
+export DOUYIN_HEADLESS_MODE=headful
+export WEIXIN_DEBUG_ARTIFACTS=false
+export WEIXIN_RECORD_VIDEO=false
+export DOUYIN_DEBUG_ARTIFACTS=false
+export DOUYIN_RECORD_VIDEO=false
+PORT=${PORT:-8006}
+cd "$BASE_DIR/backend"
+exec xvfb-run --auto-servernum --server-args="-screen 0 1920x1080x24" \
+  ./venv/bin/uvicorn app.main:app --host 0.0.0.0 --port "$PORT"
 EOF
 chmod +x run_backend.sh
 ```
@@ -507,6 +572,7 @@ pm2 logs vigent2-qwen-tts
 | `next` | React 框架 |
 | `swr` | 数据请求与缓存 |
 | `tailwindcss` | CSS 样式 |
+| `wavesurfer.js` | 音频波形（时间轴编辑器） |

 ### LatentSync 关键依赖

--- a/Docs/DevLogs/Day12.md
+++ b/Docs/DevLogs/Day12.md
@@ -342,6 +342,6 @@ models/Qwen3-TTS/

 ## 🔗 相关文档

- [task_complete.md](../task_complete.md) - 任务总览
+- [TASK_COMPLETE.md](../TASK_COMPLETE.md) - 任务总览
 - [Day11.md](./Day11.md) - 上传架构重构
 - [QWEN3_TTS_DEPLOY.md](../QWEN3_TTS_DEPLOY.md) - Qwen3-TTS 部署指南
--- a/Docs/DevLogs/Day13.md
+++ b/Docs/DevLogs/Day13.md
@@ -273,7 +273,7 @@ pm2 logs vigent2-qwen-tts --lines 50

 ## 🔗 相关文档

- [task_complete.md](../task_complete.md) - 任务总览
+- [TASK_COMPLETE.md](../TASK_COMPLETE.md) - 任务总览
 - [Day12.md](./Day12.md) - iOS 兼容与 Qwen3-TTS 部署
 - [QWEN3_TTS_DEPLOY.md](../QWEN3_TTS_DEPLOY.md) - Qwen3-TTS 部署指南
 - [SUBTITLE_DEPLOY.md](../SUBTITLE_DEPLOY.md) - 字幕功能部署指南
--- a/Docs/DevLogs/Day14.md
+++ b/Docs/DevLogs/Day14.md
@@ -397,6 +397,6 @@ if ((status === 401 || status === 403) && !isRedirecting && !isPublicPath) {

 ## 🔗 相关文档

- [task_complete.md](../task_complete.md) - 任务总览
+- [TASK_COMPLETE.md](../TASK_COMPLETE.md) - 任务总览
 - [Day13.md](./Day13.md) - 声音克隆功能集成 + 字幕功能
 - [QWEN3_TTS_DEPLOY.md](../QWEN3_TTS_DEPLOY.md) - Qwen3-TTS 1.7B 部署指南
--- a/Docs/DevLogs/Day15.md
+++ b/Docs/DevLogs/Day15.md
@@ -342,7 +342,7 @@ pm2 restart vigent2-backend vigent2-frontend

 ## 🔗 相关文档

- [task_complete.md](../task_complete.md) - 任务总览
+- [TASK_COMPLETE.md](../TASK_COMPLETE.md) - 任务总览
 - [Day14.md](./Day14.md) - 模型升级 + AI 标题标签
 - [AUTH_DEPLOY.md](../AUTH_DEPLOY.md) - 认证系统部署指南

--- a/Docs/DevLogs/Day16.md
+++ b/Docs/DevLogs/Day16.md
@@ -136,4 +136,4 @@ if service["failures"] >= service['threshold']:

 - [x] `Docs/QWEN3_TTS_DEPLOY.md`: 添加 Flash Attention 安装指南
 - [x] `Docs/DEPLOY_MANUAL.md`: 添加 Watchdog 部署说明
- [x] `Docs/task_complete.md`: 更新进度至 100% (Day 16)
+- [x] `Docs/TASK_COMPLETE.md`: 更新进度至 100% (Day 16)
--- a/Docs/DevLogs/Day18.md
+++ b/Docs/DevLogs/Day18.md
@@ -107,3 +107,62 @@
 - `frontend/src/components/VideoPreviewModal.tsx`
 - `frontend/src/features/home/ui/PreviewPanel.tsx`
 - `frontend/src/features/publish/ui/PublishPage.tsx`
+
+---
+
+## 📹 微信视频号发布接入 (16:30)
+
+### 内容
+- 新增视频号上传器 `WeixinUploader`，打通上传/标题/简介/标签/发布流程
+- 视频号扫码登录配置完善（iframe 扫码、候选二维码过滤）
+- 发布平台与路由接入视频号
+- 中文错误提示 + 关键节点截图保存到 `debug_screenshots`
+
+### 涉及文件
+- `backend/app/services/uploader/weixin_uploader.py`
+- `backend/app/services/qr_login_service.py`
+- `backend/app/services/publish_service.py`
+- `backend/app/modules/publish/router.py`
+- `backend/app/modules/login_helper/router.py`
+
+---
+
+## 🧪 视频号上传稳定性修复 (17:40)
+
+### 内容
+- 统一浏览器指纹（UA/locale/timezone）并支持系统 Chrome
+- 增加 headful + xvfb-run 运行方案，避免 headless 检测与解码失败
+- 强制 SwiftShader，修复 WebGL context loss
+- 上传前转码为兼容 MP4（H.264 + AAC + faststart）
+- 增强上传状态判断与调试日志 `weixin_network.log`
+
+### 涉及文件
+- `backend/app/core/config.py`
+- `backend/app/services/uploader/weixin_uploader.py`
+- `backend/app/services/qr_login_service.py`
+- `run_backend.sh`
+
+---
+
+## 🧾 发布诊断增强 (18:10)
+
+### 内容
+- 抖音发布新增网络日志与失败截图，便于定位上传/发布失败
+- 视频号上传失败截图与网络日志落盘
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+- `backend/app/services/uploader/weixin_uploader.py`
+- `backend/app/debug_screenshots/*`
+
+---
+
+## 🧩 发布页交互调整 (18:20)
+
+### 内容
+- 未选择平台时禁用发布按钮
+- 移除定时发布 UI/参数，仅保留立即发布
+
+### 涉及文件
+- `frontend/src/features/publish/ui/PublishPage.tsx`
+- `frontend/src/features/publish/model/usePublishController.ts`
--- a/Docs/DevLogs/Day19.md
+++ b/Docs/DevLogs/Day19.md
@@ -0,0 +1,485 @@
+## 🛡️ 发布中防误刷新（15:46，合并）
+
+### 内容
+- 发布按钮文案统一为：`正在发布...请勿刷新或关闭网页`
+- 发布中启用浏览器 `beforeunload` 拦截，刷新/关闭页面会触发原生二次确认
+- 适用于发布管理页全部平台（抖音 / 微信视频号 / B站 / 小红书）
+- 后续优化已登记：发布任务状态恢复机制（任务化 + 状态持久化 + 前端轮询恢复）
+
+### 涉及文件
+- `frontend/src/features/publish/model/usePublishController.ts`
+- `frontend/src/features/publish/ui/PublishPage.tsx`
+
+---
+
+## 🖼️ 发布成功截图稳定性优化（15:26，合并）
+
+### 内容
+- 成功判定后先等待页面加载，再额外等待 `3s` 后截图，避免抓到半加载页面
+- 针对“截图里页面内容只占 1/3”问题，成功截图从 `full_page=True` 调整为视口截图 `full_page=False`
+- 视频号成功截图前额外恢复 `zoom=1.0`，避免流程缩放影响最终截图比例
+- 抖音成功截图同步应用相同策略，统一前端展示观感
+
+### 涉及文件
+- `backend/app/services/uploader/weixin_uploader.py`
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## 🧪 视频号录屏 Debug 开关（15:12，已回收）
+
+### 内容
+- 为视频号上传器新增 Playwright 录屏能力，开关受 `WEIXIN_DEBUG_ARTIFACTS && WEIXIN_RECORD_VIDEO` 控制
+- 新增视频号录屏配置项：
+  - `WEIXIN_RECORD_VIDEO`
+  - `WEIXIN_KEEP_SUCCESS_VIDEO`
+  - `WEIXIN_RECORD_VIDEO_WIDTH`
+  - `WEIXIN_RECORD_VIDEO_HEIGHT`
+- 上传流程在 `finally` 中统一保存录屏，失败必保留；成功录屏默认按开关清理
+- 排障阶段临时开启过视频号 debug/录屏；当前已回收为默认关闭（`run_backend.sh` 设为 `false`）
+
+### 涉及文件
+- `backend/app/services/uploader/weixin_uploader.py`
+- `backend/app/core/config.py`
+- `run_backend.sh`
+- `Docs/DEPLOY_MANUAL.md`
+
+---
+
+## 🔁 后端启动脚本统一为 run_backend.sh (15:00)
+
+### 内容
+- 删除旧脚本 `run_backend_xvfb.sh`
+- 将 `run_backend.sh` 统一为 xvfb + headful 启动逻辑（不再保留非 xvfb 版本）
+- 默认端口从 `8010` 统一为 `8006`
+- 启动脚本默认关闭微信/抖音 debug 产物
+- 更新部署手册中的启动与 pm2 示例，统一使用 `run_backend.sh`
+
+### 涉及文件
+- `run_backend.sh`
+- `run_backend_xvfb.sh` (deleted)
+- `Docs/DEPLOY_MANUAL.md`
+
+---
+
+## 🧾 视频号卡顿与文案未写入修复 (14:52)
+
+### 内容
+- 复盘日志确认视频号 `post_create` 请求已成功，但结果判定仅靠页面文案，导致长时间“等待发布结果”
+- 发布判定优化：`post_create` 成功且页面进入 `post/list` 时立即判定成功
+- 发布超时改为失败返回（不再 `success=true` 假成功）
+- “标题+标签写在视频描述”进一步加强：先按 `视频描述` 标签定位输入框，再做 placeholder 与 contenteditable 兜底
+- 视频号发布结果等待超时从 `180s` 收敛到 `90s`
+
+### 涉及文件
+- `backend/app/services/uploader/weixin_uploader.py`
+
+---
+
+## 🚦 视频号发布卡顿根因与快速判定 (14:45)
+
+### 内容
+- 定位到卡顿根因是实际请求已提交（`post_create` 成功）但结果判定仍在轮询文本提示，导致长时间等待
+- 新增发布成功网络信号：监听 `post/post_create` 成功响应后标记已提交
+- 若已提交且页面已回到内容列表（`/post/list`），立即判定发布成功，不再等满超时
+- 新增发布接口失败信号：`post_create` 返回错误时立即失败返回
+
+### 涉及文件
+- `backend/app/services/uploader/weixin_uploader.py`
+
+---
+
+## 📸 视频号发布成功截图接入前端 (13:34)
+
+### 内容
+- 为微信视频号新增“发布成功截图”能力：发布成功后直接对当前成功页截图
+- 截图存储沿用私有隔离目录：`private_outputs/publish_screenshots/{user_id}`
+- 返回前端的 `screenshot_url` 使用鉴权接口：`/api/publish/screenshot/{filename}`
+- 视频号上传器新增 `user_id` 透传，确保截图按用户隔离
+
+### 涉及文件
+- `backend/app/services/uploader/weixin_uploader.py`
+- `backend/app/services/publish_service.py`
+
+---
+
+## ✍️ 视频号描述填充修正 + 关闭调试产物 (13:26)
+
+### 内容
+- 按最新规则调整视频号文案填充：标题和标签统一写入“视频描述”输入区
+- 标签统一规范为 `#标签` 形式并去重
+- 若未找到“视频描述”输入区，直接返回失败，避免“发布成功但标题/标签为空”
+- 关闭视频号 debug 产物：新增 `WEIXIN_DEBUG_ARTIFACTS=false`，禁用调试日志与截图输出
+- `run_backend.sh` 增加 `WEIXIN_DEBUG_ARTIFACTS=false`，启动脚本层面强制关闭
+
+### 涉及文件
+- `backend/app/services/uploader/weixin_uploader.py`
+- `backend/app/core/config.py`
+- `run_backend.sh`
+
+---
+
+## 🚫 强制关闭抖音调试产物 (13:15)
+
+### 内容
+- 进一步收紧为“默认不生成任何抖音 debug 截屏/日志/录屏”
+- 录屏开关改为依赖 `DOUYIN_DEBUG_ARTIFACTS && DOUYIN_RECORD_VIDEO`，避免单独误开
+- `run_backend.sh` 增加环境变量强制关闭：
+  - `DOUYIN_DEBUG_ARTIFACTS=false`
+  - `DOUYIN_RECORD_VIDEO=false`
+- 仅保留给用户看的发布成功截图（私有目录 + 鉴权访问）
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+- `backend/app/core/config.py`
+- `run_backend.sh`
+
+---
+
+## 🧹 关闭调试截屏/录屏并清理历史文件 (13:08)
+
+### 内容
+- 抖音调试产物默认关闭：
+  - `DOUYIN_DEBUG_ARTIFACTS=false`
+  - `DOUYIN_RECORD_VIDEO=false`
+- 保留功能信号监听（上传提交/封面生成/发布接口状态）用于流程判断，不依赖调试文件
+- 已删除现有抖音调试文件（`debug_screenshots` 下的 `douyin_*` 截图、日志与失败录屏）
+- 继续保留并展示“给用户看的发布成功截图”（用户隔离 + 鉴权访问）
+
+### 涉及文件
+- `backend/app/core/config.py`
+- `backend/app/services/uploader/douyin_uploader.py`
+- `backend/app/debug_screenshots/douyin_*` (deleted)
+- `backend/app/debug_screenshots/videos/douyin_*` (deleted)
+
+---
+
+## 🔒 成功截图用户隔离 (12:58)
+
+### 内容
+- 发布成功截图改为用户隔离存储，不再写入公开静态目录
+- 存储目录迁移到私有路径：`private_outputs/publish_screenshots/{user_id}`
+- 新增鉴权访问接口：`GET /api/publish/screenshot/{filename}`（必须登录，仅可访问本人截图）
+- 返回给前端的 `screenshot_url` 改为鉴权接口地址，避免跨用户直接猜路径访问
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+- `backend/app/services/publish_service.py`
+- `backend/app/modules/publish/router.py`
+- `backend/app/core/config.py`
+
+---
+
+## 🎯 封面触发提速与审核中截图强化 (12:49)
+
+### 内容
+- 修复“上传完成后长时间不进入封面”：当出现 `重新上传+预览` 且已收到视频提交信号时，立即进入封面步骤
+- 目标是减少“处理中”文案残留导致的额外等待
+- 成功截图逻辑强化为优先“真实点击审核中标签”，新增文本点击兜底，不再只用可见即通过
+- 若审核中列表未马上出现标题，自动刷新并再次进入审核中重查后再截图
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## 🔐 登录态识别增强（避免误报上传失败） (12:41)
+
+### 内容
+- 针对“未触发文件选择弹窗”误报，新增登录页识别：
+  - URL 关键字：`passport/login/check_qrconnect/sso`
+  - 页面文本：`扫码登录/验证码登录/立即登录/抖音APP扫码登录` 等
+  - 登录控件：手机号/验证码输入框、登录按钮
+- 上传阶段重试后若识别为登录页，直接返回 `Cookie 已失效，请重新登录`
+- 避免把“实际掉登录”误判成“上传入口失效”
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## ⏱️ 发布阶段超时与网络不佳快速失败 (12:30)
+
+### 内容
+- 针对“网络不佳后长时间卡住”增加发布阶段快速失败
+- 上传完成后到发布结果设置总超时 `60s`（`POST_UPLOAD_STAGE_TIMEOUT`），超过直接失败
+- 识别发布接口 `create_v2` 的 HTTP 错误（如 403）并立即返回失败，不再等待 180 秒
+- 发布结果判定新增网络类失败文案匹配（`网络不佳/网络异常/请稍后重试`）
+- 阻塞弹窗关闭策略新增 `暂不设置`，避免“设置横封面获更多流量”弹窗阻塞点击发布
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## 🧯 封面已完成但误判失败修复 (12:22)
+
+### 内容
+- 针对报错“封面为必填但未设置成功”新增页面态兜底，避免封面已完成却未点击发布
+- 新增 `_is_cover_configured_on_page()`：通过 `横封面/竖封面` + 封面预览图判断页面已配置封面
+- 当出现 `horizontal_switch_missed` 或 `no_cover_button` 时，若页面已配置封面则允许继续发布
+- 封面必填主流程增加 `configured_fallback_continue` 兜底，降低误杀
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## 🧾 成功截图切到审核中视图 (11:26)
+
+### 内容
+- 按需求将“发布成功截图”改为内容管理 `审核中/待审核` 视图，不再截“全部作品”
+- 发布成功后先进入内容管理并点击 `审核中`（或 `待审核`）再截图
+- 截图前额外尝试等待当前标题出现在审核中列表，便于确认是最新发布作品
+- 发布超时兜底验证也改为优先在审核中列表查找标题
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## ✅ 封面步骤按指定顺序强约束 (11:18)
+
+### 内容
+- 按确认流程收紧旧发布页封面链路：
+  - 作品描述填完 → 点击 `选择封面` → 点击 `设置横封面` → 点击 `完成` → 等待封面效果检测通过 → 才允许发布
+- 新增 `require_horizontal` 约束：封面必填场景必须切换到横封面，否则直接失败重试
+- 新增封面效果检测通过等待：优先 `cover/gen` 新请求信号，其次页面“检测通过”文案
+- 避免因漏点 `设置横封面` 导致后续卡住或误发布
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## 🧩 横封面点击漏判修复 (11:10)
+
+### 内容
+- 根据复现反馈修复“未点击设置横封面导致封面流程卡住”问题
+- 新增 `_switch_to_horizontal_cover()`，扩展横封面入口选择器（`设置横封面/横封面/横版封面`）
+- 进入封面弹窗后先关闭阻塞弹窗再点击横封面，点击失败会重试一次
+- 若页面存在横封面入口但始终未切换成功，直接返回失败并重试，避免长时间假等待
+- 新增日志：`[douyin][cover] switched_horizontal ...`、`horizontal_switch_missed`
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## ⚡ 横封面后直接完成优化 (11:03)
+
+### 内容
+- 根据实测反馈，在点击 `设置横封面` 后新增一次“立即点击完成”快速路径
+- 若平台已自动选中横封面，将直接确认并退出弹窗，不再执行后续封面扫描
+- 新增日志：`[douyin][cover] fast_confirm_after_switch ...`
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## ⚙️ 封面步骤提速优化 (10:58)
+
+### 内容
+- 复盘日志确认旧发布页封面步骤存在明显耗时（示例：`required_by_text` 到 `cover selected` 约 35 秒）
+- 新增封面“快速确认”路径：若平台已默认选中封面，直接确认并跳过多余扫描
+- 收紧封面成功条件：仅“确认按钮点击成功”才算封面设置成功，避免误判
+- 缩短不必要等待并新增封面耗时日志：`[douyin][cover] fast_confirm/selected=... confirmed=... elapsed=...`
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## 🧾 发布成功截图前台展示 (10:48)
+
+### 内容
+- 按需求删除 `run_backend_xvfb_live.sh`，不再提供实时直播脚本
+- 抖音发布成功时自动保存成功截图到 `outputs/publish_screenshots`
+- 发布接口返回 `screenshot_url`，前端发布结果卡片直接展示截图并支持点击查看大图
+- 发布结果不再 10 秒自动清空，方便用户确认“是否真正发布成功”
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+- `frontend/src/features/publish/model/usePublishController.ts`
+- `frontend/src/features/publish/ui/PublishPage.tsx`
+- `run_backend_xvfb_live.sh` (deleted)
+
+---
+
+## 🧬 抖音界面差异根因与环境对齐 (10:20)
+
+### 内容
+- 定位到 Playwright 与手动 Win11 Chrome 的环境指纹不一致（Linux 平台 + 自动化上下文），可能触发不同灰度界面
+- 抖音上传器新增独立浏览器配置项，不再复用 `WEIXIN_*` 配置
+- 新增 `DOUYIN_*` 配置：`HEADLESS_MODE/USER_AGENT/LOCALE/TIMEZONE_ID/CHROME_PATH/BROWSER_CHANNEL/FORCE_SWIFTSHADER`
+- 上传器启动改为 `_build_launch_options()`，可直接切换到系统 Chrome + headful（推荐配合 xvfb）
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+- `backend/app/core/config.py`
+
+---
+
+## 🪄 新旧发布页封面逻辑分流 (10:28)
+
+### 内容
+- 依据页面结构自动分流：
+  - 新版发布页（封面非必填）：默认跳过封面设置
+  - 旧版发布页（出现 `设置封面` + `必填`）：强制先设置封面
+- 新增 `_is_cover_required()` 判断，避免在新页面做多余封面操作
+- 若判定为非必填但点击发布失败，会回退尝试设置封面后再重试发布
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## 📺 虚拟屏实时观看方案 (10:36)
+
+### 内容
+- 新增 `run_backend_xvfb_live.sh`，在 Xvfb 下同时启动后端与实时画面转码
+- 通过 ffmpeg 抓取虚拟屏并输出 HLS：`/outputs/live/live.m3u8`
+- 适用于“边跑自动发布边实时观看”，不依赖 VNC
+- 默认仍保留失败录屏，HLS 用于过程实时观察
+
+### 涉及文件
+- `run_backend_xvfb_live.sh`
+
+---
+
+## 🎥 抖音后台录屏能力 (09:55)
+
+### 内容
+- 新增抖音自动发布过程录屏能力，便于定位“卡住在哪一步”
+- 录屏文件保存目录：`backend/app/debug_screenshots/videos`
+- 默认开启录屏，默认只保留失败录屏（成功录屏自动清理）
+- 每次执行会在网络日志追加录屏保存记录（`[douyin][record]`）
+- 增加发布阶段关键标记日志：`publish_wait ready`、`publish_click try/clicked`
+- 新增配置项：`DOUYIN_RECORD_VIDEO`、`DOUYIN_KEEP_SUCCESS_VIDEO`、`DOUYIN_RECORD_VIDEO_WIDTH`、`DOUYIN_RECORD_VIDEO_HEIGHT`
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+- `backend/app/core/config.py`
+
+---
+
+## 🚀 发布按钮等待逻辑修正 (10:00)
+
+### 内容
+- 根据线上反馈，发布页不再做冗长前置等待，改为“尽快尝试点击发布”
+- 新增发布按钮定位策略（role + text 多选择器），避免 `exact role` 匹配失败导致假等待
+- 将发布按钮等待上限从上传超时（300s）独立为 `PUBLISH_BUTTON_TIMEOUT=60s`
+- 点击发布阶段统一走 `_click_publish_button`，并持续记录 `publish_wait/publish_click` 日志
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## 🧪 上传完成特征判定增强 (10:07)
+
+### 内容
+- 基于实测页面特征补齐“上传中/上传完成”判定：
+  - 上传中：`上传过程中请不要刷新`、`取消上传`、`已上传/当前速度/剩余时间`
+  - 上传完成：`重新上传` + `预览视频/预览封面/标题`
+- 仅在确认上传完成后才允许执行发布点击，避免“未传完提前发布”
+- 新增上传等待日志：`[douyin][upload_wait] ...`，可直观看到卡在上传中还是等完成信号
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## ⏸️ 上传完成后延时发布 (10:10)
+
+### 内容
+- 根据实测反馈，增加“上传完成后固定等待 2 秒”再点发布
+- 避免刚出现完成信号就立即点击，给前端状态收敛留缓冲
+- 新增日志标记：`[douyin][upload_ready] wait_before_publish=2s`
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## 🖼️ 恢复封面设置流程 (10:14)
+
+### 内容
+- 按实测需求恢复“上传完成后先设置封面，再发布”流程
+- 封面设置改为最多尝试 2 次，成功写入 `[douyin][cover] selected`
+- 若封面未设置成功则直接终止发布并保存截图 `cover_not_selected`
+- 避免出现“未设封面就点击发布”的情况
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## 🛠️ 抖音发布流程修复 (09:20)
+
+### 内容
+- 按最新页面流程改为先进入首页并点击 `高清发布`，再进入上传页
+- 新增未发布草稿处理：检测到 `你还有上次未发布的视频` 时自动点击 `放弃`
+- 上传策略改为优先点击 `上传视频` 并走 file chooser，失败后再回退多 input 选择器
+- 只有检测到 `基础信息/作品描述/发布设置/重新上传` 等发布态信号才继续，避免误判“已上传”
+- 修复无扩展名视频临时文件策略：优先 hardlink，失败时 copy，移除 symlink 回退
+- 适配当前智能封面流程：跳过手动封面操作
+- 话题填写改为在简介/描述区域使用 `#标签` 形式追加
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## ⚡ 抖音等待链路再收敛 (09:52)
+
+### 内容
+- 根据“选完视频即进入发布页”流程，移除独立的上传完成轮询阶段
+- 改为在点击发布前统一等待“发布按钮可点击”，避免重复等待导致总时长偏长
+- 新增 `publish_wait` 调试日志，按秒记录按钮可点击等待时长
+- 超时文案改为明确提示“发布按钮长时间不可点击”
+- 上传入口改为严格 file chooser 流程：只走“点击上传视频 → 选择文件 → 进入发布页”链路
+- 移除直接 input 回退上传，避免绕开上传入口导致状态机异常
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## 🧭 抖音卡慢环节定位与修复 (09:45)
+
+### 内容
+- 通过 `douyin_network.log` 定位到卡慢发生在“上传完成判定”阶段，而非真正提交发布接口
+- 新增上传完成网络信号：`CommitUploadInner` 成功与封面生成成功信号写入日志
+- 收紧“上传完成”判定，移除 `publish_button_enabled` 这种过早放行条件
+- 仅在检测到 `重新上传/重新选择` 或上传提交信号后才进入下一步，降低误判导致的长等待
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## ✅ 抖音发布结果判定修正 (09:38)
+
+### 内容
+- 修复“发布检测超时仍返回 success=true”的问题，超时场景改为 `success=false`
+- 优化超时返回文案，明确为“发布状态未知，需要后台确认”
+- 下线过于宽松的管理页兜底判定（仅出现 `审核中` 不再当作发布成功）
+- 超时时即使管理页出现同名标题也不直接判定成功，避免旧作品同名导致误报
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
+
+---
+
+## ⏱️ 抖音上传完成判定优化 (09:34)
+
+### 内容
+- 根据最新日志确认文件上传已开始并有分片上传请求成功，但流程长时间停留在“等待上传完成”
+- 扩展“上传完成”判定条件，不再只依赖单一 `long-card + 重新上传` 选择器
+- 新增上传完成信号：`重新上传/重新选择` 可见、发布按钮可用、`发布设置` 或 `预览视频` 可见
+- 上传等待日志增加耗时秒数，便于判断是否真实卡住
+
+### 涉及文件
+- `backend/app/services/uploader/douyin_uploader.py`
--- a/Docs/DevLogs/Day20.md
+++ b/Docs/DevLogs/Day20.md
@@ -0,0 +1,103 @@
+## 🔧 代码质量与安全优化 (13:30)
+
+### 概述
+本日进行项目全面代码审查与优化，共处理 27 项优化点，完成 18 项核心修复。
+
+### 已完成优化
+
+#### 功能性修复
+- [x] **P0-1**: LatentSync 回退逻辑空实现 → 改为 `raise RuntimeError`
+- [x] **P1-1**: 任务状态接口无用户归属校验 → 添加用户认证依赖
+- [x] **P1-2**: 前端 User 类型定义重复 → 统一到 `shared/types/user.ts`
+
+#### 性能优化
+- [x] **P1-3**: 参考音频列表 N+1 查询 → 使用 `asyncio.gather` 并发
+- [x] **P1-4**: 视频上传整读内存 → 新增 `upload_file_from_path` 流式处理
+- [x] **P1-5**: async 路由内同步阻塞 → `httpx.AsyncClient` 替换 `requests`
+- [x] **P2-2**: GLM 服务同步调用 → `asyncio.to_thread` 包装
+- [x] **P2-3**: Remotion 渲染启动慢 → 预编译 JS + `build:render` 脚本
+
+#### 安全修复
+- [x] **P1-8**: 硬编码 Cookie → 移至环境变量 `DOUYIN_COOKIE`
+- [x] **P1-9**: 请求日志打印完整 headers → 敏感信息脱敏
+- [x] **P2-10**: ffprobe 使用 `shell=True` → 改为参数列表
+- [x] **P2-11**: CORS 配置 `*` + credentials → 从 `CORS_ORIGINS` 环境变量读取
+
+#### 配置优化
+- [x] **P2-5**: 存储服务硬编码路径 → 环境变量 `SUPABASE_STORAGE_LOCAL_PATH`
+- [x] **P3-3**: Remotion `execSync` 同步调用 → promisified `exec` 异步
+- [x] **P3-5**: LatentSync 相对路径 → 基于 `__file__` 绝对路径
+
+### 暂不处理（收益有限）
+- [~] **P1-6**: useHomeController 超大文件 (884行)
+- [~] **P1-7**: 抖音/微信上传器重复代码（流程差异大）
+
+### 低优先级（后续处理）
+- [~] **P2-6~P2-9**: API 转发壳、前端 API 客户端混用、ESLint、重复逻辑
+- [~] **P3-1~P3-4**: 阻塞式交互、Modal 过大、样式兼容层
+
+### 涉及文件
+- `backend/app/services/latentsync_service.py` - 回退逻辑
+- `backend/app/modules/videos/router.py` - 任务状态认证
+- `backend/app/modules/tools/router.py` - httpx 异步、Cookie 配置化
+- `backend/app/services/glm_service.py` - 异步包装
+- `backend/app/services/storage.py` - 流式上传、路径配置化
+- `backend/app/services/video_service.py` - ffprobe 安全调用
+- `backend/app/main.py` - CORS 配置、日志脱敏
+- `backend/app/core/config.py` - 新增配置项
+- `remotion/render.ts` - 异步 exec
+- `remotion/package.json` - build:render 脚本
+- `models/LatentSync/scripts/server.py` - 绝对路径
+- `frontend/src/shared/types/user.ts` - 统一类型定义
+
+### 新增环境变量
+```bash
+# .env 新增配置（均有默认值，无需必填）
+CORS_ORIGINS=*                              # CORS 白名单
+SUPABASE_STORAGE_LOCAL_PATH=/path/to/...    # 本地存储路径
+DOUYIN_COOKIE=...                           # 抖音视频下载 Cookie
+```
+
+### 重启要求
+```bash
+pm2 restart vigent2-backend
+pm2 restart vigent2-latentsync
+# Remotion 已自动编译
+```
+
+### 🎨 交互与体验优化 (17:00)
+
+- [x] **UX-1**: PublishPage 图片加载优化 (`<img>` → `next/image`)
+- [x] **UX-2**: 按钮 Loading 状态统一 (提取脚本弹窗 + 发布页)
+- [x] **UX-3**: 骨架屏加载优化 (发布页加载中状态)
+- [x] **UX-4**: 全局快捷键支持 (ESC 关闭弹窗, Enter 确认)
+- [x] **UX-5**: 移除全局 GlobalTaskIndicator (视觉降噪)
+- [x] **UX-6**: 视频生成完成自动刷新列表并选中最新
+
+### 🐛 缺陷修复与回归治理 (17:30)
+
+#### 严重缺陷修复
+- [x] **BUG-1**: Remotion 渲染脚本路径解析错误 (导致标题字幕丢失)
+    - *原因*: `render.js` 预编译后使用了 `__dirname`，在 `dist` 目录下寻找源码失败。
+    - *修复*: 修改 `render.ts` 使用 `process.cwd()` 动态解析路径，并重新编译。
+
+- [x] **BUG-2**: 发布页视频选择持久化失效 (Auth 异步竞态)
+    - *原因*: 页面加载时 `useAuth` 尚未返回用户 ID，导致使用 `guest` Key 读取不到记录，随后被默认值覆盖。
+    - *修复*: 引入 `isVideoRestored` 状态机，强制等待 Auth 完成且 Video 列表加载完毕后，才执行恢复逻辑。
+
+#### 回归问题治理
+- [x] **REG-1**: 首页历史作品 ID 恢复后内容不显示
+    - *原因*: 持久化模块恢复了 ID，但 `useGeneratedVideos` 未监听 ID 变化同步 URL。
+    - *修复*: 新增 `useEffect` 监听 `selectedVideoId` 变化并同步 `generatedVideo` URL。
+
+- [x] **REG-2**: 首页/发布页“默认选中第一个”逻辑丢失
+    - *原因*: 重构移除旧逻辑后，新用户或无缓存用户进入页面无默认选中。
+    - *修复*: 在 `isRestored` 且无选中时，增加兜底逻辑自动选中列表第一项。
+
+- [x] **REG-3**: 素材选择持久化失效 (闭包陷阱)
+    - *原因*: `useMaterials` 加载回调中捕获了旧的 `selectedMaterial` 状态，覆盖了已恢复的值。
+    - *修复*: 改为函数式状态更新 (`setState(prev => ...)`)，确保基于最新状态判断。
+
+- [x] **REF-1**: 持久化逻辑全站收敛与排查
+    - *优化*: 清理 `useBgm`, `useGeneratedVideos`, `useTitleSubtitleStyles` 中的冗余 `localStorage` 读取，统一由 `useHomePersistence` 管理。
+    - *排查*: 深度排查 `useRefAudios`, `useTitleSubtitleStyles` 等模块，确认逻辑健壮，无类似回归风险。
--- a/Docs/DevLogs/Day21.md
+++ b/Docs/DevLogs/Day21.md
@@ -0,0 +1,449 @@
+## 🐛 缺陷修复：视频生成与持久化回归 (Day 21)
+
+### 概述
+本日修复 Day 20 优化后引入的 3 个回归缺陷：Remotion 渲染崩溃容错、首页作品选择持久化、发布页作品选择持久化。
+
+---
+
+### 已完成修复
+
+#### BUG-1: Remotion 渲染进程崩溃导致标题/字幕丢失
+- **现象**: 视频生成后没有标题和字幕，回退到纯 FFmpeg 合成。
+- **根因**: Remotion Node.js 进程在渲染完成（100%）后以 SIGABRT (code -6) 退出，Python 端将其视为失败。
+- **修复**: `remotion_service.py` 在进程非零退出时，先检查输出文件是否存在且大小合理（>1KB），若存在则视为成功。
+- **文件**: `backend/app/services/remotion_service.py`
+
+```python
+if process.returncode != 0:
+    output_file = Path(output_path)
+    if output_file.exists() and output_file.stat().st_size > 1024:
+        logger.warning(
+            f"Remotion process exited with code {process.returncode}, "
+            f"but output file exists ({output_file.stat().st_size} bytes). Treating as success."
+        )
+        return output_path
+    raise RuntimeError(...)
+```
+
+#### BUG-2: 首页历史作品选择刷新后不保持
+- **现象**: 用户选择某个历史作品后刷新页面，总是回到第一个视频。
+- **根因**: `fetchGeneratedVideos()` 在初始加载时无条件自动选中第一个视频，覆盖了 `useHomePersistence` 的恢复值。
+- **修复**: `fetchGeneratedVideos` 增加 `preferVideoId` 参数，仅在明确指定时才自动选中；新增 `"__latest__"` 哨兵值用于生成完成后选中最新。
+- **文件**: `frontend/src/features/home/model/useGeneratedVideos.ts`, `frontend/src/features/home/model/useHomeController.ts`
+
+```typescript
+// 任务完成 → 自动选中最新
+useEffect(() => {
+  if (prevIsGenerating.current && !isGenerating) {
+    if (currentTask?.status === "completed") {
+      void fetchGeneratedVideos("__latest__");
+    } else {
+      void fetchGeneratedVideos();
+    }
+  }
+  prevIsGenerating.current = isGenerating;
+}, [isGenerating, currentTask, fetchGeneratedVideos]);
+```
+
+#### BUG-3: 发布页作品选择刷新后不保持（根因：签名 URL 不稳定）
+- **现象**: 发布管理页选择视频后刷新，选择丢失（无任何视频被选中）。
+- **根因**: 后端 `/api/videos/generated` 返回的 `path` 是 Supabase 签名 URL，每次请求都会变化。发布页用 `path` 作为选择标识存入 localStorage，刷新后新的 `path` 与保存值永远不匹配。首页不受影响是因为使用稳定的 `video.id`。
+- **修复**: 发布页全面改用 `id`（稳定标识）替代 `path`（签名 URL）进行选择、持久化和比较。
+- **文件**:
+  - `frontend/src/shared/types/publish.ts` — `PublishVideo` 新增 `id` 字段
+  - `frontend/src/features/publish/model/usePublishController.ts` — `selectedVideo` 存储 `id`，发布时根据 `id` 查找 `path`
+  - `frontend/src/features/publish/ui/PublishPage.tsx` — `key`/`onClick`/选中比较改用 `v.id`
+  - `frontend/src/features/home/model/useHomeController.ts` — 预取缓存加入 `id` 字段
+
+```typescript
+// 类型定义新增 id
+export interface PublishVideo {
+    id: string;    // 稳定标识符
+    name: string;
+    path: string;  // 签名 URL（仅用于播放/发布）
+}
+
+// 发布时根据 id 查找 path
+const video = videos.find(v => v.id === selectedVideo);
+await api.post('/api/publish', { video_path: video.path, ... });
+```
+
+---
+
+### 涉及文件汇总
+
+| 文件 | 变更 |
+|------|------|
+| `backend/app/services/remotion_service.py` | Remotion 崩溃容错 |
+| `frontend/src/features/home/model/useGeneratedVideos.ts` | 首页视频选择不自动覆盖 |
+| `frontend/src/features/home/model/useHomeController.ts` | 任务完成监听 + 预取缓存加 id |
+| `frontend/src/shared/types/publish.ts` | PublishVideo 新增 id 字段 |
+| `frontend/src/features/publish/model/usePublishController.ts` | 选择/持久化/发布改用 id |
+| `frontend/src/features/publish/ui/PublishPage.tsx` | UI 选择比较改用 id |
+
+### 关键教训
+
+> **签名 URL 不可作为持久化标识**。Supabase Storage 的签名 URL 包含时间戳和签名参数，每次请求都不同。任何需要跨请求/跨刷新保持的标识，必须使用后端返回的稳定 `id` 字段。
+
+### 重启要求
+```bash
+pm2 restart vigent2-backend    # Remotion 容错
+npm run build && pm2 restart vigent2-frontend  # 前端持久化修复
+```
+
+---
+
+## 🎨 浮动样式预览窗口优化 (Day 21)
+
+### 概述
+标题与字幕面板中的预览区域原本是内联折叠的，展开后调节下方滑块时看不到预览效果。改为 `position: fixed` 浮动窗口，固定在视口左上角，滚动页面时预览始终可见，边调边看。
+
+### 已完成优化
+
+#### 1. 新建浮动预览组件 `FloatingStylePreview.tsx`
+- `createPortal(jsx, document.body)` 渲染到 body 层级，脱离面板 DOM 树
+- `position: fixed` + 左上角固定定位，滚动时不移动
+- `z-index: 150`（低于 VideoPreviewModal 的 200）
+- 顶部标题栏 + X 关闭按钮，ESC 键关闭
+- 桌面端固定宽度 280px，移动端自适应（最大 360px）
+- `previewScale = windowWidth / previewBaseWidth` 自行计算缩放
+- `maxHeight: calc(100dvh - 32px)` 防止超出视口
+
+#### 2. 修改 `TitleSubtitlePanel.tsx`
+- 删除内联预览区域（`ref={previewContainerRef}` 整块 JSX）
+- 条件渲染 `<FloatingStylePreview />`，按钮文本保持"预览样式"/"收起预览"
+- 移除 `previewScale`、`previewAspectRatio`、`previewContainerRef` props
+- 保留 `previewBaseWidth/Height`（浮动窗口需要原始尺寸计算 scale）
+
+#### 3. 清理 `useHomeController.ts`
+- 移除 `previewContainerWidth` 状态
+- 移除 `titlePreviewContainerRef` ref
+- 移除 ResizeObserver useEffect（浮动窗口自管尺寸，不再需要）
+
+#### 4. 简化 `HomePage.tsx` 传参
+- 移除 `previewContainerWidth`、`titlePreviewContainerRef` 解构
+- 移除 `previewScale`、`previewAspectRatio`、`previewContainerRef` prop 传递
+
+#### 5. 移动端适配
+- `ScriptEditor.tsx`：标题行改为 `flex-wrap`，"AI生成标题标签"按钮不再溢出
+- 预览默认比例从 1280×720 (16:9) 改为 1080×1920 (9:16)，符合抖音竖屏视频
+
+### 涉及文件汇总
+
+| 文件 | 变更 |
+|------|------|
+| `frontend/src/features/home/ui/FloatingStylePreview.tsx` | **新建** 浮动预览组件 |
+| `frontend/src/features/home/ui/TitleSubtitlePanel.tsx` | 移除内联预览，渲染浮动组件 |
+| `frontend/src/features/home/model/useHomeController.ts` | 移除 preview 容器相关状态和 ResizeObserver |
+| `frontend/src/features/home/ui/HomePage.tsx` | 简化 props 传递，默认比例改 9:16 |
+| `frontend/src/features/home/ui/ScriptEditor.tsx` | 移动端按钮换行适配 |
+
+### 重启要求
+```bash
+npm run build && pm2 restart vigent2-frontend
+```
+
+---
+
+## 🔧 多平台发布体系重构：用户隔离与抖音刷脸验证 (Day 21)
+
+### 概述
+重构发布系统的两大核心问题：① 多用户场景下 Cookie/会话缺乏隔离，② 抖音登录新增刷脸验证步骤无法处理。同时修复了平台配置混用和微信视频号发布流程问题。
+
+---
+
+### 一、平台配置独立化
+
+#### 问题
+所有平台（抖音、微信、B站、小红书）共用 WEIXIN_* 配置，导致 User-Agent、Headless 模式等设置不匹配。
+
+#### 修复 — `config.py`
+- 新增 `DOUYIN_*` 独立配置项：`DOUYIN_HEADLESS_MODE`、`DOUYIN_USER_AGENT`（Chrome/144）、`DOUYIN_LOCALE`、`DOUYIN_TIMEZONE_ID`、`DOUYIN_CHROME_PATH`、`DOUYIN_FORCE_SWIFTSHADER`、调试开关等
+- 微信保持已有 `WEIXIN_*` 配置
+- B站/小红书使用通用默认值
+
+#### 修复 — `qr_login_service.py` 平台配置映射
+```python
+# 之前：所有平台都用 WEIXIN 设置
+# 之后：每个平台独立配置
+PLATFORM_CONFIGS = {
+    "douyin": { headless, user_agent, locale, timezone... },
+    "weixin": { headless, user_agent, locale, timezone... },
+    "bilibili": { 通用配置 },
+    "xiaohongshu": { 通用配置 },
+}
+```
+
+---
+
+### 二、用户隔离的 Cookie 管理
+
+#### 问题
+多用户共享同一套 Cookie 文件，用户 A 的登录态可能被用户 B 覆盖。
+
+#### 修复 — `publish_service.py`
+- `_get_cookies_dir(user_id)` → `backend/user_data/{uuid}/cookies/`
+- `_get_cookie_path(user_id, platform)` → 按用户+平台返回独立 Cookie 文件路径
+- `_get_session_key(user_id, platform)` → `"{user_id}_{platform}"` 格式的会话 key
+- 登录/发布流程全链路传入 `user_id`，清理残留会话避免干扰
+
+---
+
+### 三、抖音刷脸验证二维码
+
+#### 问题
+抖音扫码登录后可能弹出刷脸验证窗口，内含新的二维码需要用户再次扫描，前端无法感知和展示。
+
+#### 修复 — 后端 `qr_login_service.py`
+- 扩展 QR 选择器：支持跨 iframe 搜索二维码元素
+- 抖音 API 拦截：监听 `check_qrconnect` 响应，检测 `redirect_url`
+- 检测 "完成验证" / "请前往APP完成验证" 文案
+- 在验证弹窗内找到正方形二维码（排除头像），截图返回给前端
+- API 确认后直接导航到 redirect_url（不重新加载 QR 页，避免销毁会话）
+
+#### 修复 — 后端 `publish_service.py`
+- `get_login_session_status()` 新增 `face_verify_qr` 字段返回
+- 登录成功且 Cookie 保存后自动清理会话
+
+#### 修复 — 前端
+- `usePublishController.ts`：新增 `faceVerifyQr` 状态，轮询时获取 `face_verify_qr` 字段
+- `PublishPage.tsx`：QR 弹窗优先展示刷脸验证二维码，附提示文案
+
+```tsx
+{faceVerifyQr ? (
+  <>
+    <Image src={`data:image/png;base64,${faceVerifyQr}`} />
+    <p>需要身份验证，请用抖音APP扫描上方二维码完成刷脸验证</p>
+  </>
+) : /* 普通登录二维码 */ }
+```
+
+---
+
+### 四、微信视频号发布流程优化
+
+#### 修复 — `weixin_uploader.py`
+- 添加 `user_id` 参数支持，发布截图目录隔离
+- 新增 `post_create` API 响应监听，精准判断发布成功
+- 发布结果判定：URL 离开创建页 或 API 确认提交 → 视为成功
+- 标题/标签处理改为统一写入"视频描述"字段（不再单独填写 title/tags）
+
+---
+
+### 涉及文件汇总
+
+| 文件 | 变更 |
+|------|------|
+| `backend/app/core/config.py` | 新增 DOUYIN_* 独立配置项 |
+| `backend/app/services/qr_login_service.py` | 平台配置拆分、刷脸验证二维码、跨 iframe 选择器 |
+| `backend/app/services/publish_service.py` | 用户隔离 Cookie 管理、刷脸验证状态返回 |
+| `backend/app/services/uploader/weixin_uploader.py` | user_id 支持、post_create API 监听、描述字段合并 |
+| `frontend/src/features/publish/model/usePublishController.ts` | faceVerifyQr 状态 |
+| `frontend/src/features/publish/ui/PublishPage.tsx` | 刷脸验证二维码展示 |
+
+### 重启要求
+```bash
+pm2 restart vigent2-backend    # 发布服务 + QR登录
+npm run build && pm2 restart vigent2-frontend  # 刷脸验证UI
+```
+
+---
+
+## 🏗️ 架构优化：前端结构微调 + 后端模块分层 (Day 21)
+
+### 概述
+根据架构审计结果，完成前端目录规范化和后端核心模块的分层补全。
+
+### 一、前端结构微调
+
+#### 1. ScriptExtractionModal 迁移
+- `components/ScriptExtractionModal.tsx` → `features/home/ui/ScriptExtractionModal.tsx`
+- 连带 `components/script-extraction/` 目录一并迁移到 `features/home/ui/script-extraction/`
+- 更新 `HomePage.tsx` 的 import 路径
+
+#### 2. contexts/ 目录归并
+- `src/contexts/AuthContext.tsx` → `src/shared/contexts/AuthContext.tsx`
+- `src/contexts/TaskContext.tsx` → `src/shared/contexts/TaskContext.tsx`
+- 更新 6 处 import（layout.tsx, useHomeController.ts, usePublishController.ts, AccountSettingsDropdown.tsx, GlobalTaskIndicator.tsx）
+- 删除空的 `src/contexts/` 目录
+
+#### 3. 清理重构遗留空目录
+- 删除 `src/lib/`、`src/components/home/`、`src/hooks/`
+
+### 二、后端模块分层补全
+
+将 3 个 400+ 行的 router-only 模块拆分为 `router.py + schemas.py + service.py`：
+
+| 模块 | 改造前 | 改造后 router |
+|------|--------|--------------|
+| `materials/` | 416 行 | 63 行 |
+| `tools/` | 417 行 | 33 行 |
+| `ref_audios/` | 421 行 | 71 行 |
+
+业务逻辑全部提取到 `service.py`，数据模型定义在 `schemas.py`，router 只做参数校验 + 调用 service + 返回响应。
+
+### 三、开发规范更新
+
+`BACKEND_DEV.md` 第 8 节新增渐进原则：
+- 新模块**必须**包含 `router.py + schemas.py + service.py`
+- 改旧模块时顺手拆涉及的部分
+- 新代码高标准，旧代码逐步改
+
+### 涉及文件汇总
+
+| 文件 | 变更 |
+|------|------|
+| `frontend/src/features/home/ui/ScriptExtractionModal.tsx` | 从 components/ 迁入 |
+| `frontend/src/features/home/ui/script-extraction/` | 从 components/ 迁入 |
+| `frontend/src/shared/contexts/AuthContext.tsx` | 从 contexts/ 迁入 |
+| `frontend/src/shared/contexts/TaskContext.tsx` | 从 contexts/ 迁入 |
+| `backend/app/modules/materials/schemas.py` | **新建** |
+| `backend/app/modules/materials/service.py` | **新建** |
+| `backend/app/modules/materials/router.py` | 精简为薄路由 |
+| `backend/app/modules/tools/schemas.py` | **新建** |
+| `backend/app/modules/tools/service.py` | **新建** |
+| `backend/app/modules/tools/router.py` | 精简为薄路由 |
+| `backend/app/modules/ref_audios/schemas.py` | **新建** |
+| `backend/app/modules/ref_audios/service.py` | **新建** |
+| `backend/app/modules/ref_audios/router.py` | 精简为薄路由 |
+| `Docs/BACKEND_DEV.md` | 目录结构标注分层、新增渐进原则 |
+| `Docs/BACKEND_README.md` | 目录结构标注分层 |
+| `Docs/FRONTEND_DEV.md` | 更新目录结构（contexts 迁移、ScriptExtractionModal 迁移） |
+
+### 重启要求
+```bash
+pm2 restart vigent2-backend
+npm run build && pm2 restart vigent2-frontend
+```
+
+---
+
+## 🎬 多素材视频生成（多机位效果）
+
+### 概述
+支持用户上传多个不同角度的自拍视频，生成视频时按句子自动切换素材，最终效果类似多机位拍摄。单素材时走原有流程，无额外开销。
+
+### 核心架构
+
+#### 流水线变更
+```
+【单素材（不变）】
+text → TTS → audio → LatentSync(1个素材+完整audio) → Whisper字幕 → Remotion → 成片
+
+【多素材（新增）】
+text → TTS → audio → Whisper字幕(提前) → 按素材数量均分时长(对齐字边界)
+  → 对每段: 切分audio + LatentSync(素材[i]+音频片段[i])
+  → FFmpeg拼接所有片段 → Remotion(完整字幕时间戳) → 成片
+```
+
+#### 素材切换逻辑（均分方案）
+1. Whisper 对完整音频转录，得到字级别时间戳
+2. 按素材数量**均分音频总时长**（`total_duration / N`）
+3. 每个分割点对齐到最近的 Whisper 字边界，避免在字中间切分
+4. 首段 start 扩展为 0.0，末段 end 扩展为音频结尾，确保完整覆盖
+
+> **设计决策**：最初方案基于原始文案标点分句，但用户文案往往不含句号（只有逗号），导致只产生 1 段。改为均分方案后不依赖文案标点，对任何输入都能正确切分。
+
+---
+
+### 一、后端改动
+
+#### 1. `backend/app/modules/videos/schemas.py`
+- 新增 `material_paths: Optional[List[str]]` 字段
+- 保留 `material_path: str` 向后兼容
+
+#### 2. `backend/app/modules/videos/workflow.py`（核心改动）
+
+**新增函数**：
+- `_split_equal(segments, material_paths)`: 按素材数量均分音频时长，对齐到最近的 Whisper 字边界
+
+**修改 `process_video_generation()`**：
+- `is_multi = len(material_paths) > 1` 判断走多素材/单素材分支
+- 多素材分支：Whisper 提前 → 均分切分 → 音频切分 → 逐段 LatentSync → FFmpeg 拼接
+
+#### 3. `backend/app/services/video_service.py`
+- 新增 `concat_videos()`: FFmpeg concat demuxer (`-c copy`) 拼接视频片段
+- 新增 `split_audio()`: FFmpeg 按时间范围切分音频 (`-ss` + `-t` + `-c copy`)
+
+#### 4. `backend/scripts/watchdog.py`
+- 健康检查阈值从 3 次提高到 5 次（容忍期 2.5 分钟）
+- 新增重启后 120 秒冷却期，避免模型加载期间被误判为故障
+- 启动时给所有服务 60 秒初始冷却期
+
+---
+
+### 二、前端改动
+
+#### 1. 新增依赖
+```bash
+npm install @dnd-kit/core @dnd-kit/sortable @dnd-kit/utilities
+```
+
+#### 2. `frontend/src/features/home/model/useMaterials.ts`
+- `selectedMaterial: string` → `selectedMaterials: string[]`（多选）
+- 新增 `toggleMaterial(id)`: 切换选中/取消（至少保留1个）
+- 新增 `reorderMaterials(activeId, overId)`: 拖拽排序
+- 上传格式扩展：新增 `.mkv/.webm/.flv/.wmv/.m4v/.ts/.mts`
+
+#### 3. `frontend/src/features/home/ui/MaterialSelector.tsx`（重写）
+- 素材列表每行增加复选框 + 序号徽标（①②③）
+- 选中 ≥2 个时显示拖拽排序区（@dnd-kit `SortableContext`）
+- 每个排序项：拖拽把手 + 序号 + 素材名 + 移除按钮
+- HTML input accept 改为 `video/*`
+
+#### 4. `frontend/src/features/home/model/useHomeController.ts`
+- 多素材 payload：`material_paths` 数组 + `material_path` 向后兼容
+- `enable_subtitles` 硬编码为 `true`（移除开关）
+- 验证：至少选中 1 个素材
+
+#### 5. `frontend/src/features/home/model/useHomePersistence.ts`
+- 素材持久化改为 JSON 数组，向后兼容旧格式（单字符串）
+- 移除 `enableSubtitles` 持久化
+
+#### 6. `frontend/src/features/home/ui/TitleSubtitlePanel.tsx`
+- 移除"逐字高亮字幕"开关，字幕样式区始终显示
+
+#### 7. `frontend/src/features/home/ui/HomePage.tsx`
+- 更新 props 传递（`selectedMaterials`, `toggleMaterial`, `reorderMaterials`）
+
+---
+
+### 三、Bug 修复记录
+
+#### BUG-1: 多素材只使用第一个视频（基于标点的分句方案失败）
+- **现象**: 选了 2 个素材但生成的视频只使用第 1 个，日志显示 `Multi-material: 1 segments, 2 materials`。
+- **根因 v1**: 最初通过正则 `[。！？!?]` 在 Whisper 输出中分句，但 Whisper 不输出标点。
+- **修复 v1**: 改为用原始文案标点分句——但用户文案往往只含逗号（，），无句末标点（。！？），仍退化为 1 段。
+- **最终修复**: 彻底放弃基于标点的分句方案，改为 `_split_equal()` **按素材数量均分音频时长**，对齐到最近的 Whisper 字边界。不依赖任何标点符号，对所有文案均有效。
+
+#### BUG-2: 口型对不上（音频时间偏移）
+- **根因**: `split_audio` 用 Whisper 的 start/end 时间（如 0.11~7.21）切分音频，但 `compose()` 用完整原始音频（0.0~结尾）合成，导致时间偏移。
+- **修复**: 强制首段 start=0.0，末段 end=音频实际时长，确保切分音频完整覆盖。
+
+#### BUG-3: min_segment_sec 过度合并导致退化（已随方案切换移除）
+- **根因**: 旧方案中 2 个句子第 2 句不足 3 秒时，最短时长检查合并为 1 段，多素材退化为单素材。
+- **状态**: 均分方案不存在此问题，相关代码已移除。
+
+---
+
+### 涉及文件汇总
+
+| 文件 | 变更类型 | 说明 |
+|------|----------|------|
+| `backend/app/modules/videos/schemas.py` | 修改 | 新增 material_paths 字段 |
+| `backend/app/modules/videos/workflow.py` | 修改 | 多素材流水线核心逻辑 + 3个 Bug 修复 |
+| `backend/app/services/video_service.py` | 修改 | 新增 concat_videos / split_audio |
+| `backend/scripts/watchdog.py` | 修改 | 阈值优化 + 冷却期机制 |
+| `frontend/package.json` | 修改 | 新增 @dnd-kit 依赖 |
+| `frontend/src/features/home/model/useMaterials.ts` | 修改 | 多选 + 排序状态管理 |
+| `frontend/src/features/home/ui/MaterialSelector.tsx` | 重写 | 多选复选框 + 拖拽排序 UI |
+| `frontend/src/features/home/model/useHomeController.ts` | 修改 | 多素材 payload + 移除字幕开关 |
+| `frontend/src/features/home/model/useHomePersistence.ts` | 修改 | JSON 数组持久化 |
+| `frontend/src/features/home/ui/TitleSubtitlePanel.tsx` | 修改 | 移除字幕开关 |
+| `frontend/src/features/home/ui/HomePage.tsx` | 修改 | 更新 props 传递 |
+
+### 重启要求
+```bash
+pm2 restart vigent2-backend
+npm run build && pm2 restart vigent2-frontend
+```
--- a/Docs/DevLogs/Day22.md
+++ b/Docs/DevLogs/Day22.md
@@ -0,0 +1,221 @@
+## 🔧 多素材生成优化与健壮性加固 (Day 22)
+
+### 概述
+对 Day 21 实现的多素材视频生成（多机位）功能进行全面审查，修复 6 个高优先级 Bug、完成 8 项体验优化，并将多素材流水线从"逐段 LatentSync"重构为"先拼接再推理"架构，推理次数从 N 次降为 1 次。
+
+---
+
+### 一、后端高优 Bug 修复
+
+#### 1. `_split_equal()` 素材数 > 字符数边界溢出
+- **问题**: 5 个素材但只有 2 个 Whisper 字符时，边界索引重复，部分素材被跳过
+- **修复**: 加入 `n = min(n, len(all_chars))` 上限保护
+- **文件**: `backend/app/modules/videos/workflow.py`
+
+#### 2. 多素材 LatentSync 单段失败无 fallback
+- **问题**: 单素材模式下 LatentSync 失败会 fallback 到原始素材，但多素材模式直接抛异常，整个任务失败
+- **修复**: 多素材循环中加 try-except，失败时 fallback 到原始素材片段
+- **文件**: `backend/app/modules/videos/workflow.py`
+
+#### 3. `num_segments == 0` 时 ZeroDivisionError
+- **问题**: 所有 assignments 被跳过后 `i / num_segments` 触发除零
+- **修复**: 循环前加 `if num_segments == 0` 检查并抛出明确错误
+- **文件**: `backend/app/modules/videos/workflow.py`
+
+#### 4. `split_audio` 未校验 duration > 0
+- **问题**: `end <= start` 时 FFmpeg 行为异常
+- **修复**: 加入 `if duration <= 0: raise ValueError(...)`
+- **文件**: `backend/app/services/video_service.py`
+
+#### 5. Whisper 失败时按时长均分兜底
+- **问题**: Whisper 失败后直接退化为单素材，其他素材被浪费
+- **修复**: 按 `audio_duration / len(material_paths)` 均分，不依赖字符对齐
+- **文件**: `backend/app/modules/videos/workflow.py`
+
+#### 6. `concat_videos` 空列表未检查
+- **问题**: 传入空 `video_paths` 时 FFmpeg 报错
+- **修复**: 加入 `if not video_paths: raise ValueError(...)`
+- **文件**: `backend/app/services/video_service.py`
+
+---
+
+### 二、前端优化
+
+#### 1. payload 构建非空断言修复
+- `m!.path` → `m?.path` + `.filter(Boolean)`，防止素材被删后 crash
+- **文件**: `frontend/src/features/home/model/useHomeController.ts`
+
+#### 2. 生成按钮展示后端进度消息
+- 新增 `message` prop，生成中显示如"(正在处理片段 2/3...)"
+- **文件**: `frontend/src/features/home/ui/GenerateActionBar.tsx`, `HomePage.tsx`
+
+#### 3. 新上传素材自动选中
+- 上传成功后对比前后素材列表，新增的 ID 自动追加到 `selectedMaterials`
+- **文件**: `frontend/src/features/home/model/useMaterials.ts`
+
+#### 4. Material 接口统一
+- 三处 `interface Material` 重复定义提取到 `shared/types/material.ts`
+- **文件**: `frontend/src/shared/types/material.ts` (新建), `useMaterials.ts`, `useHomeController.ts`, `MaterialSelector.tsx`
+
+#### 5. 拖拽排序修复
+- 移除 `DragOverlay`（`backdrop-blur` 创建新 containing block 导致定位错乱）
+- 改为 `useSortable` 原生拖拽 + `CSS.Translate`，拖拽中元素高亮加阴影
+- **文件**: `frontend/src/features/home/ui/MaterialSelector.tsx`
+
+#### 6. 素材选择上限 4 个
+- `toggleMaterial` 新增 `MAX_MATERIALS = 4` 限制
+- UI 选满后未选中项变半透明禁用，提示文字改为"可多选，最多4个"
+- **文件**: `useMaterials.ts`, `MaterialSelector.tsx`
+
+#### 7. 移动端排序区域响应式
+- 素材列表 `max-h-64` → `max-h-48 sm:max-h-64`
+- **文件**: `MaterialSelector.tsx`
+
+#### 8. 多素材耗时提示
+- 选中 ≥2 素材时生成按钮下方显示"多素材模式 (N 个机位)，生成耗时较长"
+- **文件**: `GenerateActionBar.tsx`, `HomePage.tsx`
+
+---
+
+### 三、核心架构重构：先拼接再推理
+
+#### V1 (Day 21): 逐段 LatentSync
+```
+素材A → LatentSync(素材A, 音频片段1) → lipsync_A
+素材B → LatentSync(素材B, 音频片段2) → lipsync_B
+FFmpeg concat(lipsync_A, lipsync_B) → 最终视频
+```
+- 缺点：N 个素材 = N 次 LatentSync 推理（每次 ~30s）
+
+#### V2 (Day 22): 先拼接再推理
+```
+素材A → prepare_segment(裁剪到3.67s) → prepared_A
+素材B → prepare_segment(裁剪到4.00s) → prepared_B
+FFmpeg concat(prepared_A, prepared_B) → concat_video (7.67s)
+LatentSync(concat_video, 完整音频) → 最终视频
+```
+- 优点：只需 **1 次** LatentSync 推理，时间从 N×30s 降为 1×30s
+
+#### 新增 `prepare_segment()` 方法
+```python
+def prepare_segment(self, video_path, target_duration, output_path, target_resolution=None):
+    # 素材时长 > 目标: 裁剪 (-t)
+    # 素材时长 < 目标: 循环 (-stream_loop) + 裁剪
+    # 分辨率一致: -c copy 无损 (不重编码)
+    # 分辨率不一致: scale + pad 统一到第一个素材分辨率
+```
+
+#### 分辨率处理策略
+- 新增 `get_resolution()` 方法检测各素材分辨率
+- 所有素材分辨率相同时：`-c copy` 无损裁剪（保持原画质）
+- 分辨率不一致时：统一到第一个素材的分辨率，`force_original_aspect_ratio=decrease` + `pad` 居中
+- LatentSync 只处理嘴部 512×512 区域，输出保持原分辨率
+
+#### 时间对齐验证
+
+| 环节 | 时间基准 | 对齐关系 |
+|------|---------|---------|
+| TTS 音频 | 原始时长 (7.67s) | 基准 |
+| Whisper 字幕 | 基于 TTS 音频 | 时间戳对齐音频 |
+| 均分切分 | assignments 总时长 = 音频时长 | 首段 start=0, 末段 end=audio_duration |
+| prepare 各段 | `-t seg_dur` 精确截断 | 总和 ≈ 音频时长 |
+| LatentSync | concat_video + 完整音频 | 内部 0.5s 容差 |
+| compose | lipsync_video + 音频/BGM | `-shortest` 保证同步 |
+| Remotion | 基于 captions_path 渲染字幕 | 时间戳对齐音频 |
+
+---
+
+### 涉及文件汇总
+
+| 文件 | 变更类型 | 说明 |
+|------|----------|------|
+| `backend/app/modules/videos/workflow.py` | 修改 | 6 个 Bug 修复 + 流水线重构（先拼接再推理）|
+| `backend/app/services/video_service.py` | 修改 | 新增 `prepare_segment()`、`get_resolution()`，`split_audio` 校验，`concat_videos` 空列表检查 |
+| `frontend/src/shared/types/material.ts` | 新建 | 统一 Material 接口 |
+| `frontend/src/features/home/model/useMaterials.ts` | 修改 | 上传自动选中、素材上限 4 个 |
+| `frontend/src/features/home/model/useHomeController.ts` | 修改 | payload 非空断言修复、Material 接口引用 |
+| `frontend/src/features/home/ui/MaterialSelector.tsx` | 修改 | 拖拽修复、上限 4 个 UI、移动端响应式 |
+| `frontend/src/features/home/ui/GenerateActionBar.tsx` | 修改 | 进度消息展示、多素材耗时提示 |
+| `frontend/src/features/home/ui/HomePage.tsx` | 修改 | 传递 message、materialCount prop |
+
+---
+
+### 四、AI 多语言翻译
+
+#### 功能
+在文案编辑区新增「AI多语言」按钮，支持将中文口播文案一键翻译为 9 种语言，并可随时还原原文。
+
+#### 支持语言
+英语 English、日语 日本語、韩语 한국어、法语 Français、德语 Deutsch、西班牙语 Español、俄语 Русский、意大利语 Italiano、葡萄牙语 Português
+
+#### 实现
+
+##### 后端
+- **`backend/app/services/glm_service.py`** — 新增 `translate_text()` 方法，调用智谱 GLM API（temperature=0.3），prompt 要求只返回译文、保持语气风格
+- **`backend/app/modules/ai/router.py`** — 新增 `POST /api/ai/translate` 接口，接收 `{text, target_lang}`，返回 `{translated_text}`
+
+##### 前端
+- **`frontend/src/features/home/ui/ScriptEditor.tsx`** — 新增 `LANGUAGES` 列表（9 种语言）、语言下拉菜单（点击外部自动关闭）、翻译中 loading 状态、「还原原文」按钮（翻译过后出现在菜单顶部）
+- **`frontend/src/features/home/model/useHomeController.ts`** — 新增 `handleTranslate`（调用翻译 API、首次翻译保存原文）、`originalText` 状态、`handleRestoreOriginal`（恢复原文）
+
+#### 涉及文件
+
+| 文件 | 变更 | 说明 |
+|------|------|------|
+| `backend/app/services/glm_service.py` | 修改 | 新增 `translate_text()` 方法 |
+| `backend/app/modules/ai/router.py` | 修改 | 新增 `/api/ai/translate` 接口 |
+| `frontend/src/features/home/ui/ScriptEditor.tsx` | 修改 | 语言菜单 UI、翻译 loading、还原原文按钮 |
+| `frontend/src/features/home/model/useHomeController.ts` | 修改 | `handleTranslate`、`originalText`、`handleRestoreOriginal` |
+
+---
+
+### 五、TTS 多语言支持
+
+#### 背景
+翻译功能实现后，用户可将中文文案翻译为其他语言。但翻译后生成视频时 TTS 仍只支持中文：
+- **EdgeTTS**：声音列表只有 5 个 `zh-CN-*` 中文声音
+- **声音克隆 (Qwen3-TTS)**：`language` 参数硬编码为 `"Chinese"`
+
+#### 实现方案
+
+##### 1. 前端：语言感知的声音列表
+- `VOICES` 从扁平数组扩展为 `Record<string, VoiceOption[]>`，覆盖 10 种语言（zh-CN / en-US / ja-JP / ko-KR / fr-FR / de-DE / es-ES / ru-RU / it-IT / pt-BR），每种语言 2 个声音（男/女）
+- 新增 `LANG_TO_LOCALE` 映射：翻译目标语言名 → EdgeTTS locale（如 `"English" → "en-US"`）
+- 新增 `textLang` 状态，跟踪当前文案语言，默认 `"zh-CN"`
+
+##### 2. 翻译时自动切换声音
+- `handleTranslate` 成功后：根据目标语言设置 `textLang`，EdgeTTS 模式下自动切换 `voice` 为目标语言的默认声音
+- `handleRestoreOriginal` 还原时：重置 `textLang` 为 `"zh-CN"`，恢复中文默认声音
+- `VoiceSelector` 根据 `textLang` 动态显示对应语言的声音列表
+
+##### 3. 声音克隆语言透传
+- 前端：新增 `LOCALE_TO_QWEN_LANG` 映射（`zh-CN→"Chinese"`, `en-US→"English"`, 其他→`"Auto"`）
+- 生成请求 payload 加入 `language` 字段（仅声音克隆模式）
+- 后端 `GenerateRequest` schema 新增 `language: str = "Chinese"` 字段
+- `workflow.py`：`language="Chinese"` 硬编码改为 `language=req.language`
+
+##### 4. Bug 修复：textLang 持久化
+- **问题**: `voice` 已持久化但 `textLang` 未持久化，刷新页面后 `voice` 恢复为英文声音但 `textLang` 默认回中文，导致 VoiceSelector 显示中文声音列表却选中英文声音，无高亮按钮
+- **修复**: 在 `useHomePersistence` 中加入 `textLang` 的 localStorage 读写
+
+#### 数据流
+
+```
+用户翻译 "English"
+  → ScriptEditor.onTranslate("English")
+  → LANG_TO_LOCALE["English"] = "en-US"
+  → setTextLang("en-US"), setVoice("en-US-GuyNeural")
+  → VoiceSelector 显示 VOICES["en-US"] = [Guy, Jenny]
+  → 生成时:
+      EdgeTTS: payload.voice = "en-US-GuyNeural"
+      声音克隆: payload.language = "English" (via getQwenLanguage)
+```
+
+#### 涉及文件
+
+| 文件 | 变更 | 说明 |
+|------|------|------|
+| `frontend/src/features/home/model/useHomeController.ts` | 修改 | VOICES 多语言 Record、textLang 状态、LANG_TO_LOCALE / LOCALE_TO_QWEN_LANG 映射、翻译自动切换 voice |
+| `frontend/src/features/home/model/useHomePersistence.ts` | 修改 | textLang 持久化读写 |
+| `backend/app/modules/videos/schemas.py` | 修改 | GenerateRequest 加 `language` 字段 |
+| `backend/app/modules/videos/workflow.py` | 修改 | 声音克隆调用处用 `req.language` 替代硬编码 |
--- a/Docs/DevLogs/Day23.md
+++ b/Docs/DevLogs/Day23.md
@@ -0,0 +1,546 @@
+## 🎙️ 配音前置重构 — 第一阶段 (Day 23)
+
+### 概述
+
+将配音从视频生成流程中独立出来，实现"先生成配音 → 选中配音 → 再选素材 → 生成视频"的新工作流。用户可以独立管理配音（生成/试听/改名/删除/选择），并在选中配音后看到时长信息，为第二阶段的素材时间轴编排奠定数据基础。
+
+**旧流程**: 文案 + 选素材 → 一键生成（内联 TTS → Whisper → 均分 → LipSync → 合成）
+**新流程**: 文案 → 配音方式 → **生成配音** → 选中配音 → 选素材 → 背景音乐 → 生成视频
+
+---
+
+### 一、后端：新增 `generated_audios` 模块
+
+#### 模块结构
+
+```
+backend/app/modules/generated_audios/
+├── __init__.py
+├── router.py      # 5 个 API 端点
+├── schemas.py     # 请求/响应模型
+└── service.py     # 生成/列表/删除/改名
+```
+
+#### API 端点
+
+| 方法 | 路径 | 说明 |
+|------|------|------|
+| POST | `/api/generated-audios/generate` | 异步生成配音（返回 task_id） |
+| GET | `/api/generated-audios/tasks/{task_id}` | 轮询生成进度 |
+| GET | `/api/generated-audios` | 列出用户所有配音 |
+| DELETE | `/api/generated-audios/{audio_id}` | 删除配音 |
+| PUT | `/api/generated-audios/{audio_id}` | 改名 |
+
+#### 存储方案
+
+- Supabase 存储桶：`generated-audios`（启动时自动创建）
+- 音频文件：`{user_id}/{timestamp}_audio.wav`
+- 元数据文件：`{user_id}/{timestamp}_audio.json`（含 display_name、text、tts_mode、duration_sec 等）
+
+#### 生成流程
+
+复用现有 `TTSService` / `voice_clone_service` / `task_store`：
+
+```
+POST /generate → 创建 task → BackgroundTask:
+  1. edgetts → TTSService.generate_audio()
+     voiceclone → 下载 ref_audio → voice_clone_service.generate_audio()
+  2. ffprobe 获取时长
+  3. 上传 .wav + .json 到 generated-audios 桶
+  4. 更新 task(status=completed, output={audio_id, duration_sec, ...})
+```
+
+---
+
+### 二、后端：修改视频生成 workflow
+
+#### `GenerateRequest` 新增字段
+
+```python
+generated_audio_id: Optional[str] = None  # 预生成配音 ID（存在时跳过内联 TTS）
+```
+
+#### `workflow.py` TTS 阶段新增分支
+
+```python
+if req.generated_audio_id:
+    # 下载预生成配音 + 从元数据读取 language
+elif req.tts_mode == "voiceclone":
+    # 原有声音克隆逻辑
+else:
+    # 原有 EdgeTTS 逻辑
+```
+
+向后兼容：不传 `generated_audio_id` 时，原有内联 TTS 流程不受影响。
+
+---
+
+### 三、前端：新增配音列表 hook + 面板
+
+#### `useGeneratedAudios.ts`
+
+- 状态：`generatedAudios[]`、`selectedAudio`、`isGeneratingAudio`、`audioTask`
+- 方法：`fetchGeneratedAudios()`、`generateAudio()`、`deleteAudio()`、`renameAudio()`、`selectAudio()`
+- 轮询：生成后 1s 轮询 task 状态，完成后自动刷新列表并选中最新配音
+- 独立于视频生成的 TaskContext（不互相干扰）
+
+#### `GeneratedAudiosPanel.tsx`
+
+- 每条配音：播放/暂停、名称、时长、重命名、删除
+- 选中态：`border-purple-500 bg-purple-500/20`
+- 内嵌进度条（生成中显示）
+- 底部显示选中配音的原始文案（截断）
+- 播放逻辑自包含于面板内（`new Audio()` + play/pause toggle）
+
+---
+
+### 四、前端：UI 面板重排序
+
+**旧顺序**: MaterialSelector → ScriptEditor → TitleSubtitle → VoiceSelector → BgmPanel → GenerateActionBar
+
+**新顺序**:
+1. ScriptEditor（文案编辑）
+2. TitleSubtitlePanel（标题与字幕样式）
+3. VoiceSelector（配音方式）
+4. **GeneratedAudiosPanel**（配音列表）← 新增
+5. MaterialSelector（视频素材）← 后移，需选中配音才解锁
+6. BgmPanel（背景音乐）
+7. GenerateActionBar（生成视频）
+
+#### 素材区门控
+
+未选中配音时，素材区显示半透明遮罩 + "请先生成并选中配音"提示。素材上传/预览/改名/删除始终可用，仅选择勾选被遮罩。
+
+#### 时长信息
+
+选中配音后，MaterialSelector 顶部显示：
+```
+当前配音: 45.2 秒 | 已选 3 个素材（自动均分每段 ~15.1 秒）
+```
+
+#### 生成按钮条件更新
+
+```typescript
+// 旧条件
+disabled={isGenerating || selectedMaterials.length === 0 || (ttsMode === "voiceclone" && !selectedRefAudio)}
+// 新条件
+disabled={isGenerating || selectedMaterials.length === 0 || !selectedAudio}
+```
+
+---
+
+### 五、持久化
+
+`useHomePersistence` 新增 `selectedAudioId` 的 localStorage 读写，刷新页面后恢复选中的配音。
+
+---
+
+### 涉及文件汇总
+
+#### 后端新增
+
+| 文件 | 说明 |
+|------|------|
+| `backend/app/modules/generated_audios/__init__.py` | 模块标记 |
+| `backend/app/modules/generated_audios/router.py` | 5 个 API 端点 |
+| `backend/app/modules/generated_audios/service.py` | 生成/列表/删除/改名 |
+| `backend/app/modules/generated_audios/schemas.py` | 请求/响应模型 |
+
+#### 后端修改
+
+| 文件 | 变更 |
+|------|------|
+| `backend/app/main.py` | 注册 generated_audios 路由 |
+| `backend/app/services/storage.py` | 新增 `BUCKET_GENERATED_AUDIOS`，启动时自动创建桶 |
+| `backend/app/modules/videos/schemas.py` | `GenerateRequest` 新增 `generated_audio_id` 字段 |
+| `backend/app/modules/videos/workflow.py` | TTS 阶段新增预生成音频分支 |
+
+#### 前端新增
+
+| 文件 | 说明 |
+|------|------|
+| `frontend/src/features/home/model/useGeneratedAudios.ts` | 配音列表 hook |
+| `frontend/src/features/home/ui/GeneratedAudiosPanel.tsx` | 配音列表面板 |
+
+#### 前端修改
+
+| 文件 | 变更 |
+|------|------|
+| `frontend/src/features/home/ui/HomePage.tsx` | 面板重排序 + 素材区门控 + 插入 GeneratedAudiosPanel |
+| `frontend/src/features/home/ui/MaterialSelector.tsx` | 新增 `selectedAudioDuration` prop + 时长信息显示 |
+| `frontend/src/features/home/ui/GenerateActionBar.tsx` | 禁用条件改为 `!selectedAudio` |
+| `frontend/src/features/home/model/useHomeController.ts` | 集成 useGeneratedAudios、新增 handleGenerateAudio、修改 handleGenerate 使用 generated_audio_id |
+| `frontend/src/features/home/model/useHomePersistence.ts` | 新增 selectedAudioId 持久化 |
+
+---
+
+## 🎞️ 素材时间轴编排 — 第二阶段 (Day 23)
+
+### 概述
+
+在第一阶段"配音前置"基础上，新增**时间轴编辑器**，用户可以：
+1. 在音频波形上查看各素材块的时长分配
+2. 拖拽分割线调整每段素材的时长（无缝铺满，调整一段自动压缩/扩展相邻段）
+3. 为每段素材设置**源视频截取起点**（从视频任意位置开始，而非始终从头）
+
+**旧行为**: 多素材时自动均分（`_split_equal`），无法控制每段时长和源视频起始点
+**新行为**: 时间轴编辑器可视化分配 + 拖拽调整 + ClipTrimmer 截取设置
+
+---
+
+### 一、后端改动
+
+#### 1.1 新增 `CustomAssignment` 模型
+
+```python
+# backend/app/modules/videos/schemas.py
+class CustomAssignment(BaseModel):
+    material_path: str
+    start: float           # 音频时间轴起点
+    end: float             # 音频时间轴终点
+    source_start: float = 0.0  # 源视频截取起点
+```
+
+`GenerateRequest` 新增 `custom_assignments: Optional[List[CustomAssignment]] = None`。存在时跳过 Whisper 均分，直接使用用户定义的分配。
+
+#### 1.2 `prepare_segment` 支持 `source_start`
+
+```python
+def prepare_segment(self, video_path, target_duration, output_path,
+                    target_resolution=None, source_start: float = 0.0):
+```
+
+关键逻辑：
+- `source_start > 0` 时使用 `-ss` 快速 seek，并强制重编码（避免 stream copy 关键帧不精确）
+- 当需要循环且有 `source_start` 时，先裁剪出 `source_start` 到视频结尾的片段，再循环裁剪后的文件（避免 `stream_loop` 从视频 0s 开始循环）
+- 裁剪临时文件在 `finally` 中自动清理
+
+#### 1.3 `workflow.py` 支持 `custom_assignments`
+
+- **多素材模式**: `custom_assignments` 存在时，直接使用用户分配（仍运行 Whisper 生成字幕），每个 `prepare_segment` 调用传入 `source_start`
+- **单素材模式**: `custom_assignments` 有 1 条且 `source_start > 0` 时，先截取片段再传入 LatentSync
+- **向后兼容**: `custom_assignments` 为 `None` 时完全走旧路径
+
+---
+
+### 二、前端新增组件
+
+#### 2.1 `useTimelineEditor.ts` — 时间轴段管理 hook
+
+```typescript
+interface TimelineSegment {
+  id: string;              // React key
+  materialId: string;      // 素材 ID
+  materialName: string;    // 显示名
+  start: number;           // 音频时间轴开始秒数
+  end: number;             // 音频时间轴结束秒数
+  sourceStart: number;     // 源视频截取起点（默认 0）
+  sourceEnd: number;       // 源视频截取终点（0 = 到结尾）
+  color: string;           // 色块颜色
+}
+```
+
+核心方法：
+- `initSegments()`: selectedMaterials 变化时按数量均分 audioDuration
+- `resizeSegment(id, newEnd)`: 拖拽右边界，约束每段最小 1s
+- `setSourceRange(id, sourceStart, sourceEnd)`: 设置截取范围
+- `toCustomAssignments()`: 转为后端 `CustomAssignment[]` 格式
+
+#### 2.2 `TimelineEditor.tsx` — 波形 + 色块时间轴
+
+- **wavesurfer.js** 渲染音频波形（仅展示，不播放）
+- 色块层按比例排列，显示素材名 + 时长 + 截取标记
+- 色块间分割线可拖拽（`onPointerDown/Move/Up` 实现连续像素拖拽）
+- 点击色块打开 ClipTrimmer
+
+#### 2.3 `ClipTrimmer.tsx` — 素材截取模态框
+
+- HTML5 `<video>` 实时预览，拖拽滑块时 `video.currentTime` 跟随
+- 双端 Range Slider（起点/终点），互锁约束 ≥ 0.5s
+- 显示截取时长 vs 分配时长对比（循环补足/截断提示）
+- `loadedmetadata` 获取源视频时长
+
+---
+
+### 三、前端整合改动
+
+#### 3.1 `useHomeController.ts`
+
+- 集成 `useTimelineEditor` hook
+- 新增 `clipTrimmerOpen` / `clipTrimmerSegmentId` 状态
+- `handleGenerate` 多素材时始终发送 `custom_assignments`；单素材 + `sourceStart > 0` 时也发送
+- 移除不再使用的 `reorderMaterials` 导出
+
+#### 3.2 `HomePage.tsx`
+
+- 在 MaterialSelector 和 BgmPanel 之间插入 TimelineEditor（仅当有配音且已选素材时显示）
+- 底部新增 ClipTrimmer 模态框
+- 移除 `reorderMaterials` 和 `selectedAudioDuration` prop 传递
+
+#### 3.3 `MaterialSelector.tsx`
+
+- 移除配音时长信息栏（功能迁至 TimelineEditor）
+- 移除拖拽排序区（SortableChip + @dnd-kit 相关代码）
+- 移除 `onReorderMaterials` / `selectedAudioDuration` prop
+
+---
+
+### 四、审查修复的 Bug
+
+| # | 严重程度 | 问题 | 修复 |
+|---|---------|------|------|
+| 1 | **中** | `prepare_segment` 使用 `source_start > 0` + stream copy 时 seek 不精确 | 添加 `source_start > 0` 到重编码条件 |
+| 2 | **高** | `stream_loop + source_start` 循环时从视频 0s 开始而非从 source_start 循环 | 改为两步：先裁剪片段再循环裁剪后的文件 |
+| 3 | **低** | `useHomeController` 导出已废弃的 `reorderMaterials` | 移除 |
+
+---
+
+### 涉及文件汇总
+
+#### 后端修改
+
+| 文件 | 变更 |
+|------|------|
+| `backend/app/modules/videos/schemas.py` | 新增 `CustomAssignment` model，`GenerateRequest` 新增 `custom_assignments` 字段 |
+| `backend/app/services/video_service.py` | `prepare_segment` 新增 `source_start` 参数，循环+截取两步处理 |
+| `backend/app/modules/videos/workflow.py` | 多素材/单素材流水线支持 `custom_assignments`，传递 `source_start` |
+
+#### 前端新增
+
+| 文件 | 说明 |
+|------|------|
+| `frontend/src/features/home/model/useTimelineEditor.ts` | 时间轴段管理 hook |
+| `frontend/src/features/home/ui/TimelineEditor.tsx` | 波形 + 色块时间轴组件 |
+| `frontend/src/features/home/ui/ClipTrimmer.tsx` | 素材截取模态框 |
+
+#### 前端修改
+
+| 文件 | 变更 |
+|------|------|
+| `frontend/src/features/home/ui/HomePage.tsx` | 插入 TimelineEditor + ClipTrimmer |
+| `frontend/src/features/home/ui/MaterialSelector.tsx` | 移除时长信息 + 拖拽排序区 + 相关 prop |
+| `frontend/src/features/home/model/useHomeController.ts` | 集成 useTimelineEditor，handleGenerate 发送 custom_assignments |
+| `frontend/package.json` | 新增 `wavesurfer.js` 依赖 |
+
+---
+
+## 🎨 UI 体验优化 + TTS 稳定性修复 — 第三阶段 (Day 23)
+
+### 概述
+
+根据用户反馈，修复 6 项 UI 体验问题，同时修复 Qwen3-TTS 声音克隆服务的 SoX 路径问题和显存缓存管理。
+
+---
+
+### 一、Qwen3-TTS 稳定性修复
+
+#### 1.1 SoX PATH 修复
+
+**问题**: PM2 启动 qwen-tts 时，`sox` 工具安装在 conda env 的 bin 目录中，系统 PATH 找不到，导致音频编解码走 fallback 路径（CPU 密集型），日志中出现 `SoX could not be found!` 警告。
+
+**修复**: `run_qwen_tts.sh` 中 export conda env bin 到 PATH：
+
+```bash
+export PATH="/home/rongye/ProgramFiles/miniconda3/envs/qwen-tts/bin:$PATH"
+```
+
+#### 1.2 CUDA 缓存清理
+
+**修复**: `qwen_tts_server.py` 每次生成完成后（无论成功或失败）调用 `torch.cuda.empty_cache()`，防止显存碎片累积。使用 `asyncio.to_thread()` 在线程池中运行推理，避免阻塞事件循环导致健康检查超时。
+
+---
+
+### 二、配音列表按钮布局统一 (反馈 #1 + #6)
+
+**问题**: `GeneratedAudiosPanel` 的试听按钮位于左侧（独立于 Edit/Delete），与 `RefAudioPanel` 的布局不一致。底部文案摘要区域不需要展示。
+
+**修复**:
+- Play/Edit/Delete 按钮统一放在右侧同组，hover 显示，顺序为 试听→重命名→删除
+- 移除选中配音的文案摘要区域
+- 布局与 RefAudioPanel 一致：左侧名称+时长，右侧操作按钮组
+
+---
+
+### 三、视频素材区域移除配音依赖遮罩 (反馈 #2)
+
+**问题**: MaterialSelector 被 `!selectedAudio` 遮罩覆盖，必须先选配音才能操作素材。
+
+**修复**: 移除 `HomePage.tsx` 中 MaterialSelector 外层的 disabled overlay `<div>`。素材随时可上传/预览/管理，仅 TimelineEditor 需要选中配音才显示（已有独立条件 `selectedAudio && selectedMaterials.length > 0`）。
+
+---
+
+### 四、时间轴拖拽排序 (反馈 #3)
+
+**问题**: TimelineEditor 不支持调换素材顺序。
+
+**修复**:
+- `useTimelineEditor` 已有 `reorderSegments()` 方法（交换两个段的素材信息但保留时间范围）
+- 通过 `useHomeController` 暴露 `reorderSegments`，传入 `TimelineEditor`
+- 色块支持 HTML5 Drag & Drop：`draggable` + `onDragStart/Over/Drop/End`
+- 拖拽时：源色块半透明（`opacity-50`），目标色块高亮 ring（`ring-2 ring-purple-400 scale-[1.02]`）
+- 光标样式：`cursor-grab` / `active:cursor-grabbing`
+
+---
+
+### 五、截取设置双手柄 Range Slider (反馈 #4)
+
+**问题**: ClipTrimmer 使用两个独立的 `<input type="range">` 滑块，起点和终点分开操作，体验不直观。
+
+**修复**: 改为自定义双手柄 range slider：
+- 单条轨道，紫色圆形手柄（起点）+ 粉色圆形手柄（终点）
+- 轨道底色 `bg-white/10`，选中范围用素材对应颜色高亮
+- Pointer Events 实现拖拽：`onPointerDown` 捕获手柄 → `onPointerMove` 更新位置 → `onPointerUp` 释放
+- 手柄互锁约束：起点不超过终点 - 0.5s，终点不低于起点 + 0.5s
+- 底部显示起点（紫色）和终点（粉色）时间标签
+
+---
+
+### 六、截取设置视频预览 (反馈 #5)
+
+**问题**: ClipTrimmer 的视频只能静态查看，无法播放预览截取范围。
+
+**修复**:
+- 视频区域点击可播放/暂停（Play/Pause 图标覆盖层）
+- 播放范围：从 sourceStart 播放到 sourceEnd 自动停止
+- 播放结束后回到起点
+- 拖拽手柄时 `video.currentTime` 实时跟随（seek 到当前位置查看画面）
+- 播放进度条（白色竖线）叠加在 range slider 轨道上
+- `preload="auto"` 预加载视频，确保拖拽时快速 seek
+
+---
+
+### 涉及文件汇总
+
+#### 后端修改
+
+| 文件 | 变更 |
+|------|------|
+| `run_qwen_tts.sh` | export conda env bin 到 PATH，修复 SoX 找不到问题 |
+| `models/Qwen3-TTS/qwen_tts_server.py` | 每次生成后 `torch.cuda.empty_cache()`，asyncio.to_thread 避免阻塞 |
+
+#### 前端修改
+
+| 文件 | 变更 |
+|------|------|
+| `frontend/src/features/home/ui/GeneratedAudiosPanel.tsx` | 按钮布局统一（Play/Edit/Delete 右侧同组），移除文案摘要 |
+| `frontend/src/features/home/ui/HomePage.tsx` | 移除 MaterialSelector 配音遮罩，传入 onReorderSegment |
+| `frontend/src/features/home/ui/TimelineEditor.tsx` | 新增 HTML5 Drag & Drop 排序，新增 onReorderSegment prop |
+| `frontend/src/features/home/ui/ClipTrimmer.tsx` | 双手柄 range slider + 视频播放预览 + 播放进度指示 |
+| `frontend/src/features/home/model/useHomeController.ts` | 暴露 reorderSegments 方法 |
+
+---
+
+## 📝 历史文案保存 + 时间轴拖拽修复 — 第四阶段 (Day 23)
+
+### 概述
+
+新增文案手动保存与加载功能，修复时间轴拖拽排序后素材时长不跟随的 Bug，统一按钮视觉规范。
+
+---
+
+### 一、历史文案保存与加载
+
+#### 功能
+
+用户可手动保存当前文案到历史列表，随时从历史中加载恢复。只有手动保存的文案才出现在历史列表中，与自动保存（`useHomePersistence`）完全独立。
+
+#### UI 布局
+
+```
+按钮栏: [历史文案▼] [文案提取助手] [AI多语言▼] [AI生成标题标签]
+底部栏: 128 字                                    [保存文案]
+```
+
+- **历史文案下拉**: 展示已保存列表（名称 + 日期 + 删除按钮），点击条目加载文案，空列表显示"暂无保存的文案"
+- **保存文案按钮**: 文案为空时 disabled，点击后 `toast.success("文案已保存")`
+- **预计时长已移除**: 底部栏只保留字数 + 保存按钮
+
+#### 实现
+
+##### `useSavedScripts.ts`（新建）
+
+```typescript
+interface SavedScript { id: string; name: string; content: string; savedAt: number }
+```
+
+- localStorage key: `vigent_{storageKey}_savedScripts`
+- `saveScript(content)`: 取前 15 字符自动命名，新条目插入列表头部，**直接写入 localStorage**
+- `deleteScript(id)`: 删除指定条目，直接写入 localStorage
+- `useEffect([lsKey])`: lsKey 变化时（guest → userId）重新从 localStorage 读取
+- **不使用自动持久化 effect**，避免 storageKey 切换时空数组覆盖已有数据
+
+##### 数据流
+
+```
+ScriptEditor (UI)
+  ↑ savedScripts / onSaveScript / onLoadScript / onDeleteScript (纯 props + callbacks)
+  │
+useHomeController
+  ├── useSavedScripts(storageKey) → { savedScripts, saveScript, deleteScript }
+  └── handleSaveScript() → saveScript(text) + toast
+  │
+HomePage
+  └── 传递 props 到 ScriptEditor
+```
+
+---
+
+### 二、时间轴拖拽排序 Bug 修复
+
+#### 问题
+
+拖拽调换素材顺序后，各素材的时长没有跟随素材移动，而是留在原槽位。例如：素材1(3s) + 素材2(8s+4s循环)，拖拽后变成素材2(3s) + 素材1(8s+4s循环)，时长分配没变。
+
+#### 根因
+
+`reorderSegments` 使用**属性交换**方式：逐个拷贝 `materialId`、`sourceStart`、`sourceEnd` 等属性在两个槽位间交换，然后调用 `recalcPositions` 重算位置。
+
+#### 修复
+
+改为**数组移动**（splice）：将整个 segment 对象从旧位置取出，插入到新位置。segment 对象携带全部属性（materialId、sourceStart、sourceEnd、color 等）作为一个整体移动，再由 `recalcPositions` 重算位置。
+
+```typescript
+// 修复前：属性交换
+const fromMat = { materialId: next[fromIdx].materialId, ... };
+const toMat = { materialId: next[toIdx].materialId, ... };
+next[fromIdx] = { ...next[fromIdx], ...toMat };
+next[toIdx] = { ...next[toIdx], ...fromMat };
+
+// 修复后：数组移动
+const [moved] = next.splice(fromIdx, 1);
+next.splice(toIdx, 0, moved);
+```
+
+附带优势：3+ 素材拖拽行为从"交换"变为"插入"，更符合用户直觉。
+
+---
+
+### 三、按钮视觉统一
+
+#### 问题
+
+历史文案、文案提取助手、AI多语言、AI生成标题标签 4 个按钮高度不一致，AI 按钮的文本被 `<span>` 嵌套包裹导致内部布局差异。
+
+#### 修复
+
+- 4 个按钮统一为 `h-7 px-2.5 text-xs rounded inline-flex items-center gap-1`（固定高度 28px）
+- 移除 AI多语言 / AI生成标题标签 按钮内多余的 `<span>` 嵌套，改为 `<>...</>` fragment
+
+---
+
+### 涉及文件汇总
+
+#### 前端新增
+
+| 文件 | 说明 |
+|------|------|
+| `frontend/src/features/home/model/useSavedScripts.ts` | 历史文案 hook（localStorage 持久化） |
+
+#### 前端修改
+
+| 文件 | 变更 |
+|------|------|
+| `frontend/src/features/home/ui/ScriptEditor.tsx` | 历史文案下拉 + 保存按钮 + 移除预计时长 + 按钮高度统一 |
+| `frontend/src/features/home/model/useHomeController.ts` | 集成 useSavedScripts，新增 handleSaveScript |
+| `frontend/src/features/home/ui/HomePage.tsx` | 传递 savedScripts / handleSaveScript / deleteSavedScript 到 ScriptEditor |
+| `frontend/src/features/home/model/useTimelineEditor.ts` | reorderSegments 从属性交换改为数组移动（splice） |
--- a/Docs/DevLogs/Day7.md
+++ b/Docs/DevLogs/Day7.md
@@ -389,7 +389,7 @@ if not qr_element:

 ## 📋 文档规则优化 (16:42 - 17:10)

-**问题**：Doc_Rules需要优化，避免误删历史内容、规范工具使用、防止任务清单遗漏
+**问题**：DOC_RULES需要优化，避免误删历史内容、规范工具使用、防止任务清单遗漏

 **优化内容（最终版）**：

@@ -411,7 +411,7 @@ if not qr_element:
   - 移除无关项目组件

 **修改文件**：
- `Docs/Doc_Rules.md` - 包含检查清单的最终完善版
+- `Docs/DOC_RULES.md` - 包含检查清单的最终完善版

 ---

--- a/Docs/Doc_Rules.md
+++ b/Docs/Doc_Rules.md
@@ -8,8 +8,8 @@

 | 规则 | 说明 |
 |------|------|
-| **默认更新** | 只更新 `DayN.md` |
-| **按需更新** | `task_complete.md` 仅在用户**明确要求**时更新 |
+| **默认更新** | 更新 `DayN.md` 和 `TASK_COMPLETE.md` |
+| **按需更新** | 其他文档仅在内容变化涉及时更新 |
 | **智能修改** | 错误→替换，改进→追加（见下方详细规则） |
 | **先读后写** | 更新前先查看文件当前内容 |
 | **日内合并** | 同一天的多次小修改合并为最终版本 |
@@ -23,13 +23,14 @@
 | 优先级 | 文件路径 | 检查重点 |
 | :---: | :--- | :--- |
 | 🔥 **High** | `Docs/DevLogs/DayN.md` | **(最新日志)** 详细记录变更、修复、代码片段 |
-| 🔥 **High** | `Docs/task_complete.md` | **(任务总览)** 更新 `[x]`、进度条、时间线 |
-| ⚡ **Med** | `README.md` | **(项目主页)** 功能特性、技术栈、最新截图 |
-| ⚡ **Med** | `Docs/DEPLOY_MANUAL.md` | **(部署手册)** 环境变量、依赖包、启动命令变更 |
-| ⚡ **Med** | `Docs/FRONTEND_DEV.md` | **(前端规范)** API封装、日期格式化、新页面规范 |
-| ⚡ **Med** | `Docs/FRONTEND_README.md` | **(前端文档)** 功能说明、页面变更 |
-| 🧊 **Low** | `Docs/implementation_plan.md` | **(实施计划)** 核对计划与实际实现的差异 |
-| 🧊 **Low** | `Docs/architecture_plan.md` | **(前端架构)** 拆分计划与阶段目标 |
+| 🔥 **High** | `Docs/TASK_COMPLETE.md` | **(任务总览)** 更新 `[x]`、进度条、时间线 |
+| ⚡ **Med** | `README.md` | **(项目主页)** 功能特性、技术栈、最新截图 |
+| ⚡ **Med** | `Docs/DEPLOY_MANUAL.md` | **(部署手册)** 环境变量、依赖包、启动命令变更 |
+| ⚡ **Med** | `Docs/BACKEND_DEV.md` | **(后端规范)** 接口契约、模块划分、环境变量 |
+| ⚡ **Med** | `Docs/BACKEND_README.md` | **(后端文档)** 接口说明、架构设计 |
+| ⚡ **Med** | `Docs/FRONTEND_DEV.md` | **(前端规范)** API封装、日期格式化、新页面规范 |
+| ⚡ **Med** | `Docs/FRONTEND_README.md` | **(前端文档)** 功能说明、页面变更 |
+| 🧊 **Low** | `Docs/*_DEPLOY.md` | **(子系统部署)** LatentSync/Qwen3/字幕等独立部署文档 |

 ---

@@ -94,7 +95,7 @@

 ### 必须执行的检查步骤

-**1. 快速浏览全文**（使用 `view_file` 或 `grep_search`）
+**1. 快速浏览全文**（使用 `Read` 或 `Grep`）
 ```markdown
 # 检查是否存在：
 - 同主题的旧章节？
@@ -141,82 +142,61 @@

 > **核心原则**：使用正确的工具，避免字符编码问题

-### ✅ 推荐工具：apply_patch
+### ✅ 推荐工具：Edit / Read / Grep

-**使用场景**：
- 追加新章节到文件末尾
- 修改/替换现有章节内容
- 更新状态标记（🔄 → ✅）
- 修正错误内容
+**使用场景**：
+- `Read`：更新前先查看文件当前内容
+- `Edit`：精确替换现有内容、追加新章节
+- `Grep`：搜索文件中是否已有相关章节
+- `Write`：创建新文件（如 Day{N+1}.md）

-**优势**：
- ✅ 自动处理字符编码（Windows CRLF）
- ✅ 精确替换，不会误删其他内容
- ✅ 有错误提示，方便调试
-
-**注意事项**：
+**注意事项**：
 ```markdown
-1. **必须精确匹配**：TargetContent 必须与文件完全一致
-2. **处理换行符**：文件使用 \r\n，不要漏掉 \r
-3. **合理范围**：StartLine/EndLine 应覆盖目标内容
-4. **先读后写**：编辑前先 view_file 确认内容
+1. **先读后写**：编辑前先用 Read 确认内容
+2. **精确匹配**：Edit 的 old_string 必须与文件内容完全一致
+3. **避免重复**：编辑前用 Grep 检查是否已存在同主题章节
 ```

-### ❌ 禁止使用：命令行工具
+### ❌ 禁止使用：命令行工具修改文档

 **禁止场景**：
- ❌ 使用 `echo >>` 追加内容（编码问题）
- ❌ 使用 PowerShell 直接修改文档（破坏格式）
- ❌ 使用 sed/awk 等命令行工具
+- ❌ 使用 `echo >>` 追加内容
+- ❌ 使用 `sed` / `awk` 修改文档
+- ❌ 使用 `cat <<EOF` 写入内容

 **原因**：
- 容易破坏 UTF-8 编码
- Windows CRLF vs Unix LF 混乱
+- 容易破坏 UTF-8 编码和中文字符
 - 难以追踪修改，容易出错
-
-**唯一例外**：简单的全局文本替换（如批量更新日期），且必须使用 `-NoNewline` 参数
+- 无法精确匹配替换位置

 ### 📝 最佳实践示例

-**追加新章节**：
-```diff
-*** Begin Patch
-*** Update File: Docs/DevLogs/DayN.md
-@@
- ## 🔗 相关文档
- 
- ...
---
-
-## 🆕 新章节
-内容...
-*** End Patch
-```
+**追加新章节**：使用 `Edit` 工具，`old_string` 匹配文件末尾内容，`new_string` 包含原内容 + 新章节。

-**修改现有内容**：
-```diff
-*** Begin Patch
-*** Update File: Docs/DevLogs/DayN.md
-@@
-**状态**：🔄 待修复
-+**状态**：✅ 已修复
-*** End Patch
-```
+**修改现有内容**：使用 `Edit` 工具精确替换。
+```markdown
+old_string: "**状态**：🔄 待修复"
+new_string: "**状态**：✅ 已修复"
+```


 ---

-## 📁 文件结构
+## 📁 文件结构

 ```
-ViGent2/Docs/
-├── task_complete.md              # 任务总览（仅按需更新）
-├── Doc_Rules.md                  # 本文件
-├── FRONTEND_DEV.md               # 前端开发规范
-├── FRONTEND_README.md            # 前端功能文档
-├── architecture_plan.md          # 前端拆分计划
-├── DEPLOY_MANUAL.md              # 部署手册
-├── SUPABASE_DEPLOY.md            # Supabase 部署文档
+ViGent2/Docs/
+├── TASK_COMPLETE.md              # 任务总览（仅按需更新）
+├── DOC_RULES.md                  # 本文件
+├── BACKEND_DEV.md                # 后端开发规范
+├── BACKEND_README.md             # 后端功能文档
+├── FRONTEND_DEV.md               # 前端开发规范
+├── FRONTEND_README.md            # 前端功能文档
+├── DEPLOY_MANUAL.md              # 部署手册
+├── SUPABASE_DEPLOY.md            # Supabase 部署文档
+├── LATENTSYNC_DEPLOY.md          # LatentSync 部署文档
+├── QWEN3_TTS_DEPLOY.md           # 声音克隆部署文档
+├── SUBTITLE_DEPLOY.md            # 字幕系统部署文档
 └── DevLogs/
    ├── Day1.md                   # 开发日志
    └── ...
@@ -224,17 +204,25 @@ ViGent2/Docs/

 ---

-## 📅 DayN.md 更新规则（日常更新）
+## 📅 DayN.md 更新规则（日常更新）
+
+### 更新时机
+
+> **边开发边记录，不要等到最后才写。**
+
+- 每完成一个功能/修复后，**立即**追加到 DayN.md
+- 避免积攒到对话末尾一次性补写，容易遗漏变更
+- `TASK_COMPLETE.md` 同理，重要变更完成后及时同步

 ### 新建判断 (对话开始前)
-1. **回顾进度**：查看 `task_complete.md` 了解当前状态
+1. **回顾进度**：查看 `TASK_COMPLETE.md` 了解当前状态
 2. **检查日期**：查看最新 `DayN.md`
   - **今天 (与当前日期相同)** → 🚨 **绝对禁止创建新文件**，必须**追加**到现有 `DayN.md` 末尾！即使是完全不同的功能模块。
   - **之前 (昨天或更早)** → 创建 `Day{N+1}.md`

-### 追加格式
-```markdown
---
+### 追加格式
+```markdown
+---

 ## 🔧 [章节标题]

@@ -250,18 +238,18 @@ ViGent2/Docs/
 - ✅ 修复了 xxx
 ```

-### 快速修复格式
-```markdown
-## 🐛 [Bug 简述] (HH:MM)
+### 快速修复格式
+```markdown
+## 🐛 [Bug 简述] (HH:MM)

 **问题**：一句话描述
 **修复**：修改了 `文件名` 中的 xxx
-**状态**：✅ 已修复 / 🔄 待验证
-```
-
-### ⚠️ 注意
- **DayN.md 文件开头禁止使用 `---`**，避免被解析为 Front Matter。
- 分隔线只用于章节之间，不作为文件第一行。
+**状态**：✅ 已修复 / 🔄 待验证
+```
+
+### ⚠️ 注意
+- **DayN.md 文件开头禁止使用 `---`**，避免被解析为 Front Matter。
+- 分隔线只用于章节之间，不作为文件第一行。

 ---

@@ -283,17 +271,17 @@ ViGent2/Docs/

 ---

-## 📝 task_complete.md 更新规则（仅按需）
+## 📝 TASK_COMPLETE.md 更新规则

-> ⚠️ **仅当用户明确要求更新 `task_complete.md` 时才更新**
+> 与 DayN.md 同步更新，记录重要变更时更新任务总览。

 ### 更新原则
- **格式一致性**：直接参考 `task_complete.md` 现有格式追加内容。
+- **格式一致性**：直接参考 `TASK_COMPLETE.md` 现有格式追加内容。
 - **进度更新**：仅在阶段性里程碑时更新进度百分比。

 ### 🔍 完整性检查清单 (必做)

-每次更新 `task_complete.md` 时，必须**逐一检查**以下所有板块：
+每次更新 `TASK_COMPLETE.md` 时，必须**逐一检查**以下所有板块：

 1. **文件头部 & 导航**
   - [ ] `更新时间`：必须是当天日期
@@ -316,4 +304,4 @@ ViGent2/Docs/

 ---

-**最后更新**：2026-02-04
+**最后更新**：2026-02-08
--- a/Docs/FRONTEND_DEV.md
+++ b/Docs/FRONTEND_DEV.md
@@ -2,22 +2,74 @@

 ## 目录结构

+采用轻量 FSD（Feature-Sliced Design）结构：
+
 ```
 frontend/src/
-├── app/                # Next.js App Router 页面
-│   ├── page.tsx        # 首页（视频生成）
-│   ├── publish/        # 发布页面
-│   ├── admin/          # 管理员页面
-│   ├── login/          # 登录页面
-│   └── register/       # 注册页面
-├── components/         # 可复用组件
-│   ├── home/            # 首页拆分组件
-│   └── ...
-├── lib/                # 公共工具函数
-│   ├── axios.ts        # Axios 实例（含 401/403 拦截器）
-│   ├── auth.ts         # 认证相关函数
-│   └── media.ts        # API Base / URL / 日期等通用工具
-└── proxy.ts            # 路由代理（原 middleware）
+├── app/                       # Next.js App Router 页面入口
+│   ├── page.tsx               # 首页（视频生成）
+│   ├── publish/               # 发布管理页
+│   ├── admin/                 # 管理员页面
+│   ├── login/                 # 登录
+│   └── register/              # 注册
+├── features/                  # 功能模块（按业务拆分）
+│   ├── home/
+│   │   ├── model/             # 业务逻辑 hooks
+│   │   │   ├── useHomeController.ts    # 主控制器
+│   │   │   ├── useHomePersistence.ts   # 持久化管理
+│   │   │   ├── useBgm.ts
+│   │   │   ├── useGeneratedVideos.ts
+│   │   │   ├── useGeneratedAudios.ts
+│   │   │   ├── useMaterials.ts
+│   │   │   ├── useMediaPlayers.ts
+│   │   │   ├── useRefAudios.ts
+│   │   │   ├── useSavedScripts.ts
+│   │   │   ├── useTimelineEditor.ts
+│   │   │   └── useTitleSubtitleStyles.ts
+│   │   └── ui/                # UI 组件（纯 props + 回调）
+│   │       ├── HomePage.tsx
+│   │       ├── HomeHeader.tsx
+│   │       ├── MaterialSelector.tsx
+│   │       ├── ScriptEditor.tsx
+│   │       ├── ScriptExtractionModal.tsx
+│   │       ├── script-extraction/
+│   │       │   └── useScriptExtraction.ts
+│   │       ├── TitleSubtitlePanel.tsx
+│   │       ├── FloatingStylePreview.tsx
+│   │       ├── VoiceSelector.tsx
+│   │       ├── RefAudioPanel.tsx
+│   │       ├── GeneratedAudiosPanel.tsx
+│   │       ├── TimelineEditor.tsx
+│   │       ├── ClipTrimmer.tsx
+│   │       ├── BgmPanel.tsx
+│   │       ├── GenerateActionBar.tsx
+│   │       ├── PreviewPanel.tsx
+│   │       └── HistoryList.tsx
+│   └── publish/
+│       ├── model/
+│       │   └── usePublishController.ts
+│       └── ui/
+│           └── PublishPage.tsx
+├── shared/                    # 跨功能共享
+│   ├── api/
+│   │   ├── axios.ts           # Axios 实例（含 401/403 拦截器）
+│   │   └── types.ts           # 统一响应类型
+│   ├── lib/
+│   │   ├── media.ts           # API Base / URL / 日期等通用工具
+│   │   ├── auth.ts            # 认证相关函数
+│   │   └── title.ts           # 标题输入处理
+│   ├── hooks/
+│   │   ├── useTitleInput.ts
+│   │   └── usePublishPrefetch.ts
+│   ├── types/
+│   │   ├── user.ts            # User 类型定义
+│   │   └── publish.ts         # 发布相关类型
+│   └── contexts/              # 全局 Context（Auth、Task）
+│       ├── AuthContext.tsx
+│       └── TaskContext.tsx
+├── components/                # 遗留通用组件
+│   └── VideoPreviewModal.tsx
+└── proxy.ts                   # Next.js middleware（路由保护）
 ```

 ---
@@ -228,10 +280,21 @@ import { formatDate } from '@/shared/lib/media';

 ## 轻量 FSD 结构

- `app/`：页面入口，保持轻量
- `features/*/model`：业务逻辑与状态 (hooks)
- `features/*/ui`：功能 UI 组件
- `shared/`：通用工具、通用 hooks、API 实例
+- `app/`：页面入口，保持轻量，只做组合与布局
+- `features/*/model`：业务逻辑与状态（Controller Hook + 子 Hook）
+- `features/*/ui`：功能 UI 组件（纯 props + 回调，不直接发 API）
+- `shared/api`：Axios 实例与统一响应类型
+- `shared/lib`：通用工具函数（media.ts / auth.ts / title.ts）
+- `shared/hooks`：跨功能通用 hooks
+- `shared/types`：跨功能实体类型（User / PublishVideo 等）
+- `shared/contexts`：全局 Context（AuthContext / TaskContext）
+- `components/`：遗留通用组件（VideoPreviewModal）
+
+## 类型定义规范
+
+- 通用实体类型（如 User, Account, Video）统一放置在 `src/shared/types/`。
+- 特定业务类型放在 feature 目录下的 types.ts 或 model 中。
+- **禁止**在多个地方重复定义 User 接口，统一引用 `import { User } from '@/shared/types/user';`。

 ---

@@ -244,12 +307,22 @@ import { formatDate } from '@/shared/lib/media';
  - 标题字号 / 字幕字号
  - 背景音乐选择 / 音量 / 开关状态
  - 素材选择 / 历史作品选择
+  - 选中配音 ID (`selectedAudioId`)
+  - 时间轴段信息 (`useTimelineEditor` 的 localStorage)
+
+### 历史文案（独立持久化）
+
+`useSavedScripts` hook 独立管理历史文案的 localStorage 持久化：
+- key: `vigent_{storageKey}_savedScripts`
+- 仅在用户手动保存/删除时写入 localStorage，不使用自动持久化 effect
+- 与 `useHomePersistence` 完全独立，互不影响

 ### 实施规范
 - 使用 `storageKey = userId || 'guest'`，按用户隔离。
 - **恢复先于保存**：恢复完成前禁止写入（`isRestored` 保护）。
 - 避免默认值覆盖用户选择（优先读取已保存值）。
 - 优先使用 `useHomePersistence` 集中管理恢复/保存，页面内避免分散的 localStorage 读写。
+- **禁止使用签名 URL 作为持久化标识**：Supabase Storage 签名 URL 每次请求都变化，必须使用后端返回的稳定 `id` 字段。
 - 如需新增持久化字段，必须加入恢复与保存逻辑，并更新本节。

 ---
@@ -264,6 +337,14 @@ import { formatDate } from '@/shared/lib/media';

 ---

+## 发布页交互规则
+
+- 发布按钮在未选择任何平台时禁用
+- 仅保留"立即发布"，不再提供定时发布 UI/参数
+- **作品选择持久化**：使用 `video.id`（稳定标识）而非 `video.path`（签名 URL）进行选择、比较和 localStorage 存储。发布时根据 `id` 查找对应 `path` 发送请求。
+
+---
+
 ## 新增页面 Checklist

 1. [ ] 导入 `import api from '@/shared/api/axios'`
--- a/Docs/FRONTEND_README.md
+++ b/Docs/FRONTEND_README.md
@@ -17,6 +17,9 @@ ViGent2 的前端界面，采用 Next.js 16 + TailwindCSS 构建。
 - **作品预览**: 生成完成后直接播放下载（作品预览 + 历史作品）。
 - **预览优化**: 预览视频 `metadata` 预取，首帧加载更快。
 - **本地保存**: 文案/标题/偏好由 `useHomePersistence` 统一持久化，刷新后恢复 (Day 14/17)。
+- **历史文案**: 手动保存/加载/删除历史文案，独立 localStorage 持久化 (Day 23)。
+- **选择持久化**: 首页/发布页作品选择均使用稳定 `id` 持久化，刷新保持用户选择；新视频生成后自动选中最新 (Day 21)。
+- **AI 多语言翻译**: 支持 9 种目标语言翻译文案 + 还原原文 (Day 22)。

 ### 2. 全自动发布 (`/publish`) [Day 7 新增]
 - **多平台管理**: 统一管理抖音、微信视频号、B站、小红书账号状态。
@@ -26,15 +29,25 @@ ViGent2 的前端界面，采用 Next.js 16 + TailwindCSS 构建。
  - Cookie 自动保存与状态同步。
 - **发布配置**: 设置视频标题、标签、简介。
 - **作品选择**: 卡片列表 + 搜索 + 预览弹窗。
+- **选择持久化**: 使用稳定 `video.id` 持久化选择，刷新保持；新视频生成自动选中最新 (Day 21)。
 - **预览兼容**: 签名 URL / 相对路径均可直接预览。
- **定时任务**: 支持 "立即发布" 或 "定时发布"。
+- **发布方式**: 仅支持 "立即发布"。

 ### 3. 声音克隆 [Day 13 新增]
 - **TTS 模式选择**: EdgeTTS (预设音色) / 声音克隆 (自定义音色) 切换。
 - **参考音频管理**: 上传/列表/删除参考音频 (3-20秒 WAV)。
 - **一键克隆**: 选择参考音频后自动调用 Qwen3-TTS 服务。
+- **多语言支持**: EdgeTTS 10 语言声音列表，声音克隆 language 透传 (Day 22)。

-### 4. 字幕与标题 [Day 13 新增]
+### 4. 配音前置 + 时间轴编排 [Day 23 新增]
+- **配音独立生成**: 先生成配音 → 选中配音 → 再选素材 → 生成视频。
+- **配音管理面板**: 生成/试听/改名/删除/选中，异步生成 + 进度轮询。
+- **时间轴编辑器**: wavesurfer.js 音频波形 + 色块可视化素材分配，拖拽分割线调整各段时长。
+- **素材截取设置**: ClipTrimmer 双手柄 range slider + HTML5 视频预览播放。
+- **拖拽排序**: 时间轴色块支持 HTML5 Drag & Drop 调换素材顺序。
+- **自定义分配**: 后端 `custom_assignments` 支持用户定义的素材分配方案。
+
+### 5. 字幕与标题 [Day 13 新增]
 - **片头标题**: 可选输入，限制 15 字，视频开头显示 3 秒淡入淡出标题。
 - **标题同步**: 首页片头标题修改会同步到发布信息标题。
 - **逐字高亮字幕**: 卡拉OK效果，默认开启，可关闭。
@@ -43,16 +56,16 @@ ViGent2 的前端界面，采用 Next.js 16 + TailwindCSS 构建。
 - **默认样式**: 标题 90px 站酷快乐体；字幕 60px 经典黄字 + DingTalkJinBuTi (Day 17)。
 - **样式持久化**: 标题/字幕样式与字号刷新保留 (Day 17)。

-### 5. 背景音乐 [Day 16 新增]
+### 6. 背景音乐 [Day 16 新增]
 - **试听预览**: 点击试听即选中，音量滑块实时生效。
 - **混音控制**: 仅影响 BGM，配音保持原音量。

-### 6. 账户设置 [Day 15 新增]
+### 7. 账户设置 [Day 15 新增]
 - **手机号登录**: 11位中国手机号验证登录。
 - **账户下拉菜单**: 显示有效期 + 修改密码 + 安全退出。
 - **修改密码**: 弹窗输入当前密码与新密码，修改后强制重新登录。

-### 7. 文案提取助手 (`ScriptExtractionModal`) [Day 15 新增]
+### 8. 文案提取助手 (`ScriptExtractionModal`) [Day 15 新增]
 - **多源提取**: 支持文件拖拽上传与 URL 粘贴 (B站/抖音/TikTok)。
 - **AI 洗稿**: 集成 GLM-4.7-Flash，自动改写为口播文案。
 - **一键填入**: 提取结果直接填充至视频生成输入框。
@@ -64,6 +77,7 @@ ViGent2 的前端界面，采用 Next.js 16 + TailwindCSS 构建。
 - **样式**: TailwindCSS
 - **图标**: Lucide React
 - **组件**: 自定义现代化组件 (Glassmorphism 风格)
+- **音频波形**: wavesurfer.js (时间轴编辑器)
 - **API**: Axios 实例 `@/shared/api/axios` (对接后端 FastAPI :8006)

 ## 🚀 开发指南
--- a/Docs/QWEN3_TTS_DEPLOY.md
+++ b/Docs/QWEN3_TTS_DEPLOY.md
@@ -298,12 +298,20 @@ Response: audio/wav 文件
 SoX could not be found!
 ```

-**解决**: 通过 conda 安装 sox：
+**解决**:
+
+1. 通过 conda 安装 sox：

 ```bash
 conda install -y -c conda-forge sox
 ```

+2. 确保启动脚本 `run_qwen_tts.sh` 中已 export conda env bin 到 PATH（PM2 启动时系统 PATH 不含 conda 环境目录）：
+
+```bash
+export PATH="/home/rongye/ProgramFiles/miniconda3/envs/qwen-tts/bin:$PATH"
+```
+
 ### CUDA 内存不足

 Qwen3-TTS 1.7B 通常需要 8-10GB VRAM。如果遇到 OOM：
@@ -371,6 +379,7 @@ FOR INSERT TO anon WITH CHECK (bucket_id = 'ref-audios');

 | 日期 | 版本 | 说明 |
 |------|------|------|
+| 2026-02-09 | 1.2.0 | 修复 SoX PATH 问题（run_qwen_tts.sh export conda bin），每次生成后 empty_cache() |
 | 2026-01-30 | 1.1.0 | 明确默认模型升级为 1.7B-Base，替换旧版 0.6B 路径 |

 ---
--- a/Docs/SUBTITLE_DEPLOY.md
+++ b/Docs/SUBTITLE_DEPLOY.md
@@ -15,9 +15,13 @@
 原有流程:
  文本 → EdgeTTS → 音频 → LatentSync → FFmpeg合成 → 最终视频

-新流程:
-  文本 → EdgeTTS → 音频 ─┬→ LatentSync → 唇形视频 ─┐
-                        └→ faster-whisper → 字幕JSON ─┴→ Remotion合成 → 最终视频
+新流程 (单素材):
+  文本 → EdgeTTS/Qwen3-TTS/预生成配音 → 音频 ─┬→ LatentSync → 唇形视频 ─┐
+                                              └→ faster-whisper → 字幕JSON ─┴→ Remotion合成 → 最终视频
+
+新流程 (多素材):
+  音频 → 多素材按 custom_assignments 拼接 → LatentSync (单次推理) → 唇形视频 ─┐
+  音频 → faster-whisper → 字幕JSON ─────────────────────────────────────────────┴→ Remotion合成 → 最终视频
 ```

 ## 系统要求
@@ -52,6 +56,9 @@ cd /home/rongye/ProgramFiles/ViGent2/remotion

 # 安装依赖
 npm install
+
+# 预编译渲染脚本 (生产环境必须)
+npm run build:render
 ```

 ### 步骤 3: 重启后端服务
@@ -137,7 +144,7 @@ remotion/
 | 阶段 | 进度 | 说明 |
 |------|------|------|
 | 下载素材 | 0% → 5% | 从 Supabase 下载输入视频 |
-| TTS 语音生成 | 5% → 25% | EdgeTTS 或 Qwen3-TTS 生成音频 |
+| TTS 语音生成 | 5% → 25% | EdgeTTS / Qwen3-TTS / 预生成配音下载 |
 | 唇形同步 | 25% → 80% | LatentSync 推理 |
 | 字幕对齐 | 80% → 85% | faster-whisper 生成字级别时间戳 |
 | Remotion 渲染 | 85% → 95% | 合成字幕和标题 |
@@ -279,4 +286,5 @@ WhisperService(device="cuda:0")  # 或 "cuda:1"
 | 日期 | 版本 | 说明 |
 |------|------|------|
 | 2026-01-29 | 1.0.0 | 初始版本，使用 faster-whisper + Remotion 实现逐字高亮字幕和片头标题 |
+| 2026-02-10 | 1.1.0 | 更新架构图：多素材 concat-then-infer、预生成配音选项 |
 | 2026-01-30 | 1.0.1 | 字幕高亮样式与标题动画优化，视觉表现更清晰 |
--- a/Docs/implementation_plan.md
+++ b/Docs/implementation_plan.md
@@ -1,416 +0,0 @@
-# 数字人口播视频生成系统 - 实现计划
-
-## 项目目标
-
-构建一个开源的数字人口播视频生成系统，功能包括：
- 上传静态人物视频 → 生成口播视频（唇形同步）
- TTS 配音或声音克隆
- 字幕自动生成与渲染
- AI 自动生成标题与标签
- 一键发布到多个社交平台
-
---
-
-## 技术架构
-
-```
-┌─────────────────────────────────────────────────────────┐
-│                    前端 (Next.js)                        │
-│         素材管理 | 视频生成 | 发布管理 | 任务状态         │
-└─────────────────────────────────────────────────────────┘
-                            │ REST API
-                            ▼
-┌─────────────────────────────────────────────────────────┐
-│                   后端 (FastAPI)                         │
-├─────────────────────────────────────────────────────────┤
-│  异步任务队列 (asyncio)                                 │
-│  ├── 视频生成任务                                        │
-│  ├── TTS 配音任务                                        │
-│  └── 自动发布任务                                        │
-└─────────────────────────────────────────────────────────┘
-        │               │               │
-        ▼               ▼               ▼
-  ┌──────────┐   ┌──────────┐   ┌──────────┐
-  │LatentSync│   │  FFmpeg  │   │Playwright│
-  │ 唇形同步  │   │ 视频合成  │   │ 自动发布  │
-  └──────────┘   └──────────┘   └──────────┘
-```
-
---
-
-## 技术选型
-
-| 模块 | 技术选择 | 备选方案 |
-|------|----------|----------|
-| **前端框架** | Next.js 16 | Vue 3 + Vite |
-| **UI 组件库** | TailwindCSS (自定义组件) | Ant Design |
-| **后端框架** | FastAPI | Flask |
-| **任务队列** | FastAPI BackgroundTasks (asyncio) | Celery + Redis |
-| **唇形同步** | **LatentSync 1.6** | MuseTalk / Wav2Lip |
-| **TTS 配音** | EdgeTTS | CosyVoice |
-| **声音克隆** | **Qwen3-TTS 1.7B** ✅ | GPT-SoVITS |
-| **视频处理** | FFmpeg | MoviePy |
-| **自动发布** | Playwright | 自行实现 |
-| **数据库** | Supabase (PostgreSQL) | MySQL |
-| **文件存储** | Supabase Storage | 阿里云 OSS |
-
-> **修正 (18:10)**：当前实现采用 Next.js 16、FastAPI BackgroundTasks 与 Supabase Storage/Auth，自动发布基于 Playwright。
-
---
-
-## ✅ 现状补充 (Day 17)
-
- 前端已拆分为组件化结构（`features/home/ui/`），主页面逻辑集中。
- 通用工具 `media.ts` 统一处理 API Base / 资源 URL / 日期格式化。
- 作品预览弹窗统一样式，并支持素材/发布预览复用。
- 标题/字幕预览按素材分辨率缩放，效果更接近成片。
-
---
-
-## 分阶段实施计划
-
-### 阶段一：核心功能验证 (MVP)
-
-> **目标**：验证 LatentSync + EdgeTTS 效果，跑通端到端流程
-
-#### 1.1 环境搭建
-
-参考 `models/LatentSync/DEPLOY.md` 完成 LatentSync 环境与权重部署。
-
-#### 1.2 集成 EdgeTTS
-
-```python
-# tts_engine.py
-import edge_tts
-import asyncio
-
-async def text_to_speech(text: str, voice: str = "zh-CN-YunxiNeural", output_path: str = "output.mp3"):
-    communicate = edge_tts.Communicate(text, voice)
-    await communicate.save(output_path)
-    return output_path
-```
-
-#### 1.3 端到端测试脚本
-
-```python
-# test_pipeline.py
-"""
-1. 文案 → EdgeTTS → 音频
-2. 静态视频 + 音频 → LatentSync → 口播视频
-3. 添加字幕 → FFmpeg → 最终视频
-"""
-```
-
-#### 1.4 验证标准
- [ ] LatentSync 能正常推理
- [ ] 唇形与音频同步率 > 90%
- [ ] 单个视频生成时间 < 2 分钟
-
---
-
-### 阶段二：后端 API 开发
-
-> **目标**：将核心功能封装为 API，支持异步任务
-
-#### 2.1 项目结构
-
-```
-backend/
-├── app/
-│   ├── main.py              # FastAPI 入口
-│   ├── api/
-│   │   ├── videos.py        # 视频生成 API
-│   │   ├── materials.py     # 素材管理 API
-│   │   └── publish.py       # 发布管理 API
-│   ├── services/
-│   │   ├── tts_service.py   # TTS 服务
-│   │   ├── lipsync_service.py  # 唇形同步服务
-│   │   └── video_service.py    # 视频合成服务
-│   ├── tasks/
-│   │   └── celery_tasks.py  # Celery 异步任务
-│   ├── models/
-│   │   └── schemas.py       # Pydantic 模型
-│   └── core/
-│       └── config.py        # 配置管理
-├── requirements.txt
-└── docker-compose.yml       # Redis + API
-```
-
-#### 2.2 核心 API 设计
-
-| 端点 | 方法 | 功能 |
-|------|------|------|
-| `/api/materials` | POST | 上传视频素材 | ✅ |
-| `/api/materials` | GET | 获取素材列表 | ✅ |
-| `/api/videos/generate` | POST | 创建视频生成任务 | ✅ |
-| `/api/videos/tasks/{id}` | GET | 查询任务状态 | ✅ |
-| `/api/videos/generated` | GET | 获取历史作品列表 | ✅ |
-| `/api/publish` | POST | 发布到社交平台 | ✅ |
-
-#### 2.3 BackgroundTasks 任务定义
-
-```python
-# app/api/videos.py
-background_tasks.add_task(_process_video_generation, task_id, req, user_id)
-```
-
---
-
-### 阶段三：前端 Web UI
-
-> **目标**：提供用户友好的操作界面
-
-#### 3.1 页面设计
-
-| 页面 | 功能 |
-|------|------|
-| **素材库** | 上传/管理多场景视频素材 |
-| **生成视频** | 输入文案、选择素材、生成预览 |
-| **任务中心** | 查看生成进度、下载视频 |
-| **发布管理** | 绑定平台、一键发布、定时发布 |
-
-#### 3.2 技术实现
-
-```bash
-# 创建 Next.js 项目
-npx create-next-app@latest frontend --typescript --tailwind --app
-
-# 安装依赖
-cd frontend
-npm install axios swr
-```
-
---
-
-### 阶段四：社交媒体发布
-
-> **目标**：集成 social-auto-upload，支持多平台发布
-
-#### 4.1 复用 social-auto-upload
-
-```bash
-# 复制模块
-cp -r SuperIPAgent/social-auto-upload backend/social_upload
-```
-
-#### 4.2 Cookie 管理
-
-```python
-# 用户通过浏览器登录 → 保存 Cookie → 后续自动发布
-```
-
-#### 4.3 支持平台
- 抖音
- 小红书
- 微信视频号
- 快手
-
---
-
-### 阶段五：优化与扩展
-
-| 功能 | 实现方式 |
-|------|----------|
-| **声音克隆** | 集成 GPT-SoVITS，用自己的声音 |
-| **AI 标题/标签生成** | 调用大模型 API 自动生成标题与标签 ✅ |
-| **批量生成** | 上传 Excel/CSV，批量生成视频 |
-| **字幕编辑器** | 可视化调整字幕样式、位置 |
-| **Docker 部署** | 一键部署到云服务器 | ✅ |
-
---
-
-### 阶段六：MuseTalk 服务器部署 (Day 2-3) ✅
-
-> **目标**：在双显卡服务器上部署 MuseTalk 环境
-
- [x] Conda 环境配置 (musetalk)
- [x] 模型权重下载 (~7GB)
- [x] Subprocess 调用方式实现
- [x] 健康检查功能
-
-### 阶段七：MuseTalk 完整修复 (Day 4) ✅
-
-> **目标**：解决推理脚本的各种兼容性问题
-
- [x] 权重检测路径修复 (软链接)
- [x] 音视频长度不匹配修复
- [x] 推理脚本错误日志增强
- [x] 视频合成 MP4 生成验证
-
-### 阶段八：前端功能增强 (Day 5) ✅
-
-> **目标**：提升用户体验
-
- [x] Web 视频上传功能
- [x] 上传进度显示
- [x] 自动刷新素材列表
-
-### 阶段九：唇形同步模型升级 (Day 6) ✅
-
-> **目标**：从 MuseTalk 迁移到 LatentSync 1.6
-
- [x] MuseTalk → LatentSync 1.6 迁移
- [x] 后端代码适配 (config.py, lipsync_service.py)
- [x] Latent Diffusion 架构 (512x512 高清)
- [x] 服务器端到端验证
-
-### 阶段十：性能优化 (Day 6) ✅
-
-> **目标**：提升系统响应速度和稳定性
-
- [x] 视频预压缩优化 (1080p → 720p 自动适配)
- [x] 进度更新细化 (实时反馈)
- [x] **常驻模型服务** (Persistent Server, 0s 加载)
- [x] **GPU 并发控制** (串行队列防崩溃)
-
-### 阶段十一：社交媒体发布完善 (Day 7) ✅
-
-> **目标**：实现全自动扫码登录和多平台发布
-
- [x] QR码自动登录 (Playwright headless + Stealth)
- [x] 多平台上传器架构 (B站/抖音/小红书)
- [x] Cookie 自动管理
- [x] 定时发布功能
-
-### 阶段十二：用户体验优化 (Day 8) ✅
-
-> **目标**：提升文件管理和历史记录功能
-
- [x] 文件名保留 (时间戳前缀 + 原始名称)
- [x] 视频持久化 (历史视频列表 API)
- [x] 素材/视频删除功能
-
-### 阶段十三：发布模块优化 (Day 9) ✅
-
-> **目标**：代码质量优化 + 发布功能验证
-
- [x] B站/抖音登录+发布验证通过
- [x] 资源清理保障 (try-finally)
- [x] 超时保护 (消除无限循环)
- [x] 完整类型提示
-
-### 阶段十四：用户认证系统 (Day 9) ✅
-
-> **目标**：实现安全、隔离的多用户认证体系
-
- [x] Supabase 云数据库集成 (本地自托管)
- [x] JWT + HttpOnly Cookie 认证架构
- [x] 用户表与权限表设计 (RLS 准备)
- [x] 认证部署文档 (Docs/SUPABASE_DEPLOY.md)
-
-### 阶段十五：部署稳定性优化 (Day 9) ✅
-
-> **目标**：确保生产环境服务长期稳定
-
- [x] 依赖冲突修复 (bcrypt)
- [x] 前端构建修复 (Production Build)
- [x] PM2 进程守护配置
- [x] 部署手册更新 (Docs/DEPLOY_MANUAL.md)
-
-### 阶段十六：HTTPS 全栈部署 (Day 10) ✅
-
-> **目标**：实现安全的公网 HTTPS 访问
-
- [x] 阿里云 Nginx 反向代理配置
- [x] Let's Encrypt SSL 证书集成
- [x] Supabase 自托管部署 (Docker)
- [x] 端口冲突解决 (3003/8008/8444)
- [x] Basic Auth 管理后台保护
-
-### 阶段十七：声音克隆功能集成 (Day 13) ✅
-
-> **目标**：实现用户自定义声音克隆能力
-
- [x] Qwen3-TTS HTTP 服务 (独立 FastAPI，端口 8009)
- [x] 声音克隆服务封装 (voice_clone_service.py)
- [x] 参考音频管理 API (上传/列表/删除)
- [x] 前端 TTS 模式选择 UI
- [x] Supabase ref-audios Bucket 配置
- [x] 端到端测试验证
-
-### 阶段十八：手机号登录迁移 (Day 15) ✅
-
-> **目标**：将认证系统从邮箱迁移到手机号
-
- [x] 数据库 Schema 迁移 (email → phone)
- [x] 后端 API 适配 (auth.py/admin.py)
- [x] 11位手机号校验 (正则验证)
- [x] 修改密码功能 (/api/auth/change-password)
- [x] 账户设置下拉菜单 (修改密码 + 有效期显示 + 退出)
- [x] 前端登录/注册页面更新
- [x] 数据库迁移脚本 (migrate_to_phone.sql)
-
-### 阶段十九：深度性能优化与服务守护 (Day 16) ✅
-
-> **目标**：提升系统响应速度与服务稳定性
-
- [x] Flash Attention 2 集成 (Qwen3-TTS 加速 5x)
- [x] LatentSync 性能调优 (OMP 线程限制 + 原生 Flash Attn)
- [x] Watchdog 服务守护 (自动重启僵死服务)
- [x] 文档体系更新 (部署手册与运维指南)
-
---
-
-## 项目目录结构 (最终)
-
---
-
-## 开发时间估算
-
-| 阶段 | 预计时间 | 说明 |
-|------|----------|------|
-| 阶段一 | 2-3 天 | 环境搭建 + 效果验证 |
-| 阶段二 | 3-4 天 | 后端 API 开发 |
-| 阶段三 | 3-4 天 | 前端 UI 开发 |
-| 阶段四 | 2 天 | 社交发布集成 |
-| 阶段五 | 按需 | 持续优化 |
-
-**总计**：约 10-13 天可完成 MVP
-
---
-
-## 验证计划
-
-### 阶段一验证
-1. 运行 `test_pipeline.py` 脚本
-2. 检查生成视频的唇形同步效果
-3. 确认音画同步
-
-### 阶段二验证
-1. 使用 Postman/curl 测试所有 API 端点
-2. 验证任务队列正常工作
-3. 检查视频生成完整流程
-
-### 阶段三验证
-1. 在浏览器中完成完整操作流程
-2. 验证上传、生成、下载功能
-3. 检查响应式布局
-
-### 阶段四验证
-1. 发布一个测试视频到抖音
-2. 验证定时发布功能
-3. 检查发布状态同步
-
---
-
-## 硬件要求
-
-| 配置 | 最低要求 | 推荐配置 |
-|------|----------|----------|
-| **GPU** | NVIDIA GTX 1060 6GB | RTX 3060 12GB+ |
-| **内存** | 16GB | 32GB |
-| **存储** | 100GB SSD | 500GB SSD |
-| **CUDA** | 11.7+ | 12.0+ |
-
---
-
-## 下一步行动
-
-1. **确认你的 GPU 配置** - MuseTalk 需要 NVIDIA GPU
-2. **选择开发起点** - 从阶段一开始验证效果
-3. **确定项目位置** - 在哪个目录创建项目
-
---
-
-> [!IMPORTANT]
-> 请确认以上计划是否符合你的需求，有任何需要调整的地方请告诉我。
--- a/Docs/task_complete.md
+++ b/Docs/task_complete.md
@@ -1,8 +1,8 @@
 # ViGent2 开发任务清单 (Task Log)

-**项目**: ViGent2 数字人口播视频生成系统  
-**进度**: 100% (Day 18 - 后端模块化与规范完善)  
-**更新时间**: 2026-02-05
+**项目**: ViGent2 数字人口播视频生成系统
+**进度**: 100% (Day 23 - 配音前置重构 + 素材时间轴编排 + UI 体验优化)
+**更新时间**: 2026-02-10

 ---

@@ -10,42 +10,117 @@

 > 这里记录了每一天的核心开发内容与 milestone。

-### Day 18: 后端模块化与规范完善 (Current) 🚀
- [x] **模块化迁移**: 路由透传 `modules/*`，业务逻辑集中到 service/workflow。
- [x] **视频生成拆分**: 生成流程下沉 workflow，任务状态统一 TaskStore。
- [x] **Redis 任务存储**: Redis 优先，不可用自动回退内存。
- [x] **仓储层抽离**: Supabase 访问统一 `repositories/*`，deps/auth/admin 全面替换。
- [x] **响应规范**: 统一 `success/message/data/code` + 全局异常处理。
- [x] **素材重命名**: 新增重命名接口与 Storage `move_file`。
- [x] **平台顺序调整**: 抖音/微信视频号/B站/小红书，移除快手。
- [x] **后端开发规范**: 新增 `BACKEND_DEV.md`，README 同步模块化结构。
- [x] **发布管理体验**: 首页预取路由 + 发布页骨架与缓存，进入更快。
- [x] **素材加载优化**: 素材列表并发签名 URL，骨架数量动态。
- [x] **预览加载优化**: `preload="metadata"` + hover 预取。
-
-### Day 17: 前端重构与体验优化
- [x] **UI 组件拆分**: 首页拆分为独立组件，降低 `page.tsx` 复杂度。
- [x] **轻量 FSD 迁移**: `app` 页面轻量化，逻辑集中到 `features/*/model`，通用能力下沉 `shared/*`。
- [x] **Controller Hooks**: Home/Publish 页面逻辑集中到 Controller Hook，Page 仅组合渲染。
- [x] **通用工具抽取**: `media.ts` 统一 API Base / URL / 日期格式化。
- [x] **交互优化**: 选择项持久化、列表内定位、刷新回顶部、最新作品优先预览。
- [x] **发布页改造**: 作品列表卡片化 + 搜索 + 预览弹窗。
- [x] **预览体验**: 预览弹窗统一头部样式与提示文案。
- [x] **预览一致性**: 标题/字幕预览按素材分辨率缩放。
- [x] **标题同步与限制**: 片头标题同步发布标题，输入法合成态兼容，限制 15 字。
- [x] **样式默认与持久化**: 默认样式与字号调整，刷新保留用户选择。
- [x] **性能微优化**: 列表渲染优化 + 并行请求 + localStorage 防抖。
- [x] **资源能力**: 字体/BGM 资源库 + `/api/assets` 接入。
- [x] **音频与字幕修复**: BGM 混音稳定性与字幕断句优化。
- [x] **持久化修复**: 接入 `useHomePersistence`，恢复 `isRestored` 逻辑并通过构建。
- [x] **预览与选择修复**: 发布预览兼容签名 URL，音频试听路径解析，素材/BGM 回退有效项。
- [x] **体验细节优化**: 录音预览 URL 回收，预览弹窗滚动恢复，全局任务提示挂载。
-
-### Day 16: 深度性能优化
- [x] **Qwen-TTS 加速**: 集成 Flash Attention 2，模型加载速度提升至 8.9s。
- [x] **服务守护**: 开发 `Watchdog` 看门狗机制，自动监控并重启僵死服务。
- [x] **LatentSync 性能确认**: 验证 DeepCache + 原生 Flash Attn 生效。
- [x] **文档重构**: 全面更新 README、部署手册及后端文档。
+### Day 23: 配音前置重构 + 素材时间轴编排 + UI 体验优化 + 历史文案 (Current)
+
+#### 第一阶段：配音前置
+- [x] **配音生成独立化**: 新增 `generated_audios` 后端模块（router/schemas/service），5 个 API 端点，复用现有 TTSService / voice_clone_service / task_store。
+- [x] **配音管理面板**: 前端新增 `useGeneratedAudios` hook + `GeneratedAudiosPanel` 组件，支持生成/试听/改名/删除/选中。
+- [x] **UI 面板重排序**: 文案 → 标题字幕 → 配音方式 → 配音列表 → 素材选择 → BGM → 生成视频。
+- [x] **素材区门控**: 未选中配音时素材区显示遮罩，选中后显示配音时长 + 素材均分信息。
+- [x] **视频生成对接**: workflow.py 新增预生成音频分支（`generated_audio_id`），跳过内联 TTS，向后兼容。
+- [x] **持久化**: selectedAudioId 加入 useHomePersistence，刷新页面恢复选中配音。
+
+#### 第二阶段：素材时间轴编排
+- [x] **时间轴编辑器**: 新增 `TimelineEditor` 组件，wavesurfer.js 音频波形 + 色块可视化素材分配，拖拽分割线调整各段时长。
+- [x] **素材截取设置**: 新增 `ClipTrimmer` 模态框，HTML5 视频预览 + 双端滑块设置源视频截取起点/终点。
+- [x] **后端自定义分配**: 新增 `CustomAssignment` 模型，`prepare_segment` 支持 `source_start`，workflow 多素材/单素材流水线支持 `custom_assignments`。
+- [x] **循环截取修复**: `stream_loop + source_start` 改为两步处理（先裁剪再循环），确保从截取起点循环而非从视频 0s 开始。
+- [x] **MaterialSelector 精简**: 移除旧的时长信息栏和拖拽排序区（功能迁移到 TimelineEditor）。
+
+#### 第三阶段：UI 体验优化 + TTS 稳定性
+- [x] **TTS SoX PATH 修复**: `run_qwen_tts.sh` export conda env bin 到 PATH，修复 `SoX could not be found!` 警告。
+- [x] **TTS 显存管理**: 每次生成后 `torch.cuda.empty_cache()`，asyncio.to_thread 避免阻塞事件循环。
+- [x] **配音列表按钮统一**: Play/Edit/Delete 按钮右侧同组 hover 显示，与 RefAudioPanel 一致，移除文案摘要。
+- [x] **素材区解除配音门控**: 移除 MaterialSelector 的 selectedAudio 遮罩，素材随时可上传管理。
+- [x] **时间轴拖拽排序**: TimelineEditor 色块支持 HTML5 Drag & Drop 调换素材顺序。
+- [x] **截取设置 Range Slider**: ClipTrimmer 改为单轨道双手柄（紫色起点+粉色终点），替换两个独立滑块。
+- [x] **截取设置视频预览**: 视频区域可播放/暂停，从 sourceStart 到 sourceEnd 自动停止，拖拽手柄时实时 seek。
+
+#### 第四阶段：历史文案 + Bug 修复
+- [x] **历史文案保存与加载**: 新增 `useSavedScripts` hook，手动保存/加载/删除历史文案，独立 localStorage 持久化。
+- [x] **时间轴拖拽修复**: `reorderSegments` 从属性交换改为数组移动（splice），修复拖拽后时长不跟随素材的 Bug。
+- [x] **按钮视觉统一**: 文案编辑区 4 个按钮统一为固定高度 `h-7`，移除多余 `<span>` 嵌套。
+- [x] **底部栏调整**: "保存文案"按钮移至底部右侧，移除预计时长显示。
+
+### Day 22: 多素材优化 + AI 翻译 + TTS 多语言
+- [x] **多素材 Bug 修复**: 6 个高优 Bug（边界溢出、单段 fallback、除零、duration 校验、Whisper 兜底、空列表检查）。
+- [x] **架构重构**: 多素材从"逐段 LatentSync"重构为"先拼接再推理"，推理次数 N→1。
+- [x] **前端优化**: payload 安全、进度消息、上传自动选中、Material 接口统一、拖拽修复、素材上限 4 个。
+- [x] **AI 多语言翻译**: 新增 `/api/ai/translate` 接口，前端 9 种语言翻译 + 还原原文。
+- [x] **TTS 多语言**: EdgeTTS 10 语言声音列表、翻译自动切换声音、声音克隆 language 透传、textLang 持久化。
+
+### Day 21: 缺陷修复 + 浮动预览 + 发布重构 + 架构优化 + 多素材生成
+- [x] **Remotion 崩溃容错**: 渲染进程 SIGABRT 退出时检查输出文件，避免误判失败导致标题/字幕丢失。
+- [x] **首页作品选择持久化**: 修复 `fetchGeneratedVideos` 无条件覆盖恢复值的问题，新增 `preferVideoId` 参数控制选中逻辑。
+- [x] **发布页作品选择持久化**: 根因为签名 URL 不稳定，全面改用 `video.id` 替代 `path` 进行选择/持久化/比较。
+- [x] **预取缓存补全**: 首页预取发布页数据时加入 `id` 字段，确保缓存数据可用于持久化匹配。
+- [x] **浮动样式预览窗口**: 标题字幕预览改为 `position: fixed` 浮动窗口，固定左上角，滚动时始终可见。
+- [x] **移动端适配**: ScriptEditor 按钮换行、预览默认比例改为 9:16 竖屏。
+- [x] **多平台发布重构**: 平台配置独立化（DOUYIN_*/WEIXIN_*）、用户隔离 Cookie 管理、抖音刷脸验证二维码、微信发布流程优化。
+- [x] **前端结构微调**: ScriptExtractionModal 迁移到 features/、contexts 迁移到 shared/contexts/、清理空目录。
+- [x] **后端模块分层**: materials/tools/ref_audios 三个模块补全 router+schemas+service 分层。
+- [x] **开发规范更新**: BACKEND_DEV.md 新增渐进原则、DOC_RULES.md 取消 TASK_COMPLETE.md 手动触发约束。
+- [x] **文档全面更新**: BACKEND_DEV/README、FRONTEND_DEV、DEPLOY_MANUAL、README.md 同步更新。
+- [x] **多素材视频生成（多机位效果）**: 支持多选素材 + 拖拽排序，按素材数量均分音频时长（对齐 Whisper 字边界）自动切换机位。逐段 LatentSync + FFmpeg 拼接。前端 @dnd-kit 拖拽排序 UI。
+- [x] **字幕开关移除**: 默认启用逐字高亮字幕，移除开关及相关死代码。
+- [x] **视频格式扩展**: 上传支持 mkv/webm/flv/wmv/m4v/ts/mts 等常见格式。
+- [x] **Watchdog 优化**: 健康检查阈值提高到 5 次，新增重启冷却期 120 秒，避免误重启。
+- [x] **多素材 Bug 修复**: 修复标点分句方案对无句末标点文案无效（改为均分方案）、音频时间偏移导致口型不对齐等缺陷。
+
+### Day 20: 代码质量与安全优化
+- [x] **功能性修复**: LatentSync 回退逻辑、任务状态接口认证、User 类型统一。
+- [x] **性能优化**: N+1 查询修复、视频上传流式处理、httpx 异步替换、GLM 异步包装。
+- [x] **安全修复**: 硬编码 Cookie 配置化、日志敏感信息脱敏、ffprobe 安全调用、CORS 配置化。
+- [x] **配置优化**: 存储路径环境变量化、Remotion 预编译加速、LatentSync 绝对路径。
+- [x] **文档更新**: 更新 DOC_RULES.md 清单，补齐后端与部署文档；更新 SUBTITLE_DEPLOY.md, FRONTEND_DEV.md, implementation_plan.md。
+- [x] **缺陷修复**: 修复 Remotion 路径解析、发布页持久化竞态、首页选中回归、素材闭包陷阱。
+
+### Day 19: 自动发布稳定性与发布体验优化 🚀
+- [x] **抖音发布稳定性**: 上传入口、封面流程、发布重试、登录失效识别与网络失败快速返回全面增强。
+- [x] **视频号发布修复**: 标题+标签统一写入“视频描述”，`post_create` 成功信号快速判定，超时改为失败返回。
+- [x] **成功截图闭环**: 抖音/视频号发布成功截图接入前端，支持用户隔离存储与鉴权访问。
+- [x] **截图观感优化**: 成功截图延后 3 秒并改为视口截图，修复“截图内容仅占 1/3”问题。
+- [x] **调试能力开关化**: 新增视频号录屏配置，默认可按环境变量开关，失败排障更直观。
+- [x] **启动链路统一**: 合并为 `run_backend.sh`（xvfb + headful），统一端口 `8006`，减少多进程混淆。
+- [x] **发布页防误操作**: 发布中按钮提示“请勿刷新或关闭网页”，并启用刷新/关页二次确认拦截。
+- [ ] **后续优化**: 发布任务状态恢复机制（任务化 + 状态持久化 + 前端轮询恢复）。
+
+### Day 18: 后端模块化与规范完善
+- [x] **模块化迁移**: 路由透传 `modules/*`，业务逻辑集中到 service/workflow。
+- [x] **视频生成拆分**: 生成流程下沉 workflow，任务状态统一 TaskStore。
+- [x] **Redis 任务存储**: Redis 优先，不可用自动回退内存。
+- [x] **仓储层抽离**: Supabase 访问统一 `repositories/*`，deps/auth/admin 全面替换。
+- [x] **响应规范**: 统一 `success/message/data/code` + 全局异常处理。
+- [x] **素材重命名**: 新增重命名接口与 Storage `move_file`。
+- [x] **平台顺序调整**: 抖音/微信视频号/B站/小红书，移除快手。
+- [x] **后端开发规范**: 新增 `BACKEND_DEV.md`，README 同步模块化结构。
+- [x] **发布管理体验**: 首页预取路由 + 发布页骨架与缓存，进入更快。
+- [x] **素材加载优化**: 素材列表并发签名 URL，骨架数量动态。
+- [x] **预览加载优化**: `preload="metadata"` + hover 预取。
+
+### Day 17: 前端重构与体验优化
+- [x] **UI 组件拆分**: 首页拆分为独立组件，降低 `page.tsx` 复杂度。
+- [x] **轻量 FSD 迁移**: `app` 页面轻量化，逻辑集中到 `features/*/model`，通用能力下沉 `shared/*`。
+- [x] **Controller Hooks**: Home/Publish 页面逻辑集中到 Controller Hook，Page 仅组合渲染。
+- [x] **通用工具抽取**: `media.ts` 统一 API Base / URL / 日期格式化。
+- [x] **交互优化**: 选择项持久化、列表内定位、刷新回顶部、最新作品优先预览。
+- [x] **发布页改造**: 作品列表卡片化 + 搜索 + 预览弹窗。
+- [x] **预览体验**: 预览弹窗统一头部样式与提示文案。
+- [x] **预览一致性**: 标题/字幕预览按素材分辨率缩放。
+- [x] **标题同步与限制**: 片头标题同步发布标题，输入法合成态兼容，限制 15 字。
+- [x] **样式默认与持久化**: 默认样式与字号调整，刷新保留用户选择。
+- [x] **性能微优化**: 列表渲染优化 + 并行请求 + localStorage 防抖。
+- [x] **资源能力**: 字体/BGM 资源库 + `/api/assets` 接入。
+- [x] **音频与字幕修复**: BGM 混音稳定性与字幕断句优化。
+- [x] **持久化修复**: 接入 `useHomePersistence`，恢复 `isRestored` 逻辑并通过构建。
+- [x] **预览与选择修复**: 发布预览兼容签名 URL，音频试听路径解析，素材/BGM 回退有效项。
+- [x] **体验细节优化**: 录音预览 URL 回收，预览弹窗滚动恢复，全局任务提示挂载。
+
+### Day 16: 深度性能优化
+- [x] **Qwen-TTS 加速**: 集成 Flash Attention 2，模型加载速度提升至 8.9s。
+- [x] **服务守护**: 开发 `Watchdog` 看门狗机制，自动监控并重启僵死服务。
+- [x] **LatentSync 性能确认**: 验证 DeepCache + 原生 Flash Attn 生效。
+- [x] **文档重构**: 全面更新 README、部署手册及后端文档。

 ### Day 15: 手机号认证迁移
 - [x] **认证系统升级**: 从邮箱迁移至 11 位手机号注册/登录。
@@ -93,8 +168,10 @@
 ## 🛤️ 后续规划 (Roadmap)

 ### 🔴 优先待办
+- [x] ~~**配音前置重构 — 第二阶段**: 素材片段截取 + 语音时间轴编排~~ ✅ Day 23 已完成
 - [ ] **批量生成架构**: 支持 Excel 导入，批量生产视频。
 - [ ] **定时任务后台化**: 迁移前端触发的定时发布到后端 APScheduler。
+- [ ] **发布任务恢复机制**: 发布任务化 + 状态持久化 + 前端断点恢复，解决刷新后状态丢失。

 ### 🔵 长期探索
 - [ ] **容器化交付**: 提供完整的 Docker Compose 一键部署包。
@@ -109,8 +186,8 @@
 | **核心 API** | 100% | ✅ 稳定 |
 | **Web UI** | 100% | ✅ 稳定 (移动端适配) |
 | **唇形同步** | 100% | ✅ LatentSync 1.6 |
-| **TTS 配音** | 100% | ✅ EdgeTTS + Qwen3 |
-| **自动发布** | 100% | ✅ B站/抖音/小红书 |
+| **TTS 配音** | 100% | ✅ EdgeTTS + Qwen3 + 配音前置 + 时间轴编排 |
+| **自动发布** | 100% | ✅ 抖音/微信视频号/B站/小红书 |
 | **用户认证** | 100% | ✅ 手机号 + JWT |
 | **部署运维** | 100% | ✅ PM2 + Watchdog |

@@ -118,5 +195,5 @@

 ## 📎 相关文档

- [详细开发日志 (DevLogs)](file:///d:/CodingProjects/Antigravity/ViGent2/Docs/DevLogs/)
- [部署手册 (DEPLOY_MANUAL)](file:///d:/CodingProjects/Antigravity/ViGent2/Docs/DEPLOY_MANUAL.md)
+- [详细开发日志 (DevLogs)](Docs/DevLogs/)
+- [部署手册 (DEPLOY_MANUAL)](Docs/DEPLOY_MANUAL.md)
--- a/README.md
+++ b/README.md
@@ -15,22 +15,25 @@

 ## ✨ 功能特性

-### 核心能力
- 🎬 **高清唇形同步** - LatentSync 1.6 驱动，512×512 高分辨率 Latent Diffusion 模型。
- 🎙️ **多模态配音** - 支持 **EdgeTTS** (微软超自然语音) 和 **Qwen3-TTS** (3秒极速声音克隆)。
- 📝 **智能字幕** - 集成 faster-whisper + Remotion，自动生成逐字高亮 (卡拉OK效果) 字幕。
- 🎨 **样式预设** - 标题/字幕样式选择 + 预览 + 字号调节，支持自定义字体库。
- 🖼️ **作品预览一致性** - 标题/字幕预览按素材分辨率缩放，效果更接近成片。
- 💾 **用户偏好持久化** - 首页状态统一恢复/保存，刷新后延续上次配置。
- 🎵 **背景音乐** - 试听 + 音量控制 + 混音，保持配音音量稳定。
- 🤖 **AI 辅助创作** - 内置 GLM-4.7-Flash，支持 B站/抖音链接文案提取、AI 洗稿、标题/标签自动生成。
+### 核心能力
+- 🎬 **高清唇形同步** - LatentSync 1.6 驱动，512×512 高分辨率 Latent Diffusion 模型。
+- 🎙️ **多模态配音** - 支持 **EdgeTTS** (微软超自然语音, 10 语言) 和 **Qwen3-TTS** (3秒极速声音克隆)。配音前置工作流：先生成配音 → 选素材 → 生成视频。
+- 📝 **智能字幕** - 集成 faster-whisper + Remotion，自动生成逐字高亮 (卡拉OK效果) 字幕。
+- 🎨 **样式预设** - 标题/字幕样式选择 + 预览 + 字号调节，支持自定义字体库。
+- 🖼️ **作品预览一致性** - 标题/字幕预览按素材分辨率缩放，效果更接近成片。
+- 🎞️ **多素材多机位** - 支持多选素材 + 时间轴编辑器 (wavesurfer.js 波形可视化)，拖拽分割线调整时长、拖拽排序切换机位、截取源视频片段。
+- 💾 **用户偏好持久化** - 首页状态统一恢复/保存，刷新后延续上次配置。历史文案手动保存与加载。
+- 🎵 **背景音乐** - 试听 + 音量控制 + 混音，保持配音音量稳定。
+- 🤖 **AI 辅助创作** - 内置 GLM-4.7-Flash，支持 B站/抖音链接文案提取、AI 洗稿、标题/标签自动生成、9 语言翻译。

-### 平台化功能
- 📱 **全自动发布** - 支持抖音/B站/小红书定时发布，微信视频号预留配置；扫码登录 + Cookie 持久化。
- 🖥️ **发布管理预览** - 支持签名 URL / 相对路径作品预览，确保可直接播放。
- 🔐 **认证与隔离** - 基于 Supabase 的用户隔离，支持手机号注册/登录、密码管理。
+### 平台化功能
+- 📱 **全自动发布** - 支持抖音/微信视频号/B站/小红书立即发布；扫码登录 + Cookie 持久化。
+- 🖥️ **发布管理预览** - 支持签名 URL / 相对路径作品预览，确保可直接播放。
+- 📸 **发布结果可视化** - 抖音/微信视频号发布成功后返回截图，发布页结果卡片可直接查看。
+- 🛡️ **发布防误操作** - 发布进行中自动提示“请勿刷新或关闭网页”，并拦截刷新/关页二次确认。
+- 🔐 **认证与隔离** - 基于 Supabase 的用户隔离，支持手机号注册/登录、密码管理。
 - 🛡️ **服务守护** - 内置 Watchdog 看门狗机制，自动监控并重启僵死服务，确保 7x24h 稳定运行。
- 🚀 **性能优化** - 视频预压缩、模型常驻服务（近实时加载）、双 GPU 流水线并发。
+- 🚀 **性能优化** - 视频预压缩、模型常驻服务（近实时加载）、双 GPU 流水线并发。

 ---

@@ -38,7 +41,7 @@

 | 领域 | 核心技术 | 说明 |
 |------|----------|------|
-| **前端** | Next.js 16 | TypeScript, TailwindCSS, SWR |
+| **前端** | Next.js 16 | TypeScript, TailwindCSS, SWR, wavesurfer.js |
 | **后端** | FastAPI | Python 3.10, AsyncIO, PM2 |
 | **数据库** | Supabase | PostgreSQL, Storage (本地/S3), Auth |
 | **唇形同步** | LatentSync 1.6 | PyTorch 2.5, Diffusers, DeepCache |
@@ -56,11 +59,11 @@
 - **[部署手册 (DEPLOY_MANUAL.md)](Docs/DEPLOY_MANUAL.md)** - 👈 **部署请看这里**！包含完整的环境搭建步骤。
 - [参考音频服务部署 (QWEN3_TTS_DEPLOY.md)](Docs/QWEN3_TTS_DEPLOY.md) - 声音克隆模型部署指南。
 - [LatentSync 部署指南](models/LatentSync/DEPLOY.md) - 唇形同步模型独立部署。
- [用户认证部署 (AUTH_DEPLOY.md)](Docs/AUTH_DEPLOY.md) - Supabase 与 Auth 系统配置。
+- [Supabase 部署指南 (SUPABASE_DEPLOY.md)](Docs/SUPABASE_DEPLOY.md) - Supabase 与认证系统配置。

-### 开发文档
- [后端开发指南](Docs/BACKEND_README.md) - 接口规范与开发流程。
- [后端开发规范](Docs/BACKEND_DEV.md) - 分层约定与开发习惯。
+### 开发文档
+- [后端开发指南](Docs/BACKEND_README.md) - 接口规范与开发流程。
+- [后端开发规范](Docs/BACKEND_DEV.md) - 分层约定与开发习惯。
 - [前端开发指南](Docs/FRONTEND_DEV.md) - UI 组件与页面规范。
 - [开发日志 (DevLogs)](Docs/DevLogs/) - 每日开发进度与技术决策记录。

@@ -72,9 +75,11 @@
 ViGent2/
 ├── backend/              # FastAPI 后端服务
 │   ├── app/              # 核心业务逻辑
-│   ├── scripts/          # 运维脚本 (Watchdog 等)
-│   └── tests/            # 测试用例
+│   ├── assets/           # 字体 / 样式 / BGM
+│   ├── user_data/        # 用户隔离数据 (Cookie 等)
+│   └── scripts/          # 运维脚本 (Watchdog 等)
 ├── frontend/             # Next.js 前端应用
+├── remotion/             # Remotion 视频渲染 (标题/字幕合成)
 ├── models/               # AI 模型仓库
 │   ├── LatentSync/       # 唇形同步服务
 │   └── Qwen3-TTS/        # 声音克隆服务
--- a/backend/.env.example
+++ b/backend/.env.example
@@ -15,7 +15,6 @@ DEFAULT_TTS_VOICE=zh-CN-YunxiNeural
 # GPU 选择 (0=第一块GPU, 1=第二块GPU)
 LATENTSYNC_GPU_ID=1

-# 使用本地模式 (true) 或远程 API (false)
 # 使用本地模式 (true) 或远程 API (false)
 LATENTSYNC_LOCAL=true

@@ -66,3 +65,11 @@ ADMIN_PASSWORD=lam1988324
 # 智谱 GLM API 配置 (用于生成标题和标签)
 GLM_API_KEY=32440cd3f3444d1f8fe721304acea8bd.YXNLrk7eIJMKcg4t
 GLM_MODEL=glm-4.7-flash
+
+# =============== Supabase Storage 本地路径 ===============
+# 确保存储卷映射正确，避免硬编码路径
+SUPABASE_STORAGE_LOCAL_PATH=/home/rongye/ProgramFiles/Supabase/volumes/storage/stub/stub
+
+# =============== 抖音视频下载 Cookie ===============
+# 用于从抖音 URL 提取视频文案功能，会过期需要定期更新
+DOUYIN_COOKIE=douyin.com; device_web_cpu_core=10; device_web_memory_size=8; __ac_nonce=06760391f00b9b51264ae; __ac_signature=_02B4Z6wo00f019a5ceAAAIDAhEZR-X3jjWfWmXVAAJLXd4; ttwid=1%7C7MTKBSMsP4eOv9h5NAh8p0E-NYIud09ftNmB0mjLpWc%7C1734359327%7C8794abeabbd47447e1f56e5abc726be089f2a0344d6343b5f75f23e7b0f0028f; UIFID_TEMP=0de8750d2b188f4235dbfd208e44abbb976428f0720eb983255afefa45d39c0c6532e1d4768dd8587bf919f866ff1396912bcb2af71efee56a14a2a9f37b74010d0a0413795262f6d4afe02a032ac7ab; s_v_web_id=verify_m4r4ribr_c7krmY1z_WoeI_43po_ATpO_I4o8U1bex2D7; hevc_supported=true; home_can_add_dy_2_desktop=%220%22; dy_swidth=2560; dy_sheight=1440; stream_recommend_feed_params=%22%7B%5C%22cookie_enabled%5C%22%3Atrue%2C%5C%22screen_width%5C%22%3A2560%2C%5C%22screen_height%5C%22%3A1440%2C%5C%22browser_online%5C%22%3Atrue%2C%5C%22cpu_core_num%5C%22%3A10%2C%5C%22device_memory%5C%22%3A8%2C%5C%22downlink%5C%22%3A10%2C%5C%22effective_type%5C%22%3A%5C%224g%5C%22%2C%5C%22round_trip_time%5C%22%3A50%7D%22; strategyABtestKey=%221734359328.577%22; csrf_session_id=2f53aed9aa6974e83aa9a1014180c3a4; fpk1=U2FsdGVkX1/IpBh0qdmlKAVhGyYHgur4/VtL9AReZoeSxadXn4juKvsakahRGqjxOPytHWspYoBogyhS/V6QSw==; fpk2=0845b309c7b9b957afd9ecf775a4c21f; passport_csrf_token=d80e0c5b2fa2328219856be5ba7e671e; passport_csrf_token_default=d80e0c5b2fa2328219856be5ba7e671e; odin_tt=3c891091d2eb0f4718c1d5645bc4a0017032d4d5aa989decb729e9da2ad570918cbe5e9133dc6b145fa8c758de98efe32ff1f81aa0d611e838cc73ab08ef7d3f6adf66ab4d10e8372ddd628f94f16b8e; volume_info=%7B%22isUserMute%22%3Afalse%2C%22isMute%22%3Afalse%2C%22volume%22%3A0.5%7D; bd_ticket_guard_client_web_domain=2; FORCE_LOGIN=%7B%22videoConsumedRemainSeconds%22%3A180%7D; UIFID=0de8750d2b188f4235dbfd208e44abbb976428f0720eb983255afefa45d39c0c6532e1d4768dd8587bf919f866ff139655a3c2b735923234f371c699560c657923fd3d6c5b63ab7bb9b83423b6cb4787e2ce66a7fbc4ecb24c8570f520fe6de068bbb95115023c0c6c1b6ee31b49fb7e3996fb8349f43a3fd8b7a61cd9e18e8fe65eb6a7c13de4c0960d84e344b644725db3eb2fa6b7caf821de1b50527979f2; is_dash_user=1; biz_trace_id=b57a241f; bd_ticket_guard_client_data=eyJiZC10aWNrZXQtZ3VhcmQtdmVyc2lvbiI6MiwiYmQtdGlja2V0LWd1YXJkLWl0ZXJhdGlvbi12ZXJzaW9uIjoxLCJiZC10aWNrZXQtZ3VhcmQtcmVlLXB1YmxpYy1rZXkiOiJCTEo2R0lDalVoWW1XcHpGOFdrN0Vrc0dXcCtaUzNKY1g4NGNGY2k0TTl1TEowNjdUb21mbFU5aDdvWVBGamhNRWNRQWtKdnN1MnM3RmpTWnlJQXpHMjA9IiwiYmQtdGlja2V0LWd1YXJkLXdlYi12ZXJzaW9uIjoyfQ%3D%3D; download_guide=%221%2F20241216%2F0%22; sdk_source_info=7e276470716a68645a606960273f276364697660272927676c715a6d6069756077273f276364697660272927666d776a68605a607d71606b766c6a6b5a7666776c7571273f275e58272927666a6b766a69605a696c6061273f27636469766027292762696a6764695a7364776c6467696076273f275e5827292771273f273d33323131333c3036313632342778; bit_env=RiOY4jzzpxZoVCl6zdVSVhVRjdwHRTxqcqWdqMBZLPGjMdB4Tax1kAELHNTVAAh72KuhumewE4Lq6f0-VJ2UpJrkrhSxoPw9LUb3zQrq1OSwbeSPHkRlRgRQvO89sItdGUyq1oFr0XyRCnMYG87KSeWyc4x0czGR0o50hTDoDLG5rJVoRcdQOLvjiAegsqyytKF59sPX_QM9qffK2SqYsg0hCggURc_AI6kguDDE5DvG0bnyz1utw4z1eEnIoLrkGDqzqBZj4dOAr0BVU6ofbsS-pOQ2u2PM1dLP9FlBVBlVaqYVgHJeSLsR5k76BRTddUjTb4zEilVIEwAMJWGN4I1BxVt6fC9B5tBQpuT0lj3n3eKXCKXZsd8FrEs5_pbfDsxV-e_WMiXI2ff4qxiTC0U73sfo9OpicKICtZjdq8qsHxJuu6wVR36zvXeL2Wch5C6MzprNvkivv0l8nbh2mSgy1nabZr3dmU6NcR-Bg3Q3xTWUlR9aAUmpopC-cNuXjgLpT-Lw1AYGilSUnCvosth1Gfypq-b0MpgmdSDgTrQ%3D; gulu_source_res=eyJwX2luIjoiMDhjOGQ3ZTJiODQyNjZkZWI5Y2VkMGJiODNlNmY1ZWY0ZjMyNTE2ZmYyZjAzNDMzZjI0OWU1Y2Q1NTczNTk5NyJ9; passport_auth_mix_state=hp9bc3dgb1tm5wd8p82zawus27g0e3ue; IsDouyinActive=false
--- a/backend/app/api/init.py
+++ b/backend/app/api/init.py
@@ -1,10 +0,0 @@
-from . import admin
-from . import ai
-from . import assets
-from . import auth
-from . import login_helper
-from . import materials
-from . import publish
-from . import ref_audios
-from . import tools
-from . import videos
--- a/backend/app/api/admin.py
+++ b/backend/app/api/admin.py
@@ -1 +0,0 @@
-from app.modules.admin.router import router
--- a/backend/app/api/ai.py
+++ b/backend/app/api/ai.py
@@ -1 +0,0 @@
-from app.modules.ai.router import router
--- a/backend/app/api/assets.py
+++ b/backend/app/api/assets.py
@@ -1 +0,0 @@
-from app.modules.assets.router import router
--- a/backend/app/api/auth.py
+++ b/backend/app/api/auth.py
@@ -1 +0,0 @@
-from app.modules.auth.router import router
--- a/backend/app/api/login_helper.py
+++ b/backend/app/api/login_helper.py
@@ -1 +0,0 @@
-from app.modules.login_helper.router import router
--- a/backend/app/api/materials.py
+++ b/backend/app/api/materials.py
@@ -1 +0,0 @@
-from app.modules.materials.router import router
--- a/backend/app/api/publish.py
+++ b/backend/app/api/publish.py
@@ -1 +0,0 @@
-from app.modules.publish.router import router
--- a/backend/app/api/ref_audios.py
+++ b/backend/app/api/ref_audios.py
@@ -1 +0,0 @@
-from app.modules.ref_audios.router import router
--- a/backend/app/api/tools.py
+++ b/backend/app/api/tools.py
@@ -1 +0,0 @@
-from app.modules.tools.router import router
--- a/backend/app/api/videos.py
+++ b/backend/app/api/videos.py
@@ -1 +0,0 @@
-from app.modules.videos.router import router
--- a/backend/app/core/config.py
+++ b/backend/app/core/config.py
@@ -3,14 +3,46 @@ from pathlib import Path

 class Settings(BaseSettings):
    # 基础路径配置
-    BASE_DIR: Path = Path(__file__).resolve().parent.parent
-    UPLOAD_DIR: Path = BASE_DIR.parent / "uploads"
-    OUTPUT_DIR: Path = BASE_DIR.parent / "outputs"
-    ASSETS_DIR: Path = BASE_DIR.parent / "assets"
+    BASE_DIR: Path = Path(__file__).resolve().parent.parent
+    UPLOAD_DIR: Path = BASE_DIR.parent / "uploads"
+    OUTPUT_DIR: Path = BASE_DIR.parent / "outputs"
+    ASSETS_DIR: Path = BASE_DIR.parent / "assets"
+    PUBLISH_SCREENSHOT_DIR: Path = BASE_DIR.parent / "private_outputs" / "publish_screenshots"
    
    # 数据库/缓存
    REDIS_URL: str = "redis://localhost:6379/0"
    DEBUG: bool = True
+
+    # Playwright 配置
+    WEIXIN_HEADLESS_MODE: str = "headless-new"
+    WEIXIN_USER_AGENT: str = "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36"
+    WEIXIN_LOCALE: str = "zh-CN"
+    WEIXIN_TIMEZONE_ID: str = "Asia/Shanghai"
+    WEIXIN_CHROME_PATH: str = "/usr/bin/google-chrome"
+    WEIXIN_BROWSER_CHANNEL: str = ""
+    WEIXIN_FORCE_SWIFTSHADER: bool = True
+    WEIXIN_TRANSCODE_MODE: str = "reencode"
+    WEIXIN_DEBUG_ARTIFACTS: bool = False
+    WEIXIN_RECORD_VIDEO: bool = False
+    WEIXIN_KEEP_SUCCESS_VIDEO: bool = False
+    WEIXIN_RECORD_VIDEO_WIDTH: int = 1280
+    WEIXIN_RECORD_VIDEO_HEIGHT: int = 720
+
+    # Douyin Playwright 配置
+    DOUYIN_HEADLESS_MODE: str = "headless-new"
+    DOUYIN_USER_AGENT: str = "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/144.0.0.0 Safari/537.36"
+    DOUYIN_LOCALE: str = "zh-CN"
+    DOUYIN_TIMEZONE_ID: str = "Asia/Shanghai"
+    DOUYIN_CHROME_PATH: str = "/usr/bin/google-chrome"
+    DOUYIN_BROWSER_CHANNEL: str = ""
+    DOUYIN_FORCE_SWIFTSHADER: bool = True
+
+    # Douyin 调试录屏
+    DOUYIN_DEBUG_ARTIFACTS: bool = False
+    DOUYIN_RECORD_VIDEO: bool = False
+    DOUYIN_KEEP_SUCCESS_VIDEO: bool = False
+    DOUYIN_RECORD_VIDEO_WIDTH: int = 1280
+    DOUYIN_RECORD_VIDEO_HEIGHT: int = 720
    
    # TTS 配置
    DEFAULT_TTS_VOICE: str = "zh-CN-YunxiNeural"
@@ -44,6 +76,12 @@ class Settings(BaseSettings):
    GLM_API_KEY: str = ""
    GLM_MODEL: str = "glm-4.7-flash"
    
+    # CORS 配置 (逗号分隔的域名列表，* 表示允许所有)
+    CORS_ORIGINS: str = "*"
+    
+    # 抖音 Cookie (用于视频下载功能，会过期需要定期更新)
+    DOUYIN_COOKIE: str = ""
+    
    @property
    def LATENTSYNC_DIR(self) -> Path:
        """LatentSync 目录路径 (动态计算)"""
--- a/backend/app/main.py
+++ b/backend/app/main.py
@@ -1,10 +1,21 @@
-from fastapi import FastAPI, HTTPException
-from fastapi.staticfiles import StaticFiles
-from fastapi.middleware.cors import CORSMiddleware
-from fastapi.responses import JSONResponse
-from app.core import config
-from app.core.response import error_response
-from app.api import materials, videos, publish, login_helper, auth, admin, ref_audios, ai, tools, assets
+from fastapi import FastAPI, HTTPException
+from fastapi.staticfiles import StaticFiles
+from fastapi.middleware.cors import CORSMiddleware
+from fastapi.responses import JSONResponse
+from app.core import config
+from app.core.response import error_response
+# 直接从 modules 导入路由，消除 api 转发层
+from app.modules.materials.router import router as materials_router
+from app.modules.videos.router import router as videos_router
+from app.modules.publish.router import router as publish_router
+from app.modules.login_helper.router import router as login_helper_router
+from app.modules.auth.router import router as auth_router
+from app.modules.admin.router import router as admin_router
+from app.modules.ref_audios.router import router as ref_audios_router
+from app.modules.ai.router import router as ai_router
+from app.modules.tools.router import router as tools_router
+from app.modules.assets.router import router as assets_router
+from app.modules.generated_audios.router import router as generated_audios_router
 from loguru import logger
 import os

@@ -12,17 +23,34 @@ settings = config.settings

 app = FastAPI(title="ViGent TalkingHead Agent")

-from fastapi import Request
-from fastapi.exceptions import RequestValidationError
+from fastapi import Request
+from fastapi.exceptions import RequestValidationError
 from starlette.middleware.base import BaseHTTPMiddleware
 import time
 import traceback

 class LoggingMiddleware(BaseHTTPMiddleware):
+    # 敏感 header 名称列表（小写）
+    SENSITIVE_HEADERS = {'authorization', 'cookie', 'set-cookie', 'x-api-key', 'api-key'}
+    
+    def _sanitize_headers(self, headers: dict) -> dict:
+        """脱敏处理请求头，隐藏敏感信息"""
+        sanitized = {}
+        for key, value in headers.items():
+            if key.lower() in self.SENSITIVE_HEADERS:
+                # 显示前8个字符 + 掩码
+                if len(value) > 8:
+                    sanitized[key] = value[:8] + "..." + f"[{len(value)} chars]"
+                else:
+                    sanitized[key] = "[REDACTED]"
+            else:
+                sanitized[key] = value
+        return sanitized
+    
    async def dispatch(self, request: Request, call_next):
        start_time = time.time()
        logger.info(f"START Request: {request.method} {request.url}")
-        logger.info(f"HEADERS: {dict(request.headers)}")
+        logger.debug(f"HEADERS: {self._sanitize_headers(dict(request.headers))}")
        try:
            response = await call_next(request)
            process_time = time.time() - start_time
@@ -33,65 +61,71 @@ class LoggingMiddleware(BaseHTTPMiddleware):
            logger.error(f"EXCEPTION during request {request.method} {request.url}: {str(e)}\n{traceback.format_exc()}")
            raise e

-app.add_middleware(LoggingMiddleware)
-
-
-@app.exception_handler(RequestValidationError)
-async def validation_exception_handler(request: Request, exc: RequestValidationError):
-    return JSONResponse(
-        status_code=422,
-        content=error_response("参数校验失败", 422, data=exc.errors()),
-    )
-
-
-@app.exception_handler(HTTPException)
-async def http_exception_handler(request: Request, exc: HTTPException):
-    detail = exc.detail
-    message = detail if isinstance(detail, str) else "请求失败"
-    data = detail if not isinstance(detail, str) else None
-    return JSONResponse(
-        status_code=exc.status_code,
-        content=error_response(message, exc.status_code, data=data),
-        headers=exc.headers,
-    )
-
-
-@app.exception_handler(Exception)
-async def unhandled_exception_handler(request: Request, exc: Exception):
-    return JSONResponse(
-        status_code=500,
-        content=error_response("服务器内部错误", 500),
-    )
+app.add_middleware(LoggingMiddleware)
+
+
+@app.exception_handler(RequestValidationError)
+async def validation_exception_handler(request: Request, exc: RequestValidationError):
+    return JSONResponse(
+        status_code=422,
+        content=error_response("参数校验失败", 422, data=exc.errors()),
+    )
+
+
+@app.exception_handler(HTTPException)
+async def http_exception_handler(request: Request, exc: HTTPException):
+    detail = exc.detail
+    message = detail if isinstance(detail, str) else "请求失败"
+    data = detail if not isinstance(detail, str) else None
+    return JSONResponse(
+        status_code=exc.status_code,
+        content=error_response(message, exc.status_code, data=data),
+        headers=exc.headers,
+    )
+
+
+@app.exception_handler(Exception)
+async def unhandled_exception_handler(request: Request, exc: Exception):
+    return JSONResponse(
+        status_code=500,
+        content=error_response("服务器内部错误", 500),
+    )
+
+# CORS 配置：从环境变量读取允许的域名
+# 当使用 credentials 时，不能使用 * 通配符
+cors_origins = settings.CORS_ORIGINS.split(",") if settings.CORS_ORIGINS != "*" else ["*"]
+allow_credentials = settings.CORS_ORIGINS != "*"  # 使用 * 时不能 allow_credentials

 app.add_middleware(
    CORSMiddleware,
-    allow_origins=["*"],
-    allow_credentials=True,
+    allow_origins=cors_origins,
+    allow_credentials=allow_credentials,
    allow_methods=["*"],
    allow_headers=["*"],
 )

 # Create dirs
-settings.UPLOAD_DIR.mkdir(parents=True, exist_ok=True)
-settings.OUTPUT_DIR.mkdir(parents=True, exist_ok=True)
-(settings.UPLOAD_DIR / "materials").mkdir(exist_ok=True)
-settings.ASSETS_DIR.mkdir(parents=True, exist_ok=True)
+settings.UPLOAD_DIR.mkdir(parents=True, exist_ok=True)
+settings.OUTPUT_DIR.mkdir(parents=True, exist_ok=True)
+(settings.UPLOAD_DIR / "materials").mkdir(exist_ok=True)
+settings.ASSETS_DIR.mkdir(parents=True, exist_ok=True)

-app.mount("/outputs", StaticFiles(directory=str(settings.OUTPUT_DIR)), name="outputs")
-app.mount("/uploads", StaticFiles(directory=str(settings.UPLOAD_DIR)), name="uploads")
-app.mount("/assets", StaticFiles(directory=str(settings.ASSETS_DIR)), name="assets")
+app.mount("/outputs", StaticFiles(directory=str(settings.OUTPUT_DIR)), name="outputs")
+app.mount("/uploads", StaticFiles(directory=str(settings.UPLOAD_DIR)), name="uploads")
+app.mount("/assets", StaticFiles(directory=str(settings.ASSETS_DIR)), name="assets")

 # 注册路由
-app.include_router(materials.router, prefix="/api/materials", tags=["Materials"])
-app.include_router(videos.router, prefix="/api/videos", tags=["Videos"])
-app.include_router(publish.router, prefix="/api/publish", tags=["Publish"])
-app.include_router(login_helper.router, prefix="/api", tags=["LoginHelper"])
-app.include_router(auth.router)  # /api/auth
-app.include_router(admin.router)  # /api/admin
-app.include_router(ref_audios.router, prefix="/api/ref-audios", tags=["RefAudios"])
-app.include_router(ai.router)  # /api/ai
-app.include_router(tools.router, prefix="/api/tools", tags=["Tools"])
-app.include_router(assets.router, prefix="/api/assets", tags=["Assets"])
+app.include_router(materials_router, prefix="/api/materials", tags=["Materials"])
+app.include_router(videos_router, prefix="/api/videos", tags=["Videos"])
+app.include_router(publish_router, prefix="/api/publish", tags=["Publish"])
+app.include_router(login_helper_router, prefix="/api", tags=["LoginHelper"])
+app.include_router(auth_router)  # /api/auth
+app.include_router(admin_router)  # /api/admin
+app.include_router(ref_audios_router, prefix="/api/ref-audios", tags=["RefAudios"])
+app.include_router(ai_router)  # /api/ai
+app.include_router(tools_router, prefix="/api/tools", tags=["Tools"])
+app.include_router(assets_router, prefix="/api/assets", tags=["Assets"])
+app.include_router(generated_audios_router, prefix="/api/generated-audios", tags=["GeneratedAudios"])


@app.on_event("startup")
@@ -107,21 +141,21 @@ async def init_admin():
        return
    
    try:
-        from app.core.security import get_password_hash
-        from app.repositories.users import create_user, user_exists_by_phone
-
-        if user_exists_by_phone(admin_phone):
-            logger.info(f"管理员账号已存在: {admin_phone}")
-            return
-
-        create_user({
-            "phone": admin_phone,
-            "password_hash": get_password_hash(admin_password),
-            "username": "Admin",
-            "role": "admin",
-            "is_active": True,
-            "expires_at": None  # 永不过期
-        })
+        from app.core.security import get_password_hash
+        from app.repositories.users import create_user, user_exists_by_phone
+
+        if user_exists_by_phone(admin_phone):
+            logger.info(f"管理员账号已存在: {admin_phone}")
+            return
+
+        create_user({
+            "phone": admin_phone,
+            "password_hash": get_password_hash(admin_password),
+            "username": "Admin",
+            "role": "admin",
+            "is_active": True,
+            "expires_at": None  # 永不过期
+        })
        
        logger.success(f"管理员账号已创建: {admin_phone}")
    except Exception as e:
--- a/backend/app/modules/ai/router.py
+++ b/backend/app/modules/ai/router.py
@@ -24,6 +24,33 @@ class GenerateMetaResponse(BaseModel):
    tags: list[str]


+class TranslateRequest(BaseModel):
+    """翻译请求"""
+    text: str
+    target_lang: str
+
+
+@router.post("/translate")
+async def translate_text(req: TranslateRequest):
+    """
+    AI 翻译文案
+
+    将文案翻译为指定目标语言
+    """
+    if not req.text or not req.text.strip():
+        raise HTTPException(status_code=400, detail="文案不能为空")
+    if not req.target_lang or not req.target_lang.strip():
+        raise HTTPException(status_code=400, detail="目标语言不能为空")
+
+    try:
+        logger.info(f"Translating text to {req.target_lang}: {req.text[:50]}...")
+        translated = await glm_service.translate_text(req.text.strip(), req.target_lang.strip())
+        return success_response({"translated_text": translated})
+    except Exception as e:
+        logger.error(f"Translate failed: {e}")
+        raise HTTPException(status_code=500, detail=str(e))
+
+
@router.post("/generate-meta")
 async def generate_meta(req: GenerateMetaRequest):
    """
--- a/backend/app/modules/generated_audios/init.py
+++ b/backend/app/modules/generated_audios/init.py
--- a/backend/app/modules/generated_audios/router.py
+++ b/backend/app/modules/generated_audios/router.py
@@ -0,0 +1,77 @@
+"""生成配音 API"""
+from fastapi import APIRouter, BackgroundTasks, Depends, HTTPException
+import uuid
+from loguru import logger
+
+from app.core.deps import get_current_user
+from app.core.response import success_response
+from app.modules.videos.task_store import create_task, get_task
+from app.modules.generated_audios.schemas import GenerateAudioRequest, RenameAudioRequest
+from app.modules.generated_audios import service
+
+router = APIRouter()
+
+
+@router.post("/generate")
+async def generate_audio(
+    req: GenerateAudioRequest,
+    background_tasks: BackgroundTasks,
+    user: dict = Depends(get_current_user),
+):
+    """异步生成配音（返回 task_id）"""
+    task_id = str(uuid.uuid4())
+    create_task(task_id, user["id"])
+    background_tasks.add_task(service.generate_audio_task, task_id, req, user["id"])
+    return success_response({"task_id": task_id})
+
+
+@router.get("/tasks/{task_id}")
+async def get_audio_task(task_id: str, user: dict = Depends(get_current_user)):
+    """轮询配音生成进度"""
+    task = get_task(task_id)
+    if task.get("status") != "not_found" and task.get("user_id") != user["id"]:
+        return success_response({"status": "not_found"})
+    return success_response(task)
+
+
+@router.get("")
+async def list_audios(user: dict = Depends(get_current_user)):
+    """列出当前用户所有已生成配音"""
+    try:
+        result = await service.list_generated_audios(user["id"])
+        return success_response(result)
+    except Exception as e:
+        logger.error(f"列出配音失败: {e}")
+        raise HTTPException(status_code=500, detail=f"获取列表失败: {str(e)}")
+
+
+@router.delete("/{audio_id:path}")
+async def delete_audio(audio_id: str, user: dict = Depends(get_current_user)):
+    """删除配音"""
+    try:
+        await service.delete_generated_audio(audio_id, user["id"])
+        return success_response(message="删除成功")
+    except PermissionError as e:
+        raise HTTPException(status_code=403, detail=str(e))
+    except Exception as e:
+        logger.error(f"删除配音失败: {e}")
+        raise HTTPException(status_code=500, detail=f"删除失败: {str(e)}")
+
+
+@router.put("/{audio_id:path}")
+async def rename_audio(
+    audio_id: str,
+    request: RenameAudioRequest,
+    user: dict = Depends(get_current_user),
+):
+    """重命名配音"""
+    try:
+        result = await service.rename_generated_audio(audio_id, request.new_name, user["id"])
+        return success_response(result, message="重命名成功")
+    except PermissionError as e:
+        raise HTTPException(status_code=403, detail=str(e))
+    except ValueError as e:
+        raise HTTPException(status_code=400, detail=str(e))
+    except Exception as e:
+        logger.error(f"重命名配音失败: {e}")
+        raise HTTPException(status_code=500, detail=f"重命名失败: {str(e)}")
--- a/backend/app/modules/generated_audios/schemas.py
+++ b/backend/app/modules/generated_audios/schemas.py
@@ -0,0 +1,30 @@
+from pydantic import BaseModel
+from typing import Optional, List
+
+
+class GenerateAudioRequest(BaseModel):
+    text: str
+    tts_mode: str = "edgetts"
+    voice: str = "zh-CN-YunxiNeural"
+    ref_audio_id: Optional[str] = None
+    ref_text: Optional[str] = None
+    language: str = "zh-CN"
+
+
+class RenameAudioRequest(BaseModel):
+    new_name: str
+
+
+class GeneratedAudioItem(BaseModel):
+    id: str
+    name: str
+    path: str
+    duration_sec: float
+    text: str
+    tts_mode: str
+    language: str
+    created_at: int
+
+
+class GeneratedAudioListResponse(BaseModel):
+    items: List[GeneratedAudioItem]
--- a/backend/app/modules/generated_audios/service.py
+++ b/backend/app/modules/generated_audios/service.py
@@ -0,0 +1,263 @@
+"""生成配音 - 业务逻辑"""
+import re
+import json
+import time
+import asyncio
+import subprocess
+import tempfile
+import os
+from pathlib import Path
+from typing import Optional
+
+import httpx
+from loguru import logger
+
+from app.services.storage import storage_service
+from app.services.tts_service import TTSService
+from app.services.voice_clone_service import voice_clone_service
+from app.modules.videos.task_store import task_store
+from app.modules.generated_audios.schemas import (
+    GenerateAudioRequest,
+    GeneratedAudioItem,
+    GeneratedAudioListResponse,
+)
+
+BUCKET = "generated-audios"
+
+
+def _locale_to_qwen_lang(locale: str) -> str:
+    mapping = {"zh": "Chinese", "en": "English"}
+    return mapping.get(locale.split("-")[0], "Auto")
+
+
+def _get_audio_duration(file_path: str) -> float:
+    try:
+        result = subprocess.run(
+            ['ffprobe', '-v', 'quiet', '-show_entries', 'format=duration',
+             '-of', 'csv=p=0', file_path],
+            capture_output=True, text=True, timeout=10
+        )
+        return float(result.stdout.strip())
+    except Exception as e:
+        logger.warning(f"获取音频时长失败: {e}")
+        return 0.0
+
+
+async def generate_audio_task(task_id: str, req: GenerateAudioRequest, user_id: str):
+    """后台任务：生成配音"""
+    try:
+        task_store.update(task_id, {"status": "processing", "progress": 10, "message": "正在生成配音..."})
+
+        with tempfile.NamedTemporaryFile(delete=False, suffix=".wav") as tmp:
+            audio_path = tmp.name
+
+        try:
+            if req.tts_mode == "voiceclone":
+                if not req.ref_audio_id or not req.ref_text:
+                    raise ValueError("声音克隆模式需要提供参考音频和参考文字")
+
+                task_store.update(task_id, {"progress": 20, "message": "正在下载参考音频..."})
+
+                with tempfile.NamedTemporaryFile(delete=False, suffix=".wav") as tmp_ref:
+                    ref_local = tmp_ref.name
+
+                try:
+                    ref_url = await storage_service.get_signed_url(
+                        bucket="ref-audios", path=req.ref_audio_id
+                    )
+                    timeout = httpx.Timeout(None)
+                    async with httpx.AsyncClient(timeout=timeout) as client:
+                        async with client.stream("GET", ref_url) as resp:
+                            resp.raise_for_status()
+                            with open(ref_local, "wb") as f:
+                                async for chunk in resp.aiter_bytes():
+                                    f.write(chunk)
+
+                    task_store.update(task_id, {"progress": 40, "message": "正在克隆声音 (Qwen3-TTS)..."})
+                    await voice_clone_service.generate_audio(
+                        text=req.text,
+                        ref_audio_path=ref_local,
+                        ref_text=req.ref_text,
+                        output_path=audio_path,
+                        language=_locale_to_qwen_lang(req.language),
+                    )
+                finally:
+                    if os.path.exists(ref_local):
+                        os.unlink(ref_local)
+            else:
+                task_store.update(task_id, {"progress": 30, "message": "正在生成语音 (EdgeTTS)..."})
+                tts = TTSService()
+                await tts.generate_audio(req.text, req.voice, audio_path)
+
+            task_store.update(task_id, {"progress": 70, "message": "正在上传配音..."})
+
+            duration = _get_audio_duration(audio_path)
+            timestamp = int(time.time())
+            audio_id = f"{user_id}/{timestamp}_audio.wav"
+            meta_id = f"{user_id}/{timestamp}_audio.json"
+
+            # 生成 display_name
+            now = time.strftime("%Y%m%d_%H%M", time.localtime(timestamp))
+            display_name = f"配音_{now}"
+
+            with open(audio_path, "rb") as f:
+                wav_data = f.read()
+
+            await storage_service.upload_file(
+                bucket=BUCKET, path=audio_id,
+                file_data=wav_data, content_type="audio/wav",
+            )
+
+            metadata = {
+                "display_name": display_name,
+                "text": req.text,
+                "tts_mode": req.tts_mode,
+                "voice": req.voice if req.tts_mode == "edgetts" else None,
+                "ref_audio_id": req.ref_audio_id,
+                "language": req.language,
+                "duration_sec": duration,
+                "created_at": timestamp,
+            }
+            await storage_service.upload_file(
+                bucket=BUCKET, path=meta_id,
+                file_data=json.dumps(metadata, ensure_ascii=False).encode("utf-8"),
+                content_type="application/json",
+            )
+
+            signed_url = await storage_service.get_signed_url(BUCKET, audio_id)
+
+            task_store.update(task_id, {
+                "status": "completed",
+                "progress": 100,
+                "message": f"配音生成完成 ({duration:.1f}s)",
+                "output": {
+                    "audio_id": audio_id,
+                    "name": display_name,
+                    "path": signed_url,
+                    "duration_sec": duration,
+                    "text": req.text,
+                    "tts_mode": req.tts_mode,
+                    "language": req.language,
+                    "created_at": timestamp,
+                },
+            })
+        finally:
+            if os.path.exists(audio_path):
+                os.unlink(audio_path)
+
+    except Exception as e:
+        import traceback
+        task_store.update(task_id, {
+            "status": "failed",
+            "message": f"配音生成失败: {str(e)}",
+            "error": traceback.format_exc(),
+        })
+        logger.error(f"Generate audio failed: {e}")
+
+
+async def list_generated_audios(user_id: str) -> dict:
+    """列出用户的所有已生成配音"""
+    files = await storage_service.list_files(BUCKET, user_id)
+    wav_files = [f for f in files if f.get("name", "").endswith("_audio.wav")]
+
+    if not wav_files:
+        return GeneratedAudioListResponse(items=[]).model_dump()
+
+    async def fetch_info(f):
+        name = f.get("name", "")
+        storage_path = f"{user_id}/{name}"
+        meta_name = name.replace("_audio.wav", "_audio.json")
+        meta_path = f"{user_id}/{meta_name}"
+
+        display_name = name
+        text = ""
+        tts_mode = "edgetts"
+        language = "zh-CN"
+        duration_sec = 0.0
+        created_at = 0
+
+        try:
+            meta_url = await storage_service.get_signed_url(BUCKET, meta_path)
+            async with httpx.AsyncClient(timeout=5.0) as client:
+                resp = await client.get(meta_url)
+                if resp.status_code == 200:
+                    meta = resp.json()
+                    display_name = meta.get("display_name", name)
+                    text = meta.get("text", "")
+                    tts_mode = meta.get("tts_mode", "edgetts")
+                    language = meta.get("language", "zh-CN")
+                    duration_sec = meta.get("duration_sec", 0.0)
+                    created_at = meta.get("created_at", 0)
+        except Exception as e:
+            logger.debug(f"读取配音 metadata 失败: {e}")
+            try:
+                created_at = int(name.split("_")[0])
+            except:
+                pass
+
+        signed_url = await storage_service.get_signed_url(BUCKET, storage_path)
+
+        return GeneratedAudioItem(
+            id=storage_path,
+            name=display_name,
+            path=signed_url,
+            duration_sec=duration_sec,
+            text=text,
+            tts_mode=tts_mode,
+            language=language,
+            created_at=created_at,
+        )
+
+    items = await asyncio.gather(*[fetch_info(f) for f in wav_files])
+    items = sorted(items, key=lambda x: x.created_at, reverse=True)
+    return GeneratedAudioListResponse(items=items).model_dump()
+
+
+async def delete_generated_audio(audio_id: str, user_id: str) -> None:
+    if not audio_id.startswith(f"{user_id}/"):
+        raise PermissionError("无权删除此文件")
+
+    await storage_service.delete_file(BUCKET, audio_id)
+    meta_path = audio_id.replace("_audio.wav", "_audio.json")
+    try:
+        await storage_service.delete_file(BUCKET, meta_path)
+    except:
+        pass
+
+
+async def rename_generated_audio(audio_id: str, new_name: str, user_id: str) -> dict:
+    if not audio_id.startswith(f"{user_id}/"):
+        raise PermissionError("无权修改此文件")
+
+    new_name = new_name.strip()
+    if not new_name:
+        raise ValueError("新名称不能为空")
+
+    meta_path = audio_id.replace("_audio.wav", "_audio.json")
+    try:
+        meta_url = await storage_service.get_signed_url(BUCKET, meta_path)
+        async with httpx.AsyncClient() as client:
+            resp = await client.get(meta_url)
+            if resp.status_code == 200:
+                metadata = resp.json()
+            else:
+                raise Exception(f"Failed to fetch metadata: {resp.status_code}")
+    except Exception as e:
+        logger.warning(f"无法读取配音元数据: {e}, 将创建新的")
+        metadata = {
+            "display_name": new_name,
+            "text": "",
+            "tts_mode": "edgetts",
+            "language": "zh-CN",
+            "duration_sec": 0.0,
+            "created_at": int(time.time()),
+        }
+
+    metadata["display_name"] = new_name
+    await storage_service.upload_file(
+        bucket=BUCKET,
+        path=meta_path,
+        file_data=json.dumps(metadata, ensure_ascii=False).encode("utf-8"),
+        content_type="application/json",
+    )
+    return {"name": new_name}
--- a/backend/app/modules/login_helper/router.py
+++ b/backend/app/modules/login_helper/router.py
@@ -15,17 +15,19 @@ async def login_helper_page(platform: str, request: Request):
    登录后JavaScript自动提取Cookie并POST回服务器
    """
    
-    platform_urls = {
-        "bilibili": "https://www.bilibili.com/",
-        "douyin": "https://creator.douyin.com/",
-        "xiaohongshu": "https://creator.xiaohongshu.com/"
-    }
+    platform_urls = {
+        "bilibili": "https://www.bilibili.com/",
+        "douyin": "https://creator.douyin.com/",
+        "xiaohongshu": "https://creator.xiaohongshu.com/",
+        "weixin": "https://channels.weixin.qq.com/"
+    }
    
-    platform_names = {
-        "bilibili": "B站",
-        "douyin": "抖音",
-        "xiaohongshu": "小红书"
-    }
+    platform_names = {
+        "bilibili": "B站",
+        "douyin": "抖音",
+        "xiaohongshu": "小红书",
+        "weixin": "微信视频号"
+    }
    
    if platform not in platform_urls:
        return "<h1>不支持的平台</h1>"
--- a/backend/app/modules/materials/router.py
+++ b/backend/app/modules/materials/router.py
@@ -1,416 +1,62 @@
-from fastapi import APIRouter, UploadFile, File, HTTPException, Request, BackgroundTasks, Depends
-from app.core.config import settings
-from app.core.deps import get_current_user
-from app.core.response import success_response
-from app.services.storage import storage_service
-import re
-import time
-import traceback
-import os
-import aiofiles
-from pathlib import Path
-from loguru import logger
-import asyncio
-from pydantic import BaseModel
-from typing import Optional
-import httpx
+from fastapi import APIRouter, HTTPException, Request, Depends
+from loguru import logger

+from app.core.deps import get_current_user
+from app.core.response import success_response
+from app.modules.materials.schemas import RenameMaterialRequest
+from app.modules.materials import service

-router = APIRouter()
-
-
-class RenameMaterialRequest(BaseModel):
-    new_name: str
-
-def sanitize_filename(filename: str) -> str:
-    safe_name = re.sub(r'[<>:"/\\|?*]', '_', filename)
-    if len(safe_name) > 100:
-        ext = Path(safe_name).suffix
-        safe_name = safe_name[:100 - len(ext)] + ext
-    return safe_name
-
-async def process_and_upload(temp_file_path: str, original_filename: str, content_type: str, user_id: str):
-    """Background task to strip multipart headers and upload to Supabase"""
-    try:
-        logger.info(f"Processing raw upload: {temp_file_path} for user {user_id}")
-
-        # 1. Analyze file to find actual video content (strip multipart boundaries)
-        # This is a simplified manual parser for a SINGLE file upload.
-        # Structure:
-        # --boundary
-        # Content-Disposition: form-data; name="file"; filename="..."
-        # Content-Type: video/mp4
-        # \r\n\r\n
-        # [DATA]
-        # \r\n--boundary--
-
-        # We need to read the first few KB to find the header end
-        start_offset = 0
-        end_offset = 0
-        boundary = b""
-
-        file_size = os.path.getsize(temp_file_path)
-
-        with open(temp_file_path, 'rb') as f:
-            # Read first 4KB to find header
-            head = f.read(4096)
-
-            # Find boundary
-            first_line_end = head.find(b'\r\n')
-            if first_line_end == -1:
-                raise Exception("Could not find boundary in multipart body")
-
-            boundary = head[:first_line_end] # e.g. --boundary123
-            logger.info(f"Detected boundary: {boundary}")
-
-            # Find end of headers (\r\n\r\n)
-            header_end = head.find(b'\r\n\r\n')
-            if header_end == -1:
-                raise Exception("Could not find end of multipart headers")
-
-            start_offset = header_end + 4
-            logger.info(f"Video data starts at offset: {start_offset}")
-
-            # Find end boundary (read from end of file)
-            # It should be \r\n + boundary + -- + \r\n
-            # We seek to end-200 bytes
-            f.seek(max(0, file_size - 200))
-            tail = f.read()
-
-            # The closing boundary is usually --boundary--
-            # We look for the last occurrence of the boundary
-            last_boundary_pos = tail.rfind(boundary)
-            if last_boundary_pos != -1:
-                # The data ends before \r\n + boundary
-                # The tail buffer relative position needs to be converted to absolute
-                end_pos_in_tail = last_boundary_pos
-                # We also need to check for the preceding \r\n
-                if end_pos_in_tail >= 2 and tail[end_pos_in_tail-2:end_pos_in_tail] == b'\r\n':
-                     end_pos_in_tail -= 2
-
-                # Absolute end offset
-                end_offset = (file_size - 200) + last_boundary_pos
-                # Correction for CRLF before boundary
-                # Actually, simply: read until (file_size - len(tail) + last_boundary_pos) - 2
-                end_offset = (max(0, file_size - 200) + last_boundary_pos) - 2
-            else:
-                logger.warning("Could not find closing boundary, assuming EOF")
-                end_offset = file_size
-
-            logger.info(f"Video data ends at offset: {end_offset}. Total video size: {end_offset - start_offset}")
-
-        # 2. Extract and Upload to Supabase
-        # Since we have the file on disk, we can just pass the file object (seeked) to upload_file?
-        # Or if upload_file expects bytes/path, checking storage.py...
-        # It takes `file_data` (bytes) or file-like?
-        # supabase-py's `upload` method handles parsing if we pass a file object.
-        # But we need to pass ONLY the video slice.
-        # So we create a generator or a sliced file object?
-        # Simpler: Read the slice into memory if < 1GB? Or copy to new temp file?
-        # Copying to new temp file is safer for memory.
-
-        video_path = temp_file_path + "_video.mp4"
-        with open(temp_file_path, 'rb') as src, open(video_path, 'wb') as dst:
-            src.seek(start_offset)
-            # Copy in chunks
-            bytes_to_copy = end_offset - start_offset
-            copied = 0
-            while copied < bytes_to_copy:
-                chunk_size = min(1024*1024*10, bytes_to_copy - copied) # 10MB chunks
-                chunk = src.read(chunk_size)
-                if not chunk:
-                    break
-                dst.write(chunk)
-                copied += len(chunk)
-
-        logger.info(f"Extracted video content to {video_path}")
-
-        # 3. Upload to Supabase with user isolation
-        timestamp = int(time.time())
-        safe_name = re.sub(r'[^a-zA-Z0-9._-]', '', original_filename)
-        # 使用 user_id 作为目录前缀实现隔离
-        storage_path = f"{user_id}/{timestamp}_{safe_name}"
-
-        # Use storage service (this calls Supabase which might do its own http request)
-        # We read the cleaned video file
-        with open(video_path, 'rb') as f:
-            file_content = f.read() # Still reading into memory for simple upload call, but server has 32GB RAM so ok for 500MB
-            await storage_service.upload_file(
-                bucket=storage_service.BUCKET_MATERIALS,
-                path=storage_path,
-                file_data=file_content,
-                content_type=content_type
-            )
-
-        logger.info(f"Upload to Supabase complete: {storage_path}")
-
-        # Cleanup
-        os.remove(temp_file_path)
-        os.remove(video_path)
-
-        return storage_path
-
-    except Exception as e:
-        logger.error(f"Background upload processing failed: {e}\n{traceback.format_exc()}")
-        raise
+router = APIRouter()


@router.post("")
 async def upload_material(
    request: Request,
-    background_tasks: BackgroundTasks,
    current_user: dict = Depends(get_current_user)
 ):
    user_id = current_user["id"]
-    logger.info(f"ENTERED upload_material (Streaming Mode) for user {user_id}. Headers: {request.headers}")
-    
-    filename = "unknown_video.mp4" # Fallback
-    content_type = "video/mp4"
-    
-    # Try to parse filename from header if possible (unreliable in raw stream)
-    # We will rely on post-processing or client hint
-    # Frontend sends standard multipart.
-    
-    # Create temp file
-    timestamp = int(time.time())
-    temp_filename = f"upload_{timestamp}.raw"
-    temp_path = os.path.join("/tmp", temp_filename) # Use /tmp on Linux
-    # Ensure /tmp exists (it does) but verify paths
-    if os.name == 'nt': # Local dev
-        temp_path = f"d:/tmp/{temp_filename}"
-        os.makedirs("d:/tmp", exist_ok=True)
-
+    logger.info(f"Upload material request from user {user_id}")
    try:
-        total_size = 0
-        last_log = 0
-        
-        async with aiofiles.open(temp_path, 'wb') as f:
-            async for chunk in request.stream():
-                await f.write(chunk)
-                total_size += len(chunk)
-                
-                # Log progress every 20MB
-                if total_size - last_log > 20 * 1024 * 1024:
-                    logger.info(f"Receiving stream... Processed {total_size / (1024*1024):.2f} MB")
-                    last_log = total_size
-                    
-        logger.info(f"Stream reception complete. Total size: {total_size} bytes. Saved to {temp_path}")
-        
-        if total_size == 0:
-            raise HTTPException(400, "Received empty body")
-
-        # Attempt to extract filename from the saved file's first bytes?
-        # Or just accept it as "uploaded_video.mp4" for now to prove it works.
-        # We can try to regex the header in the file content we just wrote.
-        # Implemented in background task to return success immediately.
-        
-        # Wait, if we return immediately, the user's UI might not show the file yet?
-        # The prompt says "Wait for upload". 
-        # But to avoid User Waiting Timeout, maybe returning early is better?
-        # NO, user expects the file to be in the list.
-        # So we Must await the processing.
-        # But "Processing" (Strip + Upload to Supabase) takes time.
-        # Receiving took time.
-        # If we await Supabase upload, does it timeout?
-        # Supabase upload is outgoing. Usually faster/stable.
-        
-        # Let's await the processing to ensure "List Materials" shows it.
-        # We need to extract the filename for the list.
-        
-        # Quick extract filename from first 4kb
-        with open(temp_path, 'rb') as f:
-            head = f.read(4096).decode('utf-8', errors='ignore')
-            match = re.search(r'filename="([^"]+)"', head)
-            if match:
-                filename = match.group(1)
-                logger.info(f"Extracted filename from body: {filename}")
-            
-        # Run processing sync (in await)
-        storage_path = await process_and_upload(temp_path, filename, content_type, user_id)
-
-        # Get signed URL (it exists now)
-        signed_url = await storage_service.get_signed_url(
-            bucket=storage_service.BUCKET_MATERIALS,
-            path=storage_path
-        )
-        
-        size_mb = total_size / (1024 * 1024) # Approximate (includes headers)
-
-        # 从 storage_path 提取显示名
-        display_name = storage_path.split('/')[-1]  # 去掉 user_id 前缀
-        if '_' in display_name:
-            parts = display_name.split('_', 1)
-            if parts[0].isdigit():
-                display_name = parts[1]
-
-        return success_response({
-            "id": storage_path,
-            "name": display_name,
-            "path": signed_url,
-            "size_mb": size_mb,
-            "type": "video"
-        })
-        
+        result = await service.upload_material(request, user_id)
+        return success_response(result)
+    except ValueError as e:
+        raise HTTPException(400, str(e))
    except Exception as e:
-        error_msg = f"Streaming upload failed: {str(e)}"
-        detail_msg = f"Exception: {repr(e)}\nArgs: {e.args}\n{traceback.format_exc()}"
-        logger.error(error_msg + "\n" + detail_msg)
-        
-        # Write to debug file
-        try:
-             with open("debug_upload.log", "a") as logf:
-                 logf.write(f"\n--- Error at {time.ctime()} ---\n")
-                 logf.write(detail_msg)
-                 logf.write("\n-----------------------------\n")
-        except:
-             pass
-
-        if os.path.exists(temp_path):
-            try:
-                os.remove(temp_path)
-            except:
-                pass
-        raise HTTPException(500, f"Upload failed. Check server logs. Error: {str(e)}")
+        raise HTTPException(500, f"Upload failed. Error: {str(e)}")


@router.get("")
 async def list_materials(current_user: dict = Depends(get_current_user)):
    user_id = current_user["id"]
-    try:
-        # 只列出当前用户目录下的文件
-        files_obj = await storage_service.list_files(
-            bucket=storage_service.BUCKET_MATERIALS,
-            path=user_id
-        )
-        semaphore = asyncio.Semaphore(8)
-
-        async def build_item(f):
-            name = f.get('name')
-            if not name or name == '.emptyFolderPlaceholder':
-                return None
-            display_name = name
-            if '_' in name:
-                parts = name.split('_', 1)
-                if parts[0].isdigit():
-                    display_name = parts[1]
-            full_path = f"{user_id}/{name}"
-            async with semaphore:
-                signed_url = await storage_service.get_signed_url(
-                    bucket=storage_service.BUCKET_MATERIALS,
-                    path=full_path
-                )
-            metadata = f.get('metadata', {})
-            size = metadata.get('size', 0)
-            created_at_str = f.get('created_at', '')
-            created_at = 0
-            if created_at_str:
-                from datetime import datetime
-                try:
-                    dt = datetime.fromisoformat(created_at_str.replace('Z', '+00:00'))
-                    created_at = int(dt.timestamp())
-                except Exception:
-                    pass
-            return {
-                "id": full_path,
-                "name": display_name,
-                "path": signed_url,
-                "size_mb": size / (1024 * 1024),
-                "type": "video",
-                "created_at": created_at
-            }
-
-        tasks = [build_item(f) for f in files_obj]
-        results = await asyncio.gather(*tasks, return_exceptions=True)
-
-        materials = []
-        for item in results:
-            if not item:
-                continue
-            if isinstance(item, Exception):
-                logger.warning(f"Material signed url build failed: {item}")
-                continue
-            materials.append(item)
-        materials.sort(key=lambda x: x['id'], reverse=True)
-        return success_response({"materials": materials})
-    except Exception as e:
-        logger.error(f"List materials failed: {e}")
-        return success_response({"materials": []}, message="获取素材失败")
+    materials = await service.list_materials(user_id)
+    return success_response({"materials": materials})


-@router.delete("/{material_id:path}")
-async def delete_material(material_id: str, current_user: dict = Depends(get_current_user)):
+@router.delete("/{material_id:path}")
+async def delete_material(material_id: str, current_user: dict = Depends(get_current_user)):
    user_id = current_user["id"]
-    # 验证 material_id 属于当前用户
-    if not material_id.startswith(f"{user_id}/"):
-        raise HTTPException(403, "无权删除此素材")
    try:
-        await storage_service.delete_file(
-            bucket=storage_service.BUCKET_MATERIALS,
-            path=material_id
-        )
-        return success_response(message="素材已删除")
-    except Exception as e:
-        raise HTTPException(500, f"删除失败: {str(e)}")
-
-
-@router.put("/{material_id:path}")
-async def rename_material(
-    material_id: str,
-    payload: RenameMaterialRequest,
-    current_user: dict = Depends(get_current_user)
-):
-    user_id = current_user["id"]
-    if not material_id.startswith(f"{user_id}/"):
-        raise HTTPException(403, "无权重命名此素材")
-
-    new_name_raw = payload.new_name.strip() if payload.new_name else ""
-    if not new_name_raw:
-        raise HTTPException(400, "新名称不能为空")
-
-    old_name = material_id.split("/", 1)[1]
-    old_ext = Path(old_name).suffix
-    base_name = Path(new_name_raw).stem if Path(new_name_raw).suffix else new_name_raw
-    safe_base = sanitize_filename(base_name).strip()
-    if not safe_base:
-        raise HTTPException(400, "新名称无效")
-
-    new_filename = f"{safe_base}{old_ext}"
-
-    prefix = None
-    if "_" in old_name:
-        maybe_prefix, _ = old_name.split("_", 1)
-        if maybe_prefix.isdigit():
-            prefix = maybe_prefix
-    if prefix:
-        new_filename = f"{prefix}_{new_filename}"
-
-    new_path = f"{user_id}/{new_filename}"
-    try:
-        if new_path != material_id:
-            await storage_service.move_file(
-                bucket=storage_service.BUCKET_MATERIALS,
-                from_path=material_id,
-                to_path=new_path
-            )
-
-        signed_url = await storage_service.get_signed_url(
-            bucket=storage_service.BUCKET_MATERIALS,
-            path=new_path
-        )
-
-        display_name = new_filename
-        if "_" in new_filename:
-            parts = new_filename.split("_", 1)
-            if parts[0].isdigit():
-                display_name = parts[1]
-
-        return success_response({
-            "id": new_path,
-            "name": display_name,
-            "path": signed_url,
-        }, message="重命名成功")
-    except Exception as e:
-        raise HTTPException(500, f"重命名失败: {str(e)}")
-
+        await service.delete_material(material_id, user_id)
+        return success_response(message="素材已删除")
+    except PermissionError as e:
+        raise HTTPException(403, str(e))
+    except Exception as e:
+        raise HTTPException(500, f"删除失败: {str(e)}")


+@router.put("/{material_id:path}")
+async def rename_material(
+    material_id: str,
+    payload: RenameMaterialRequest,
+    current_user: dict = Depends(get_current_user)
+):
+    user_id = current_user["id"]
+    try:
+        result = await service.rename_material(material_id, payload.new_name, user_id)
+        return success_response(result, message="重命名成功")
+    except PermissionError as e:
+        raise HTTPException(403, str(e))
+    except ValueError as e:
+        raise HTTPException(400, str(e))
+    except Exception as e:
+        raise HTTPException(500, f"重命名失败: {str(e)}")
--- a/backend/app/modules/materials/schemas.py
+++ b/backend/app/modules/materials/schemas.py
@@ -0,0 +1,14 @@
+from pydantic import BaseModel
+
+
+class RenameMaterialRequest(BaseModel):
+    new_name: str
+
+
+class MaterialItem(BaseModel):
+    id: str
+    name: str
+    path: str
+    size_mb: float
+    type: str = "video"
+    created_at: int = 0
--- a/backend/app/modules/materials/service.py
+++ b/backend/app/modules/materials/service.py
@@ -0,0 +1,296 @@
+import re
+import os
+import time
+import asyncio
+import traceback
+import aiofiles
+from pathlib import Path
+from loguru import logger
+
+from app.services.storage import storage_service
+
+
+def sanitize_filename(filename: str) -> str:
+    safe_name = re.sub(r'[<>:"/\\|?*]', '_', filename)
+    if len(safe_name) > 100:
+        ext = Path(safe_name).suffix
+        safe_name = safe_name[:100 - len(ext)] + ext
+    return safe_name
+
+
+def _extract_display_name(storage_name: str) -> str:
+    """从存储文件名中提取显示名（去掉时间戳前缀）"""
+    if '_' in storage_name:
+        parts = storage_name.split('_', 1)
+        if parts[0].isdigit():
+            return parts[1]
+    return storage_name
+
+
+async def _process_and_upload(temp_file_path: str, original_filename: str, content_type: str, user_id: str) -> str:
+    """Strip multipart headers and upload to Supabase, return storage_path"""
+    try:
+        logger.info(f"Processing raw upload: {temp_file_path} for user {user_id}")
+
+        file_size = os.path.getsize(temp_file_path)
+
+        with open(temp_file_path, 'rb') as f:
+            head = f.read(4096)
+
+            first_line_end = head.find(b'\r\n')
+            if first_line_end == -1:
+                raise Exception("Could not find boundary in multipart body")
+
+            boundary = head[:first_line_end]
+            logger.info(f"Detected boundary: {boundary}")
+
+            header_end = head.find(b'\r\n\r\n')
+            if header_end == -1:
+                raise Exception("Could not find end of multipart headers")
+
+            start_offset = header_end + 4
+            logger.info(f"Video data starts at offset: {start_offset}")
+
+            f.seek(max(0, file_size - 200))
+            tail = f.read()
+
+            last_boundary_pos = tail.rfind(boundary)
+            if last_boundary_pos != -1:
+                end_offset = (max(0, file_size - 200) + last_boundary_pos) - 2
+            else:
+                logger.warning("Could not find closing boundary, assuming EOF")
+                end_offset = file_size
+
+            logger.info(f"Video data ends at offset: {end_offset}. Total video size: {end_offset - start_offset}")
+
+        video_path = temp_file_path + "_video.mp4"
+        with open(temp_file_path, 'rb') as src, open(video_path, 'wb') as dst:
+            src.seek(start_offset)
+            bytes_to_copy = end_offset - start_offset
+            copied = 0
+            while copied < bytes_to_copy:
+                chunk_size = min(1024 * 1024 * 10, bytes_to_copy - copied)
+                chunk = src.read(chunk_size)
+                if not chunk:
+                    break
+                dst.write(chunk)
+                copied += len(chunk)
+
+        logger.info(f"Extracted video content to {video_path}")
+
+        timestamp = int(time.time())
+        safe_name = re.sub(r'[^a-zA-Z0-9._-]', '', original_filename)
+        storage_path = f"{user_id}/{timestamp}_{safe_name}"
+
+        with open(video_path, 'rb') as f:
+            file_content = f.read()
+            await storage_service.upload_file(
+                bucket=storage_service.BUCKET_MATERIALS,
+                path=storage_path,
+                file_data=file_content,
+                content_type=content_type
+            )
+
+        logger.info(f"Upload to Supabase complete: {storage_path}")
+
+        os.remove(temp_file_path)
+        os.remove(video_path)
+
+        return storage_path
+
+    except Exception as e:
+        logger.error(f"Background upload processing failed: {e}\n{traceback.format_exc()}")
+        raise
+
+
+async def upload_material(request, user_id: str) -> dict:
+    """接收流式上传并存储到 Supabase，返回素材信息"""
+    filename = "unknown_video.mp4"
+    content_type = "video/mp4"
+
+    timestamp = int(time.time())
+    temp_filename = f"upload_{timestamp}.raw"
+    temp_path = os.path.join("/tmp", temp_filename)
+    if os.name == 'nt':
+        temp_path = f"d:/tmp/{temp_filename}"
+        os.makedirs("d:/tmp", exist_ok=True)
+
+    try:
+        total_size = 0
+        last_log = 0
+
+        async with aiofiles.open(temp_path, 'wb') as f:
+            async for chunk in request.stream():
+                await f.write(chunk)
+                total_size += len(chunk)
+
+                if total_size - last_log > 20 * 1024 * 1024:
+                    logger.info(f"Receiving stream... Processed {total_size / (1024*1024):.2f} MB")
+                    last_log = total_size
+
+        logger.info(f"Stream reception complete. Total size: {total_size} bytes. Saved to {temp_path}")
+
+        if total_size == 0:
+            raise ValueError("Received empty body")
+
+        with open(temp_path, 'rb') as f:
+            head = f.read(4096).decode('utf-8', errors='ignore')
+            match = re.search(r'filename="([^"]+)"', head)
+            if match:
+                filename = match.group(1)
+                logger.info(f"Extracted filename from body: {filename}")
+
+        storage_path = await _process_and_upload(temp_path, filename, content_type, user_id)
+
+        signed_url = await storage_service.get_signed_url(
+            bucket=storage_service.BUCKET_MATERIALS,
+            path=storage_path
+        )
+
+        size_mb = total_size / (1024 * 1024)
+        display_name = _extract_display_name(storage_path.split('/')[-1])
+
+        return {
+            "id": storage_path,
+            "name": display_name,
+            "path": signed_url,
+            "size_mb": size_mb,
+            "type": "video"
+        }
+
+    except Exception as e:
+        error_msg = f"Streaming upload failed: {str(e)}"
+        detail_msg = f"Exception: {repr(e)}\nArgs: {e.args}\n{traceback.format_exc()}"
+        logger.error(error_msg + "\n" + detail_msg)
+
+        try:
+            with open("debug_upload.log", "a") as logf:
+                logf.write(f"\n--- Error at {time.ctime()} ---\n")
+                logf.write(detail_msg)
+                logf.write("\n-----------------------------\n")
+        except:
+            pass
+
+        if os.path.exists(temp_path):
+            try:
+                os.remove(temp_path)
+            except:
+                pass
+        raise
+
+
+async def list_materials(user_id: str) -> list[dict]:
+    """列出用户的所有素材"""
+    try:
+        files_obj = await storage_service.list_files(
+            bucket=storage_service.BUCKET_MATERIALS,
+            path=user_id
+        )
+        semaphore = asyncio.Semaphore(8)
+
+        async def build_item(f):
+            name = f.get('name')
+            if not name or name == '.emptyFolderPlaceholder':
+                return None
+            display_name = _extract_display_name(name)
+            full_path = f"{user_id}/{name}"
+            async with semaphore:
+                signed_url = await storage_service.get_signed_url(
+                    bucket=storage_service.BUCKET_MATERIALS,
+                    path=full_path
+                )
+            metadata = f.get('metadata', {})
+            size = metadata.get('size', 0)
+            created_at_str = f.get('created_at', '')
+            created_at = 0
+            if created_at_str:
+                from datetime import datetime
+                try:
+                    dt = datetime.fromisoformat(created_at_str.replace('Z', '+00:00'))
+                    created_at = int(dt.timestamp())
+                except Exception:
+                    pass
+            return {
+                "id": full_path,
+                "name": display_name,
+                "path": signed_url,
+                "size_mb": size / (1024 * 1024),
+                "type": "video",
+                "created_at": created_at
+            }
+
+        tasks = [build_item(f) for f in files_obj]
+        results = await asyncio.gather(*tasks, return_exceptions=True)
+
+        materials = []
+        for item in results:
+            if not item:
+                continue
+            if isinstance(item, Exception):
+                logger.warning(f"Material signed url build failed: {item}")
+                continue
+            materials.append(item)
+        materials.sort(key=lambda x: x['id'], reverse=True)
+        return materials
+    except Exception as e:
+        logger.error(f"List materials failed: {e}")
+        return []
+
+
+async def delete_material(material_id: str, user_id: str) -> None:
+    """删除素材"""
+    if not material_id.startswith(f"{user_id}/"):
+        raise PermissionError("无权删除此素材")
+    await storage_service.delete_file(
+        bucket=storage_service.BUCKET_MATERIALS,
+        path=material_id
+    )
+
+
+async def rename_material(material_id: str, new_name_raw: str, user_id: str) -> dict:
+    """重命名素材，返回更新后的素材信息"""
+    if not material_id.startswith(f"{user_id}/"):
+        raise PermissionError("无权重命名此素材")
+
+    new_name_raw = new_name_raw.strip() if new_name_raw else ""
+    if not new_name_raw:
+        raise ValueError("新名称不能为空")
+
+    old_name = material_id.split("/", 1)[1]
+    old_ext = Path(old_name).suffix
+    base_name = Path(new_name_raw).stem if Path(new_name_raw).suffix else new_name_raw
+    safe_base = sanitize_filename(base_name).strip()
+    if not safe_base:
+        raise ValueError("新名称无效")
+
+    new_filename = f"{safe_base}{old_ext}"
+
+    prefix = None
+    if "_" in old_name:
+        maybe_prefix, _ = old_name.split("_", 1)
+        if maybe_prefix.isdigit():
+            prefix = maybe_prefix
+    if prefix:
+        new_filename = f"{prefix}_{new_filename}"
+
+    new_path = f"{user_id}/{new_filename}"
+
+    if new_path != material_id:
+        await storage_service.move_file(
+            bucket=storage_service.BUCKET_MATERIALS,
+            from_path=material_id,
+            to_path=new_path
+        )
+
+    signed_url = await storage_service.get_signed_url(
+        bucket=storage_service.BUCKET_MATERIALS,
+        path=new_path
+    )
+
+    display_name = _extract_display_name(new_filename)
+
+    return {
+        "id": new_path,
+        "name": display_name,
+        "path": signed_url,
+    }
--- a/backend/app/modules/publish/router.py
+++ b/backend/app/modules/publish/router.py
@@ -1,13 +1,17 @@
 """
 发布管理 API (支持用户认证)
 """
-from fastapi import APIRouter, HTTPException, BackgroundTasks, Depends, Request
-from pydantic import BaseModel
-from typing import List, Optional
-from datetime import datetime
+from fastapi import APIRouter, HTTPException, BackgroundTasks, Depends, Request
+from fastapi.responses import FileResponse
+from pydantic import BaseModel
+from typing import List, Optional
+from datetime import datetime
+import re
 from loguru import logger
 from app.services.publish_service import PublishService
 from app.core.response import success_response
+from app.core.config import settings
+from app.core.deps import get_current_user

 router = APIRouter()
 publish_service = PublishService()
@@ -29,7 +33,7 @@ class PublishResponse(BaseModel):
    url: Optional[str] = None

 # Supported platforms for validation
-SUPPORTED_PLATFORMS = {"bilibili", "douyin", "xiaohongshu"}
+SUPPORTED_PLATFORMS = {"bilibili", "douyin", "xiaohongshu", "weixin"}


 def _get_user_id(request: Request) -> Optional[str]:
@@ -118,7 +122,7 @@ async def get_login_status(platform: str, req: Request):
    message = result.get("message", "")
    return success_response(result, message=message)

-@router.post("/cookies/save/{platform}")
+@router.post("/cookies/save/{platform}")
 async def save_platform_cookie(platform: str, cookie_data: dict, req: Request):
    """
    保存从客户端浏览器提取的Cookie
@@ -139,3 +143,23 @@ async def save_platform_cookie(platform: str, cookie_data: dict, req: Request):
    
    message = result.get("message", "")
    return success_response(result, message=message)
+
+
+@router.get("/screenshot/{filename}")
+async def get_publish_screenshot(
+    filename: str,
+    current_user: dict = Depends(get_current_user),
+):
+    if not re.match(r"^[A-Za-z0-9_.-]+$", filename):
+        raise HTTPException(status_code=400, detail="非法文件名")
+
+    user_id = str(current_user.get("id") or "")
+    if not user_id:
+        raise HTTPException(status_code=401, detail="未登录")
+
+    user_dir = re.sub(r"[^A-Za-z0-9_-]", "_", user_id)[:64] or "legacy"
+    file_path = settings.PUBLISH_SCREENSHOT_DIR / user_dir / filename
+    if not file_path.exists() or not file_path.is_file():
+        raise HTTPException(status_code=404, detail="截图不存在")
+
+    return FileResponse(path=str(file_path), media_type="image/png")
--- a/backend/app/modules/ref_audios/router.py
+++ b/backend/app/modules/ref_audios/router.py
@@ -1,83 +1,14 @@
-"""
-参考音频管理 API
-支持上传/列表/删除参考音频，用于 Qwen3-TTS 声音克隆
-"""
+"""参考音频管理 API"""
 from fastapi import APIRouter, UploadFile, File, Form, HTTPException, Depends
-from pydantic import BaseModel
-from typing import List, Optional
-from pathlib import Path
 from loguru import logger
-import time
-import json
-import subprocess
-import tempfile
-import os
-import re

 from app.core.deps import get_current_user
-from app.services.storage import storage_service
 from app.core.response import success_response
+from app.modules.ref_audios.schemas import RenameRequest
+from app.modules.ref_audios import service

 router = APIRouter()

-# 支持的音频格式
-ALLOWED_AUDIO_EXTENSIONS = {'.wav', '.mp3', '.m4a', '.webm', '.ogg', '.flac', '.aac'}
-
-# 参考音频 bucket
-BUCKET_REF_AUDIOS = "ref-audios"
-
-
-class RefAudioResponse(BaseModel):
-    id: str
-    name: str
-    path: str  # signed URL for playback
-    ref_text: str
-    duration_sec: float
-    created_at: int
-
-
-class RefAudioListResponse(BaseModel):
-    items: List[RefAudioResponse]
-
-
-def sanitize_filename(filename: str) -> str:
-    """清理文件名，移除特殊字符"""
-    safe_name = re.sub(r'[<>:"/\\|?*\s]', '_', filename)
-    if len(safe_name) > 50:
-        ext = Path(safe_name).suffix
-        safe_name = safe_name[:50 - len(ext)] + ext
-    return safe_name
-
-
-def get_audio_duration(file_path: str) -> float:
-    """获取音频时长 (秒)"""
-    try:
-        result = subprocess.run(
-            ['ffprobe', '-v', 'quiet', '-show_entries', 'format=duration',
-             '-of', 'csv=p=0', file_path],
-            capture_output=True, text=True, timeout=10
-        )
-        return float(result.stdout.strip())
-    except Exception as e:
-        logger.warning(f"获取音频时长失败: {e}")
-        return 0.0
-
-
-def convert_to_wav(input_path: str, output_path: str) -> bool:
-    """将音频转换为 WAV 格式 (16kHz, mono)"""
-    try:
-        subprocess.run([
-            'ffmpeg', '-y', '-i', input_path,
-            '-ar', '16000',  # 16kHz 采样率
-            '-ac', '1',      # 单声道
-            '-acodec', 'pcm_s16le',  # 16-bit PCM
-            output_path
-        ], capture_output=True, timeout=60, check=True)
-        return True
-    except Exception as e:
-        logger.error(f"音频转换失败: {e}")
-        return False
-

@router.post("")
 async def upload_ref_audio(
@@ -85,156 +16,12 @@ async def upload_ref_audio(
    ref_text: str = Form(...),
    user: dict = Depends(get_current_user)
 ):
-    """
-    上传参考音频
-
-    - file: 音频文件 (支持 wav, mp3, m4a, webm 等)
-    - ref_text: 参考音频的转写文字 (必填)
-    """
-    user_id = user["id"]
-
-    if not file.filename:
-        raise HTTPException(status_code=400, detail="文件名无效")
-    filename = file.filename
-
-    # 验证文件扩展名
-    ext = Path(filename).suffix.lower()
-    if ext not in ALLOWED_AUDIO_EXTENSIONS:
-        raise HTTPException(
-            status_code=400,
-            detail=f"不支持的音频格式: {ext}。支持的格式: {', '.join(ALLOWED_AUDIO_EXTENSIONS)}"
-        )
-
-    # 验证 ref_text
-    if not ref_text or len(ref_text.strip()) < 2:
-        raise HTTPException(status_code=400, detail="参考文字不能为空")
-
+    """上传参考音频"""
    try:
-        # 创建临时文件
-        with tempfile.NamedTemporaryFile(delete=False, suffix=ext) as tmp_input:
-            content = await file.read()
-            tmp_input.write(content)
-            tmp_input_path = tmp_input.name
-
-        # 转换为 WAV 格式
-        tmp_wav_path = tmp_input_path + ".wav"
-        if ext != '.wav':
-            if not convert_to_wav(tmp_input_path, tmp_wav_path):
-                raise HTTPException(status_code=500, detail="音频格式转换失败")
-        else:
-            # 即使是 wav 也要标准化格式
-            convert_to_wav(tmp_input_path, tmp_wav_path)
-
-        # 获取音频时长
-        duration = get_audio_duration(tmp_wav_path)
-        if duration < 1.0:
-            raise HTTPException(status_code=400, detail="音频时长过短，至少需要 1 秒")
-        if duration > 60.0:
-            raise HTTPException(status_code=400, detail="音频时长过长，最多 60 秒")
-
-
-        # 3. 处理重名逻辑 (Friendly Display Name)
-        original_name = filename
-        
-        # 获取用户现有的所有参考音频列表 (为了检查文件名冲突)
-        # 注意: 这种列表方式在文件极多时性能一般，但考虑到单用户参考音频数量有限，目前可行
-        existing_files = await storage_service.list_files(BUCKET_REF_AUDIOS, user_id)
-        existing_names = set()
-        
-        # 预加载所有现有的 display name
-        # 这里需要并发请求 metadata 可能会慢，优化: 仅检查 metadata 文件并解析
-        # 简易方案: 仅在 metadata 中读取 original_filename 
-        # 但 list_files 返回的是 name，我们需要 metadata
-        # 考虑到性能，这里使用一种妥协方案：
-        # 我们不做全量检查，而是简单的检查：如果用户上传 myvoice.wav
-        # 我们看看有没有 (timestamp)_myvoice.wav 这种其实并不能准确判断 display name 是否冲突
-        # 
-        # 正确做法: 应该有个数据库表存 metadata。但目前是无数据库设计。
-        # 
-        # 改用简单方案: 
-        # 既然我们无法快速获取所有 display name，
-        # 我们暂时只处理 "在新上传时，original_filename 保持原样"
-        # 但用户希望 "如果在列表中看到重复的，自动加(1)"
-        # 
-        # 鉴于无数据库架构的限制，要在上传时知道"已有的 display name" 成本太高(需遍历下载所有json)。
-        # 
-        # 💡 替代方案: 
-        # 我们不检查旧的。我们只保证**存储**唯一。
-        # 对于用户提到的 "新上传的文件名后加个数字" -> 这通常是指 "另存为" 的逻辑。
-        # 既然用户现在的痛点是 "显示了时间戳太丑"，而我已经去掉了时间戳显示。
-        # 那么如果用户上传两个 "TEST.wav"，列表里就会有两个 "TEST.wav" (但时间不同)。
-        # 这其实是可以接受的。
-        # 
-        # 但如果用户强求 "自动重命名":
-        # 我们可以在这里做一个轻量级的 "同名检测"：
-        # 检查有没有 *_{original_name} 的文件存在。
-        # 如果 storage 里已经有 123_abc.wav, 456_abc.wav
-        # 我们可以认为 abc.wav 已经存在。
-        
-        dup_count = 0
-        search_suffix = f"_{original_name}" # 比如 _test.wav
-        
-        for f in existing_files:
-            fname = f.get('name', '')
-            if fname.endswith(search_suffix):
-                dup_count += 1
-                
-        final_display_name = original_name
-        if dup_count > 0:
-            name_stem = Path(original_name).stem
-            name_ext = Path(original_name).suffix
-            final_display_name = f"{name_stem}({dup_count}){name_ext}"
-            
-        # 生成存储路径 (唯一ID)
-        timestamp = int(time.time())
-        safe_name = sanitize_filename(Path(filename).stem)
-        storage_path = f"{user_id}/{timestamp}_{safe_name}.wav"
-
-        # 上传 WAV 文件到 Supabase
-        with open(tmp_wav_path, 'rb') as f:
-            wav_data = f.read()
-
-        await storage_service.upload_file(
-            bucket=BUCKET_REF_AUDIOS,
-            path=storage_path,
-            file_data=wav_data,
-            content_type="audio/wav"
-        )
-
-        # 上传元数据 JSON
-        metadata = {
-            "ref_text": ref_text.strip(),
-            "original_filename": final_display_name, # 这里的名字如果有重复会自动加(1)
-            "duration_sec": duration,
-            "created_at": timestamp
-        }
-        metadata_path = f"{user_id}/{timestamp}_{safe_name}.json"
-        await storage_service.upload_file(
-            bucket=BUCKET_REF_AUDIOS,
-            path=metadata_path,
-            file_data=json.dumps(metadata, ensure_ascii=False).encode('utf-8'),
-            content_type="application/json"
-        )
-
-        # 获取签名 URL
-        signed_url = await storage_service.get_signed_url(BUCKET_REF_AUDIOS, storage_path)
-
-        # 清理临时文件
-        os.unlink(tmp_input_path)
-        if os.path.exists(tmp_wav_path):
-            os.unlink(tmp_wav_path)
-
-        return success_response(RefAudioResponse(
-            id=storage_path,
-            name=filename,
-            path=signed_url,
-            ref_text=ref_text.strip(),
-            duration_sec=duration,
-            created_at=timestamp
-        ).model_dump())
-
-    except HTTPException:
-        raise
+        result = await service.upload_ref_audio(file, ref_text, user["id"])
+        return success_response(result)
+    except ValueError as e:
+        raise HTTPException(status_code=400, detail=str(e))
    except Exception as e:
        logger.error(f"上传参考音频失败: {e}")
        raise HTTPException(status_code=500, detail=f"上传失败: {str(e)}")
@@ -243,76 +30,9 @@ async def upload_ref_audio(
@router.get("")
 async def list_ref_audios(user: dict = Depends(get_current_user)):
    """列出当前用户的所有参考音频"""
-    user_id = user["id"]
-
    try:
-        # 列出用户目录下的文件
-        files = await storage_service.list_files(BUCKET_REF_AUDIOS, user_id)
-
-        # 过滤出 .wav 文件并获取对应的 metadata
-        items = []
-        for f in files:
-            name = f.get("name", "")
-            if not name.endswith(".wav"):
-                continue
-
-            storage_path = f"{user_id}/{name}"
-
-            # 尝试读取 metadata
-            metadata_name = name.replace(".wav", ".json")
-            metadata_path = f"{user_id}/{metadata_name}"
-
-            ref_text = ""
-            duration_sec = 0.0
-            created_at = 0
-            original_filename = ""
-
-            try:
-                # 获取 metadata 内容
-                metadata_url = await storage_service.get_signed_url(BUCKET_REF_AUDIOS, metadata_path)
-                import httpx
-                async with httpx.AsyncClient() as client:
-                    resp = await client.get(metadata_url)
-                    if resp.status_code == 200:
-                        metadata = resp.json()
-                        ref_text = metadata.get("ref_text", "")
-                        duration_sec = metadata.get("duration_sec", 0.0)
-                        created_at = metadata.get("created_at", 0)
-                        original_filename = metadata.get("original_filename", "")
-            except Exception as e:
-                logger.warning(f"读取 metadata 失败: {e}")
-                # 从文件名提取时间戳
-                try:
-                    created_at = int(name.split("_")[0])
-                except:
-                    pass
-
-            # 获取音频签名 URL
-            signed_url = await storage_service.get_signed_url(BUCKET_REF_AUDIOS, storage_path)
-
-            # 优先显示原始文件名 (去掉时间戳前缀)
-            display_name = original_filename if original_filename else name
-            # 如果原始文件名丢失，尝试从现有文件名中通过正则去掉时间戳
-            if not display_name or display_name == name:
-                 # 匹配 "1234567890_filename.wav"
-                 match = re.match(r'^\d+_(.+)$', name)
-                 if match:
-                     display_name = match.group(1)
-
-            items.append(RefAudioResponse(
-                id=storage_path,
-                name=display_name,
-                path=signed_url,
-                ref_text=ref_text,
-                duration_sec=duration_sec,
-                created_at=created_at
-            ))
-
-        # 按创建时间倒序排列
-        items.sort(key=lambda x: x.created_at, reverse=True)
-
-        return success_response(RefAudioListResponse(items=items).model_dump())
-
+        result = await service.list_ref_audios(user["id"])
+        return success_response(result)
    except Exception as e:
        logger.error(f"列出参考音频失败: {e}")
        raise HTTPException(status_code=500, detail=f"获取列表失败: {str(e)}")
@@ -321,96 +41,30 @@ async def list_ref_audios(user: dict = Depends(get_current_user)):
@router.delete("/{audio_id:path}")
 async def delete_ref_audio(audio_id: str, user: dict = Depends(get_current_user)):
    """删除参考音频"""
-    user_id = user["id"]
-
-    # 安全检查：确保只能删除自己的文件
-    if not audio_id.startswith(f"{user_id}/"):
-        raise HTTPException(status_code=403, detail="无权删除此文件")
-
    try:
-        # 删除 WAV 文件
-        await storage_service.delete_file(BUCKET_REF_AUDIOS, audio_id)
-
-        # 删除 metadata JSON
-        metadata_path = audio_id.replace(".wav", ".json")
-        try:
-            await storage_service.delete_file(BUCKET_REF_AUDIOS, metadata_path)
-        except:
-            pass  # metadata 可能不存在
-
+        await service.delete_ref_audio(audio_id, user["id"])
        return success_response(message="删除成功")
-
+    except PermissionError as e:
+        raise HTTPException(status_code=403, detail=str(e))
    except Exception as e:
        logger.error(f"删除参考音频失败: {e}")
        raise HTTPException(status_code=500, detail=f"删除失败: {str(e)}")


-class RenameRequest(BaseModel):
-    new_name: str
-
-
@router.put("/{audio_id:path}")
 async def rename_ref_audio(
    audio_id: str,
    request: RenameRequest,
    user: dict = Depends(get_current_user)
 ):
-    """重命名参考音频 (修改 metadata 中的 display name)"""
-    user_id = user["id"]
-
-    # 安全检查
-    if not audio_id.startswith(f"{user_id}/"):
-        raise HTTPException(status_code=403, detail="无权修改此文件")
-    
-    new_name = request.new_name.strip()
-    if not new_name:
-         raise HTTPException(status_code=400, detail="新名称不能为空")
-         
-    # 确保新名称有后缀 (保留原后缀或添加 .wav)
-    if not Path(new_name).suffix:
-        new_name += ".wav"
-
+    """重命名参考音频"""
    try:
-        # 1. 下载现有的 metadata
-        metadata_path = audio_id.replace(".wav", ".json")
-        try:
-             # 获取已有的 JSON
-             import httpx
-             metadata_url = await storage_service.get_signed_url(BUCKET_REF_AUDIOS, metadata_path)
-             if not metadata_url:
-                  # 如果 json 不存在，则需要新建一个基础的
-                  raise Exception("Metadata not found")
-             
-             async with httpx.AsyncClient() as client:
-                resp = await client.get(metadata_url)
-                if resp.status_code == 200:
-                    metadata = resp.json()
-                else:
-                    raise Exception(f"Failed to fetch metadata: {resp.status_code}")
-                    
-        except Exception as e:
-            logger.warning(f"无法读取元数据: {e}, 将创建新的元数据")
-            # 兜底：如果读取失败，构建最小元数据
-            metadata = {
-                "ref_text": "", # 可能丢失
-                "duration_sec": 0.0,
-                "created_at": int(time.time()),
-                "original_filename": new_name
-            }
-
-        # 2. 更新 original_filename
-        metadata["original_filename"] = new_name
-        
-        # 3. 覆盖上传 metadata
-        await storage_service.upload_file(
-            bucket=BUCKET_REF_AUDIOS,
-            path=metadata_path,
-            file_data=json.dumps(metadata, ensure_ascii=False).encode('utf-8'),
-            content_type="application/json"
-        )
-        
-        return success_response({"name": new_name}, message="重命名成功")
-
+        result = await service.rename_ref_audio(audio_id, request.new_name, user["id"])
+        return success_response(result, message="重命名成功")
+    except PermissionError as e:
+        raise HTTPException(status_code=403, detail=str(e))
+    except ValueError as e:
+        raise HTTPException(status_code=400, detail=str(e))
    except Exception as e:
        logger.error(f"重命名失败: {e}")
        raise HTTPException(status_code=500, detail=f"重命名失败: {str(e)}")
--- a/backend/app/modules/ref_audios/schemas.py
+++ b/backend/app/modules/ref_audios/schemas.py
@@ -0,0 +1,19 @@
+from pydantic import BaseModel
+from typing import List
+
+
+class RefAudioResponse(BaseModel):
+    id: str
+    name: str
+    path: str
+    ref_text: str
+    duration_sec: float
+    created_at: int
+
+
+class RefAudioListResponse(BaseModel):
+    items: List[RefAudioResponse]
+
+
+class RenameRequest(BaseModel):
+    new_name: str
--- a/backend/app/modules/ref_audios/service.py
+++ b/backend/app/modules/ref_audios/service.py
@@ -0,0 +1,269 @@
+import re
+import os
+import time
+import json
+import asyncio
+import subprocess
+import tempfile
+from pathlib import Path
+from typing import Optional
+
+import httpx
+from loguru import logger
+
+from app.services.storage import storage_service
+from app.modules.ref_audios.schemas import RefAudioResponse, RefAudioListResponse
+
+ALLOWED_AUDIO_EXTENSIONS = {'.wav', '.mp3', '.m4a', '.webm', '.ogg', '.flac', '.aac'}
+BUCKET_REF_AUDIOS = "ref-audios"
+
+
+def sanitize_filename(filename: str) -> str:
+    """清理文件名，移除特殊字符"""
+    safe_name = re.sub(r'[<>:"/\\|?*\s]', '_', filename)
+    if len(safe_name) > 50:
+        ext = Path(safe_name).suffix
+        safe_name = safe_name[:50 - len(ext)] + ext
+    return safe_name
+
+
+def _get_audio_duration(file_path: str) -> float:
+    """获取音频时长 (秒)"""
+    try:
+        result = subprocess.run(
+            ['ffprobe', '-v', 'quiet', '-show_entries', 'format=duration',
+             '-of', 'csv=p=0', file_path],
+            capture_output=True, text=True, timeout=10
+        )
+        return float(result.stdout.strip())
+    except Exception as e:
+        logger.warning(f"获取音频时长失败: {e}")
+        return 0.0
+
+
+def _convert_to_wav(input_path: str, output_path: str) -> bool:
+    """将音频转换为 WAV 格式 (16kHz, mono)"""
+    try:
+        subprocess.run([
+            'ffmpeg', '-y', '-i', input_path,
+            '-ar', '16000',
+            '-ac', '1',
+            '-acodec', 'pcm_s16le',
+            output_path
+        ], capture_output=True, timeout=60, check=True)
+        return True
+    except Exception as e:
+        logger.error(f"音频转换失败: {e}")
+        return False
+
+
+async def upload_ref_audio(file, ref_text: str, user_id: str) -> dict:
+    """上传参考音频：转码、获取时长、存储到 Supabase"""
+    if not file.filename:
+        raise ValueError("文件名无效")
+    filename = file.filename
+
+    ext = Path(filename).suffix.lower()
+    if ext not in ALLOWED_AUDIO_EXTENSIONS:
+        raise ValueError(f"不支持的音频格式: {ext}。支持的格式: {', '.join(ALLOWED_AUDIO_EXTENSIONS)}")
+
+    if not ref_text or len(ref_text.strip()) < 2:
+        raise ValueError("参考文字不能为空")
+
+    # 创建临时文件
+    with tempfile.NamedTemporaryFile(delete=False, suffix=ext) as tmp_input:
+        content = await file.read()
+        tmp_input.write(content)
+        tmp_input_path = tmp_input.name
+
+    try:
+        # 转换为 WAV 格式
+        tmp_wav_path = tmp_input_path + ".wav"
+        if not _convert_to_wav(tmp_input_path, tmp_wav_path):
+            raise RuntimeError("音频格式转换失败")
+
+        # 获取音频时长
+        duration = _get_audio_duration(tmp_wav_path)
+        if duration < 1.0:
+            raise ValueError("音频时长过短，至少需要 1 秒")
+        if duration > 60.0:
+            raise ValueError("音频时长过长，最多 60 秒")
+
+        # 检查重名
+        existing_files = await storage_service.list_files(BUCKET_REF_AUDIOS, user_id)
+        dup_count = 0
+        search_suffix = f"_{filename}"
+        for f in existing_files:
+            fname = f.get('name', '')
+            if fname.endswith(search_suffix):
+                dup_count += 1
+
+        final_display_name = filename
+        if dup_count > 0:
+            name_stem = Path(filename).stem
+            name_ext = Path(filename).suffix
+            final_display_name = f"{name_stem}({dup_count}){name_ext}"
+
+        # 生成存储路径
+        timestamp = int(time.time())
+        safe_name = sanitize_filename(Path(filename).stem)
+        storage_path = f"{user_id}/{timestamp}_{safe_name}.wav"
+
+        # 上传 WAV 文件
+        with open(tmp_wav_path, 'rb') as f:
+            wav_data = f.read()
+
+        await storage_service.upload_file(
+            bucket=BUCKET_REF_AUDIOS,
+            path=storage_path,
+            file_data=wav_data,
+            content_type="audio/wav"
+        )
+
+        # 上传元数据 JSON
+        metadata = {
+            "ref_text": ref_text.strip(),
+            "original_filename": final_display_name,
+            "duration_sec": duration,
+            "created_at": timestamp
+        }
+        metadata_path = f"{user_id}/{timestamp}_{safe_name}.json"
+        await storage_service.upload_file(
+            bucket=BUCKET_REF_AUDIOS,
+            path=metadata_path,
+            file_data=json.dumps(metadata, ensure_ascii=False).encode('utf-8'),
+            content_type="application/json"
+        )
+
+        # 获取签名 URL
+        signed_url = await storage_service.get_signed_url(BUCKET_REF_AUDIOS, storage_path)
+
+        return RefAudioResponse(
+            id=storage_path,
+            name=filename,
+            path=signed_url,
+            ref_text=ref_text.strip(),
+            duration_sec=duration,
+            created_at=timestamp
+        ).model_dump()
+
+    finally:
+        os.unlink(tmp_input_path)
+        if os.path.exists(tmp_input_path + ".wav"):
+            os.unlink(tmp_input_path + ".wav")
+
+
+async def list_ref_audios(user_id: str) -> dict:
+    """列出用户的所有参考音频"""
+    files = await storage_service.list_files(BUCKET_REF_AUDIOS, user_id)
+    wav_files = [f for f in files if f.get("name", "").endswith(".wav")]
+
+    if not wav_files:
+        return RefAudioListResponse(items=[]).model_dump()
+
+    async def fetch_audio_info(f):
+        name = f.get("name", "")
+        storage_path = f"{user_id}/{name}"
+        metadata_name = name.replace(".wav", ".json")
+        metadata_path = f"{user_id}/{metadata_name}"
+
+        ref_text = ""
+        duration_sec = 0.0
+        created_at = 0
+        original_filename = ""
+
+        try:
+            metadata_url = await storage_service.get_signed_url(BUCKET_REF_AUDIOS, metadata_path)
+            async with httpx.AsyncClient(timeout=5.0) as client:
+                resp = await client.get(metadata_url)
+                if resp.status_code == 200:
+                    metadata = resp.json()
+                    ref_text = metadata.get("ref_text", "")
+                    duration_sec = metadata.get("duration_sec", 0.0)
+                    created_at = metadata.get("created_at", 0)
+                    original_filename = metadata.get("original_filename", "")
+        except Exception as e:
+            logger.debug(f"读取 metadata 失败: {e}")
+            try:
+                created_at = int(name.split("_")[0])
+            except:
+                pass
+
+        signed_url = await storage_service.get_signed_url(BUCKET_REF_AUDIOS, storage_path)
+
+        display_name = original_filename if original_filename else name
+        if not display_name or display_name == name:
+            match = re.match(r'^\d+_(.+)$', name)
+            if match:
+                display_name = match.group(1)
+
+        return RefAudioResponse(
+            id=storage_path,
+            name=display_name,
+            path=signed_url,
+            ref_text=ref_text,
+            duration_sec=duration_sec,
+            created_at=created_at
+        )
+
+    items = await asyncio.gather(*[fetch_audio_info(f) for f in wav_files])
+    items = sorted(items, key=lambda x: x.created_at, reverse=True)
+
+    return RefAudioListResponse(items=items).model_dump()
+
+
+async def delete_ref_audio(audio_id: str, user_id: str) -> None:
+    """删除参考音频及其元数据"""
+    if not audio_id.startswith(f"{user_id}/"):
+        raise PermissionError("无权删除此文件")
+
+    await storage_service.delete_file(BUCKET_REF_AUDIOS, audio_id)
+
+    metadata_path = audio_id.replace(".wav", ".json")
+    try:
+        await storage_service.delete_file(BUCKET_REF_AUDIOS, metadata_path)
+    except:
+        pass
+
+
+async def rename_ref_audio(audio_id: str, new_name: str, user_id: str) -> dict:
+    """重命名参考音频（修改 metadata 中的 display name）"""
+    if not audio_id.startswith(f"{user_id}/"):
+        raise PermissionError("无权修改此文件")
+
+    new_name = new_name.strip()
+    if not new_name:
+        raise ValueError("新名称不能为空")
+
+    if not Path(new_name).suffix:
+        new_name += ".wav"
+
+    # 下载现有 metadata
+    metadata_path = audio_id.replace(".wav", ".json")
+    try:
+        metadata_url = await storage_service.get_signed_url(BUCKET_REF_AUDIOS, metadata_path)
+        async with httpx.AsyncClient() as client:
+            resp = await client.get(metadata_url)
+            if resp.status_code == 200:
+                metadata = resp.json()
+            else:
+                raise Exception(f"Failed to fetch metadata: {resp.status_code}")
+    except Exception as e:
+        logger.warning(f"无法读取元数据: {e}, 将创建新的元数据")
+        metadata = {
+            "ref_text": "",
+            "duration_sec": 0.0,
+            "created_at": int(time.time()),
+            "original_filename": new_name
+        }
+
+    # 更新并覆盖上传
+    metadata["original_filename"] = new_name
+    await storage_service.upload_file(
+        bucket=BUCKET_REF_AUDIOS,
+        path=metadata_path,
+        file_data=json.dumps(metadata, ensure_ascii=False).encode('utf-8'),
+        content_type="application/json"
+    )
+
+    return {"name": new_name}
--- a/backend/app/modules/tools/router.py
+++ b/backend/app/modules/tools/router.py
@@ -1,407 +1,32 @@
 from fastapi import APIRouter, UploadFile, File, Form, HTTPException
-from typing import Optional, Any, cast
-import asyncio
-import shutil
-import os
-import time
-from pathlib import Path
-from loguru import logger
+from typing import Optional
 import traceback
-import re
-import json
-import requests
-from urllib.parse import unquote
+from loguru import logger

-from app.services.whisper_service import whisper_service
-from app.services.glm_service import glm_service
 from app.core.response import success_response
+from app.modules.tools import service

 router = APIRouter()

+
@router.post("/extract-script")
 async def extract_script_tool(
    file: Optional[UploadFile] = File(None),
    url: Optional[str] = Form(None),
    rewrite: bool = Form(True)
 ):
-    """
-    独立文案提取工具
-    支持上传视频/音频 OR 输入视频链接 -> 提取文字 -> (可选) AI洗稿
-    """
-    if not file and not url:
-        raise HTTPException(400, "必须提供文件或视频链接")
-
-    temp_path = None
+    """独立文案提取工具"""
    try:
-        timestamp = int(time.time())
-        temp_dir = Path("/tmp")
-        if os.name == 'nt':
-            temp_dir = Path("d:/tmp")
-        temp_dir.mkdir(parents=True, exist_ok=True)
-
-        # 1. 获取/保存文件
-        loop = asyncio.get_event_loop()
-        
-        if file:
-            filename = file.filename
-            if not filename:
-                raise HTTPException(400, "文件名无效")
-            safe_filename = Path(filename).name.replace(" ", "_")
-            temp_path = temp_dir / f"tool_extract_{timestamp}_{safe_filename}"
-            # 文件 I/O 放入线程池
-            await loop.run_in_executor(None, lambda: shutil.copyfileobj(file.file, open(temp_path, "wb")))
-            logger.info(f"Tool processing upload file: {temp_path}")
-        else:
-            if not url:
-                raise HTTPException(400, "必须提供视频链接")
-            url_value: str = url
-            # URL 下载逻辑
-            # 自动提取文案中的链接 (支持 Douyin/Bilibili 等分享文案)
-            url_match = re.search(r'https?://[^\s]+', url_value)
-            if url_match:
-                extracted_url = url_match.group(0)
-                logger.info(f"Extracted URL from text: {extracted_url}")
-                url_value = extracted_url
-            
-            logger.info(f"Tool downloading URL: {url_value}")
-            
-            # 封装 yt-dlp 下载函数 (Blocking)
-            def _download_yt_dlp():
-                import yt_dlp
-                logger.info("Attempting download with yt-dlp...")
-
-                ydl_opts = {
-                    'format': 'bestaudio/best',
-                    'outtmpl': str(temp_dir / f"tool_download_{timestamp}_%(id)s.%(ext)s"),
-                    'quiet': True,
-                    'no_warnings': True,
-                    'http_headers': {
-                         'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.124 Safari/537.36',
-                         'Referer': 'https://www.douyin.com/',
-                    }
-                }
-                
-                with yt_dlp.YoutubeDL() as ydl_raw:
-                    ydl: Any = ydl_raw
-                    ydl.params.update(ydl_opts)
-                    info = ydl.extract_info(url_value, download=True)
-                    if 'requested_downloads' in info:
-                        downloaded_file = info['requested_downloads'][0]['filepath']
-                    else:
-                        ext = info.get('ext', 'mp4')
-                        id = info.get('id')
-                        downloaded_file = str(temp_dir / f"tool_download_{timestamp}_{id}.{ext}")
-                    
-                    return Path(downloaded_file)
-
-            # 先尝试 yt-dlp (Run in Executor)
-            try:
-                temp_path = await loop.run_in_executor(None, _download_yt_dlp)
-                logger.info(f"yt-dlp downloaded to: {temp_path}")
-
-            except Exception as e:
-                logger.warning(f"yt-dlp download failed: {e}. Trying manual Douyin fallback...")
-                
-                # 失败则尝试手动解析 (Douyin Fallback)
-                if "douyin" in url_value:
-                    manual_path = await download_douyin_manual(url_value, temp_dir, timestamp)
-                    if manual_path:
-                        temp_path = manual_path
-                        logger.info(f"Manual Douyin fallback successful: {temp_path}")
-                    else:
-                         raise HTTPException(400, f"视频下载失败。yt-dlp 报错: {str(e)}")
-                elif "bilibili" in url_value:
-                    manual_path = await download_bilibili_manual(url_value, temp_dir, timestamp)
-                    if manual_path:
-                        temp_path = manual_path
-                        logger.info(f"Manual Bilibili fallback successful: {temp_path}")
-                    else:
-                         raise HTTPException(400, f"视频下载失败。yt-dlp 报错: {str(e)}")
-                else:
-                    raise HTTPException(400, f"视频下载失败: {str(e)}")
-
-        if not temp_path or not temp_path.exists():
-             raise HTTPException(400, "文件获取失败")
-        
-        # 1.5 安全转换: 强制转为 WAV (16k)
-        import subprocess
-        audio_path = temp_dir / f"extract_audio_{timestamp}.wav"
-        
-        def _convert_audio():
-            try:
-                convert_cmd = [
-                    'ffmpeg',
-                    '-i', str(temp_path),
-                    '-vn', # 忽略视频
-                    '-acodec', 'pcm_s16le',
-                    '-ar', '16000', # Whisper 推荐采样率
-                    '-ac', '1',    # 单声道
-                    '-y',          # 覆盖
-                    str(audio_path)
-                ]
-                # 捕获 stderr
-                subprocess.run(convert_cmd, check=True, stdout=subprocess.PIPE, stderr=subprocess.PIPE)
-                return True
-            except subprocess.CalledProcessError as e:
-                error_log = e.stderr.decode('utf-8', errors='ignore') if e.stderr else str(e)
-                logger.error(f"FFmpeg check/convert failed: {error_log}")
-                # 检查是否为 HTML
-                head = b""
-                try:
-                    with open(temp_path, 'rb') as f:
-                        head = f.read(100)
-                except: pass
-                if b'<!DOCTYPE html' in head or b'<html' in head:
-                    raise ValueError("HTML_DETECTED")
-                raise ValueError("CONVERT_FAILED")
-
-        # 执行转换 (Run in Executor)
-        try:
-            await loop.run_in_executor(None, _convert_audio)
-            logger.info(f"Converted to WAV: {audio_path}")
-            target_path = audio_path
-        except ValueError as ve:
-            if str(ve) == "HTML_DETECTED":
-                 raise HTTPException(400, "下载的文件是网页而非视频，请重试或手动上传。")
-            else:
-                 raise HTTPException(400, "下载的文件已损坏或格式无法识别。")
-            
-        # 2. 提取文案 (Whisper)
-        script = await whisper_service.transcribe(str(target_path))
-        
-        # 3. AI 洗稿 (GLM)
-        rewritten = None
-        if rewrite:
-            if script and len(script.strip()) > 0:
-                logger.info("Rewriting script...")
-                rewritten = await glm_service.rewrite_script(script)
-            else:
-                logger.warning("No script extracted, skipping rewrite")
-
-        return success_response({
-            "original_script": script,
-            "rewritten_script": rewritten
-        })
-
-    except HTTPException as he:
-        raise he
+        result = await service.extract_script(file=file, url=url, rewrite=rewrite)
+        return success_response(result)
+    except ValueError as e:
+        raise HTTPException(400, str(e))
+    except HTTPException:
+        raise
    except Exception as e:
        logger.error(f"Tool extract failed: {e}")
        logger.error(traceback.format_exc())
-        
-        # Friendly error message
        msg = str(e)
        if "Fresh cookies" in msg:
            msg = "下载失败：目标平台开启了反爬验证，请过段时间重试或直接上传视频文件。"
-            
        raise HTTPException(500, f"提取失败: {msg}")
-    finally:
-        # 清理临时文件
-        if temp_path and temp_path.exists():
-            try:
-                os.remove(temp_path)
-                logger.info(f"Cleaned up temp file: {temp_path}")
-            except Exception as e:
-                logger.warning(f"Failed to cleanup temp file {temp_path}: {e}")
-
-
-async def download_douyin_manual(url: str, temp_dir: Path, timestamp: int) -> Optional[Path]:
-    """
-    手动下载抖音视频 (Fallback logic - Ported from SuperIPAgent/douyinDownloader)
-    使用特定的 User Profile URL 和硬编码 Cookie 绕过反爬
-    """
-    logger.info(f"[SuperIPAgent] Starting download for: {url}")
-    
-    try:
-        # 1. 提取 Modal ID (支持短链跳转)
-        headers = {
-            "user-agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/131.0.0.0 Safari/537.36"
-        }
-        
-        # 如果是短链或重定向
-        resp = requests.get(url, headers=headers, allow_redirects=True, timeout=10)
-        final_url = resp.url
-        logger.info(f"[SuperIPAgent] Final URL: {final_url}")
-        
-        modal_id = None
-        match = re.search(r'/video/(\d+)', final_url)
-        if match:
-            modal_id = match.group(1)
-        
-        if not modal_id:
-            logger.error("[SuperIPAgent] Could not extract modal_id")
-            return None
-
-        logger.info(f"[SuperIPAgent] Extracted modal_id: {modal_id}")
-        
-        # 2. 构造特定请求 URL (Copy from SuperIPAgent)
-        # 使用特定用户的 Profile 页 + modal_id 参数，配合特定 Cookie
-        target_url = f"https://www.douyin.com/user/MS4wLjABAAAAN_s_hups7LD0N4qnrM3o2gI0vuG3pozNaEolz2_py3cHTTrpVr1Z4dukFD9SOlwY?from_tab_name=main&modal_id={modal_id}"
-        
-        # 3. 使用硬编码 Cookie (Copy from SuperIPAgent)
-        headers_with_cookie = {
-            "accept": "text/html,application/xhtml+xml,application/xml;q=0.9,image/avif,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.7",
-            "cookie": "douyin.com; device_web_cpu_core=10; device_web_memory_size=8; __ac_nonce=06760391f00b9b51264ae; __ac_signature=_02B4Z6wo00f019a5ceAAAIDAhEZR-X3jjWfWmXVAAJLXd4; ttwid=1%7C7MTKBSMsP4eOv9h5NAh8p0E-NYIud09ftNmB0mjLpWc%7C1734359327%7C8794abeabbd47447e1f56e5abc726be089f2a0344d6343b5f75f23e7b0f0028f; UIFID_TEMP=0de8750d2b188f4235dbfd208e44abbb976428f0720eb983255afefa45d39c0c6532e1d4768dd8587bf919f866ff1396912bcb2af71efee56a14a2a9f37b74010d0a0413795262f6d4afe02a032ac7ab; s_v_web_id=verify_m4r4ribr_c7krmY1z_WoeI_43po_ATpO_I4o8U1bex2D7; hevc_supported=true; home_can_add_dy_2_desktop=%220%22; dy_swidth=2560; dy_sheight=1440; stream_recommend_feed_params=%22%7B%5C%22cookie_enabled%5C%22%3Atrue%2C%5C%22screen_width%5C%22%3A2560%2C%5C%22screen_height%5C%22%3A1440%2C%5C%22browser_online%5C%22%3Atrue%2C%5C%22cpu_core_num%5C%22%3A10%2C%5C%22device_memory%5C%22%3A8%2C%5C%22downlink%5C%22%3A10%2C%5C%22effective_type%5C%22%3A%5C%224g%5C%22%2C%5C%22round_trip_time%5C%22%3A50%7D%22; strategyABtestKey=%221734359328.577%22; csrf_session_id=2f53aed9aa6974e83aa9a1014180c3a4; fpk1=U2FsdGVkX1/IpBh0qdmlKAVhGyYHgur4/VtL9AReZoeSxadXn4juKvsakahRGqjxOPytHWspYoBogyhS/V6QSw==; fpk2=0845b309c7b9b957afd9ecf775a4c21f; passport_csrf_token=d80e0c5b2fa2328219856be5ba7e671e; passport_csrf_token_default=d80e0c5b2fa2328219856be5ba7e671e; odin_tt=3c891091d2eb0f4718c1d5645bc4a0017032d4d5aa989decb729e9da2ad570918cbe5e9133dc6b145fa8c758de98efe32ff1f81aa0d611e838cc73ab08ef7d3f6adf66ab4d10e8372ddd628f94f16b8e; volume_info=%7B%22isUserMute%22%3Afalse%2C%22isMute%22%3Afalse%2C%22volume%22%3A0.5%7D; bd_ticket_guard_client_web_domain=2; FORCE_LOGIN=%7B%22videoConsumedRemainSeconds%22%3A180%7D; UIFID=0de8750d2b188f4235dbfd208e44abbb976428f0720eb983255afefa45d39c0c6532e1d4768dd8587bf919f866ff139655a3c2b735923234f371c699560c657923fd3d6c5b63ab7bb9b83423b6cb4787e2ce66a7fbc4ecb24c8570f520fe6de068bbb95115023c0c6c1b6ee31b49fb7e3996fb8349f43a3fd8b7a61cd9e18e8fe65eb6a7c13de4c0960d84e344b644725db3eb2fa6b7caf821de1b50527979f2; is_dash_user=1; biz_trace_id=b57a241f; bd_ticket_guard_client_data=eyJiZC10aWNrZXQtZ3VhcmQtdmVyc2lvbiI6MiwiYmQtdGlja2V0LWd1YXJkLWl0ZXJhdGlvbi12ZXJzaW9uIjoxLCJiZC10aWNrZXQtZ3VhcmQtcmVlLXB1YmxpYy1rZXkiOiJCTEo2R0lDalVoWW1XcHpGOFdrN0Vrc0dXcCtaUzNKY1g4NGNGY2k0TTl1TEowNjdUb21mbFU5aDdvWVBGamhNRWNRQWtKdnN1MnM3RmpTWnlJQXpHMjA9IiwiYmQtdGlja2V0LWd1YXJkLXdlYi12ZXJzaW9uIjoyfQ%3D%3D; download_guide=%221%2F20241216%2F0%22; sdk_source_info=7e276470716a68645a606960273f276364697660272927676c715a6d6069756077273f276364697660272927666d776a68605a607d71606b766c6a6b5a7666776c7571273f275e58272927666a6b766a69605a696c6061273f27636469766027292762696a6764695a7364776c6467696076273f275e5827292771273f273d33323131333c3036313632342778; bit_env=RiOY4jzzpxZoVCl6zdVSVhVRjdwHRTxqcqWdqMBZLPGjMdB4Tax1kAELHNTVAAh72KuhumewE4Lq6f0-VJ2UpJrkrhSxoPw9LUb3zQrq1OSwbeSPHkRlRgRQvO89sItdGUyq1oFr0XyRCnMYG87KSeWyc4x0czGR0o50hTDoDLG5rJVoRcdQOLvjiAegsqyytKF59sPX_QM9qffK2SqYsg0hCggURc_AI6kguDDE5DvG0bnyz1utw4z1eEnIoLrkGDqzqBZj4dOAr0BVU6ofbsS-pOQ2u2PM1dLP9FlBVBlVaqYVgHJeSLsR5k76BRTddUjTb4zEilVIEwAMJWGN4I1BxVt6fC9B5tBQpuT0lj3n3eKXCKXZsd8FrEs5_pbfDsxV-e_WMiXI2ff4qxiTC0U73sfo9OpicKICtZjdq8qsHxJuu6wVR36zvXeL2Wch5C6MzprNvkivv0l8nbh2mSgy1nabZr3dmU6NcR-Bg3Q3xTWUlR9aAUmpopC-cNuXjgLpT-Lw1AYGilSUnCvosth1Gfypq-b0MpgmdSDgTrQ%3D; gulu_source_res=eyJwX2luIjoiMDhjOGQ3ZTJiODQyNjZkZWI5Y2VkMGJiODNlNmY1ZWY0ZjMyNTE2ZmYyZjAzNDMzZjI0OWU1Y2Q1NTczNTk5NyJ9; passport_auth_mix_state=hp9bc3dgb1tm5wd8p82zawus27g0e3ue; IsDouyinActive=false",
-            "user-agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/131.0.0.0 Safari/537.36",
-        }
-        
-        logger.info(f"[SuperIPAgent] Requesting page with Cookie...")
-        # 必须 verify=False 否则有些环境会报错
-        response = requests.get(target_url, headers=headers_with_cookie, timeout=10)
-        
-        # 4. 解析 RENDER_DATA
-        content_match = re.findall(r'<script id="RENDER_DATA" type="application/json">(.*?)</script>', response.text)
-        if not content_match:
-             # 尝试解码后再查找？或者结构变了
-             # 再尝试找 SSR_HYDRATED_DATA
-             if "SSR_HYDRATED_DATA" in response.text:
-                 content_match = re.findall(r'<script id="SSR_HYDRATED_DATA" type="application/json">(.*?)</script>', response.text)
-        
-        if not content_match:
-             logger.error(f"[SuperIPAgent] Could not find RENDER_DATA in page (len={len(response.text)})")
-             return None
-
-        content = unquote(content_match[0])
-        try:
-            data = json.loads(content)
-        except:
-            logger.error("[SuperIPAgent] JSON decode failed")
-            return None
-            
-        # 5. 提取视频流
-        video_url = None
-        try:
-            # 路径通常是: app -> videoDetail -> video -> bitRateList -> playAddr -> src
-            if "app" in data and "videoDetail" in data["app"]:
-                 info = data["app"]["videoDetail"]["video"]
-                 if "bitRateList" in info and info["bitRateList"]:
-                     video_url = info["bitRateList"][0]["playAddr"][0]["src"]
-                 elif "playAddr" in info and info["playAddr"]:
-                      video_url = info["playAddr"][0]["src"]
-        except Exception as e:
-            logger.error(f"[SuperIPAgent] Path extraction failed: {e}")
-            
-        if not video_url:
-            logger.error("[SuperIPAgent] No video_url found")
-            return None
-            
-        if video_url.startswith("//"):
-            video_url = "https:" + video_url
-            
-        logger.info(f"[SuperIPAgent] Found video URL: {video_url[:50]}...")
-        
-        # 6. 下载 (带 Header)
-        temp_path = temp_dir / f"douyin_manual_{timestamp}.mp4"
-        download_headers = {
-            'Referer': 'https://www.douyin.com/',
-            'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/131.0.0.0 Safari/537.36',
-        }
-        
-        dl_resp = requests.get(video_url, headers=download_headers, stream=True, timeout=60)
-        if dl_resp.status_code == 200:
-             with open(temp_path, 'wb') as f:
-                 for chunk in dl_resp.iter_content(chunk_size=1024):
-                     f.write(chunk)
-                     
-             logger.info(f"[SuperIPAgent] Downloaded successfully: {temp_path}")
-             return temp_path
-        else:
-             logger.error(f"[SuperIPAgent] Download failed: {dl_resp.status_code}")
-             return None
-
-    except Exception as e:
-        logger.error(f"[SuperIPAgent] Logic failed: {e}")
-        return None
-
-async def download_bilibili_manual(url: str, temp_dir: Path, timestamp: int) -> Optional[Path]:
-    """
-    手动下载 Bilibili 视频 (Fallback logic - Playwright Version)
-    B站通常音视频分离，这里只提取音频即可（因为只需要文案）
-    """
-    from playwright.async_api import async_playwright
-    
-    logger.info(f"[Playwright] Starting Bilibili download for: {url}")
-    
-    playwright = None
-    browser = None
-    try:
-        playwright = await async_playwright().start()
-        # Launch browser (ensure chromium is installed: playwright install chromium)
-        browser = await playwright.chromium.launch(headless=True, args=['--no-sandbox', '--disable-setuid-sandbox'])
-        
-        # Mobile User Agent often gives single stream?
-        # But Bilibili mobile web is tricky. Desktop is fine.
-        context = await browser.new_context(
-            user_agent="Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36"
-        )
-        
-        page = await context.new_page()
-        
-        # Intercept audio responses?
-        # Bilibili streams are usually .m4s 
-        # But finding the initial state is easier.
-        
-        logger.info("[Playwright] Navigating to Bilibili...")
-        await page.goto(url, timeout=45000)
-        
-        # Wait for video element (triggers loading)
-        try:
-            await page.wait_for_selector('video', timeout=15000)
-        except:
-             logger.warning("[Playwright] Video selector timeout")
-
-        # 1. Try extracting from __playinfo__
-        # window.__playinfo__ contains dash streams
-        playinfo = await page.evaluate("window.__playinfo__")
-        
-        audio_url = None
-        
-        if playinfo and "data" in playinfo and "dash" in playinfo["data"]:
-            dash = playinfo["data"]["dash"]
-            if "audio" in dash and dash["audio"]:
-                audio_url = dash["audio"][0]["baseUrl"]
-                logger.info(f"[Playwright] Found audio stream in __playinfo__: {audio_url[:50]}...")
-        
-        # 2. If playinfo fails, try extracting video src (sometimes it's a blob, which we can't fetch easily without interception)
-        # But interception is complex. Let's try requests with Referer if we have URL.
-        
-        if not audio_url:
-            logger.warning("[Playwright] Could not find audio in __playinfo__")
-            return None
-            
-        # Download the audio stream
-        temp_path = temp_dir / f"bilibili_audio_{timestamp}.m4s" # usually m4s
-        
-        try:
-            api_request = context.request
-            headers = {
-                "User-Agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
-                "Referer": "https://www.bilibili.com/"
-            }
-            
-            logger.info(f"[Playwright] Downloading audio stream...")
-            response = await api_request.get(audio_url, headers=headers)
-            
-            if response.status == 200:
-                body = await response.body()
-                with open(temp_path, 'wb') as f:
-                    f.write(body)
-                    
-                logger.info(f"[Playwright] Downloaded successfully: {temp_path}")
-                return temp_path
-            else:
-                logger.error(f"[Playwright] API Request failed: {response.status}")
-                return None
-                
-        except Exception as e:
-             logger.error(f"[Playwright] Download logic error: {e}")
-             return None
-
-    except Exception as e:
-        logger.error(f"[Playwright] Bilibili download failed: {e}")
-        return None
-    finally:
-        if browser:
-            await browser.close()
-        if playwright:
-            await playwright.stop()
--- a/backend/app/modules/tools/schemas.py
+++ b/backend/app/modules/tools/schemas.py
@@ -0,0 +1,7 @@
+from pydantic import BaseModel
+from typing import Optional
+
+
+class ExtractScriptResponse(BaseModel):
+    original_script: Optional[str] = None
+    rewritten_script: Optional[str] = None
--- a/backend/app/modules/tools/service.py
+++ b/backend/app/modules/tools/service.py
@@ -0,0 +1,355 @@
+import asyncio
+import os
+import re
+import json
+import time
+import shutil
+import subprocess
+import traceback
+from pathlib import Path
+from typing import Optional, Any
+from urllib.parse import unquote
+
+import httpx
+from loguru import logger
+
+from app.services.whisper_service import whisper_service
+from app.services.glm_service import glm_service
+
+
+async def extract_script(file=None, url: Optional[str] = None, rewrite: bool = True) -> dict:
+    """
+    文案提取：上传文件或视频链接 -> Whisper 转写 -> (可选) GLM 洗稿
+    """
+    if not file and not url:
+        raise ValueError("必须提供文件或视频链接")
+
+    temp_path = None
+    try:
+        timestamp = int(time.time())
+        temp_dir = Path("/tmp")
+        if os.name == 'nt':
+            temp_dir = Path("d:/tmp")
+        temp_dir.mkdir(parents=True, exist_ok=True)
+
+        loop = asyncio.get_event_loop()
+
+        # 1. 获取/保存文件
+        if file:
+            filename = file.filename
+            if not filename:
+                raise ValueError("文件名无效")
+            safe_filename = Path(filename).name.replace(" ", "_")
+            temp_path = temp_dir / f"tool_extract_{timestamp}_{safe_filename}"
+            await loop.run_in_executor(None, lambda: shutil.copyfileobj(file.file, open(temp_path, "wb")))
+            logger.info(f"Tool processing upload file: {temp_path}")
+        else:
+            temp_path = await _download_video(url, temp_dir, timestamp)
+
+        if not temp_path or not temp_path.exists():
+            raise ValueError("文件获取失败")
+
+        # 1.5 安全转换: 强制转为 WAV (16k)
+        audio_path = temp_dir / f"extract_audio_{timestamp}.wav"
+        try:
+            await loop.run_in_executor(None, lambda: _convert_to_wav(temp_path, audio_path))
+            logger.info(f"Converted to WAV: {audio_path}")
+        except ValueError as ve:
+            if str(ve) == "HTML_DETECTED":
+                raise ValueError("下载的文件是网页而非视频，请重试或手动上传。")
+            else:
+                raise ValueError("下载的文件已损坏或格式无法识别。")
+
+        # 2. 提取文案 (Whisper)
+        script = await whisper_service.transcribe(str(audio_path))
+
+        # 3. AI 洗稿 (GLM)
+        rewritten = None
+        if rewrite and script and len(script.strip()) > 0:
+            logger.info("Rewriting script...")
+            rewritten = await glm_service.rewrite_script(script)
+
+        return {
+            "original_script": script,
+            "rewritten_script": rewritten
+        }
+
+    finally:
+        if temp_path and temp_path.exists():
+            try:
+                os.remove(temp_path)
+                logger.info(f"Cleaned up temp file: {temp_path}")
+            except Exception as e:
+                logger.warning(f"Failed to cleanup temp file {temp_path}: {e}")
+
+
+def _convert_to_wav(input_path: Path, output_path: Path) -> None:
+    """FFmpeg 转换为 16k WAV"""
+    try:
+        convert_cmd = [
+            'ffmpeg',
+            '-i', str(input_path),
+            '-vn',
+            '-acodec', 'pcm_s16le',
+            '-ar', '16000',
+            '-ac', '1',
+            '-y',
+            str(output_path)
+        ]
+        subprocess.run(convert_cmd, check=True, stdout=subprocess.PIPE, stderr=subprocess.PIPE)
+    except subprocess.CalledProcessError as e:
+        error_log = e.stderr.decode('utf-8', errors='ignore') if e.stderr else str(e)
+        logger.error(f"FFmpeg check/convert failed: {error_log}")
+        head = b""
+        try:
+            with open(input_path, 'rb') as f:
+                head = f.read(100)
+        except:
+            pass
+        if b'<!DOCTYPE html' in head or b'<html' in head:
+            raise ValueError("HTML_DETECTED")
+        raise ValueError("CONVERT_FAILED")
+
+
+async def _download_video(url: str, temp_dir: Path, timestamp: int) -> Path:
+    """下载视频（yt-dlp 优先，失败回退手动解析）"""
+    url_value = url
+    url_match = re.search(r'https?://[^\s]+', url_value)
+    if url_match:
+        extracted_url = url_match.group(0)
+        logger.info(f"Extracted URL from text: {extracted_url}")
+        url_value = extracted_url
+
+    logger.info(f"Tool downloading URL: {url_value}")
+    loop = asyncio.get_event_loop()
+
+    # 先尝试 yt-dlp
+    try:
+        temp_path = await loop.run_in_executor(None, lambda: _download_yt_dlp(url_value, temp_dir, timestamp))
+        logger.info(f"yt-dlp downloaded to: {temp_path}")
+        return temp_path
+    except Exception as e:
+        logger.warning(f"yt-dlp download failed: {e}. Trying manual fallback...")
+
+        if "douyin" in url_value:
+            manual_path = await _download_douyin_manual(url_value, temp_dir, timestamp)
+            if manual_path:
+                return manual_path
+            raise ValueError(f"视频下载失败。yt-dlp 报错: {str(e)}")
+        elif "bilibili" in url_value:
+            manual_path = await _download_bilibili_manual(url_value, temp_dir, timestamp)
+            if manual_path:
+                return manual_path
+            raise ValueError(f"视频下载失败。yt-dlp 报错: {str(e)}")
+        else:
+            raise ValueError(f"视频下载失败: {str(e)}")
+
+
+def _download_yt_dlp(url_value: str, temp_dir: Path, timestamp: int) -> Path:
+    """yt-dlp 下载（阻塞调用，应在线程池中运行）"""
+    import yt_dlp
+    logger.info("Attempting download with yt-dlp...")
+
+    ydl_opts = {
+        'format': 'bestaudio/best',
+        'outtmpl': str(temp_dir / f"tool_download_{timestamp}_%(id)s.%(ext)s"),
+        'quiet': True,
+        'no_warnings': True,
+        'http_headers': {
+            'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.124 Safari/537.36',
+            'Referer': 'https://www.douyin.com/',
+        }
+    }
+
+    with yt_dlp.YoutubeDL() as ydl_raw:
+        ydl: Any = ydl_raw
+        ydl.params.update(ydl_opts)
+        info = ydl.extract_info(url_value, download=True)
+        if 'requested_downloads' in info:
+            downloaded_file = info['requested_downloads'][0]['filepath']
+        else:
+            ext = info.get('ext', 'mp4')
+            id = info.get('id')
+            downloaded_file = str(temp_dir / f"tool_download_{timestamp}_{id}.{ext}")
+
+        return Path(downloaded_file)
+
+
+async def _download_douyin_manual(url: str, temp_dir: Path, timestamp: int) -> Optional[Path]:
+    """手动下载抖音视频 (Fallback)"""
+    logger.info(f"[SuperIPAgent] Starting download for: {url}")
+
+    try:
+        headers = {
+            "user-agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/131.0.0.0 Safari/537.36"
+        }
+
+        async with httpx.AsyncClient(follow_redirects=True, timeout=10.0) as client:
+            resp = await client.get(url, headers=headers)
+            final_url = str(resp.url)
+
+        logger.info(f"[SuperIPAgent] Final URL: {final_url}")
+
+        modal_id = None
+        match = re.search(r'/video/(\d+)', final_url)
+        if match:
+            modal_id = match.group(1)
+
+        if not modal_id:
+            logger.error("[SuperIPAgent] Could not extract modal_id")
+            return None
+
+        logger.info(f"[SuperIPAgent] Extracted modal_id: {modal_id}")
+
+        target_url = f"https://www.douyin.com/user/MS4wLjABAAAAN_s_hups7LD0N4qnrM3o2gI0vuG3pozNaEolz2_py3cHTTrpVr1Z4dukFD9SOlwY?from_tab_name=main&modal_id={modal_id}"
+
+        from app.core.config import settings
+        if not settings.DOUYIN_COOKIE:
+            logger.warning("[SuperIPAgent] DOUYIN_COOKIE 未配置，视频下载可能失败")
+
+        headers_with_cookie = {
+            "accept": "text/html,application/xhtml+xml,application/xml;q=0.9,image/avif,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.7",
+            "cookie": settings.DOUYIN_COOKIE,
+            "user-agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/131.0.0.0 Safari/537.36",
+        }
+
+        logger.info(f"[SuperIPAgent] Requesting page with Cookie...")
+
+        async with httpx.AsyncClient(timeout=10.0) as client:
+            response = await client.get(target_url, headers=headers_with_cookie)
+
+        content_match = re.findall(r'<script id="RENDER_DATA" type="application/json">(.*?)</script>', response.text)
+        if not content_match:
+            if "SSR_HYDRATED_DATA" in response.text:
+                content_match = re.findall(r'<script id="SSR_HYDRATED_DATA" type="application/json">(.*?)</script>', response.text)
+
+        if not content_match:
+            logger.error(f"[SuperIPAgent] Could not find RENDER_DATA in page (len={len(response.text)})")
+            return None
+
+        content = unquote(content_match[0])
+        try:
+            data = json.loads(content)
+        except:
+            logger.error("[SuperIPAgent] JSON decode failed")
+            return None
+
+        video_url = None
+        try:
+            if "app" in data and "videoDetail" in data["app"]:
+                info = data["app"]["videoDetail"]["video"]
+                if "bitRateList" in info and info["bitRateList"]:
+                    video_url = info["bitRateList"][0]["playAddr"][0]["src"]
+                elif "playAddr" in info and info["playAddr"]:
+                    video_url = info["playAddr"][0]["src"]
+        except Exception as e:
+            logger.error(f"[SuperIPAgent] Path extraction failed: {e}")
+
+        if not video_url:
+            logger.error("[SuperIPAgent] No video_url found")
+            return None
+
+        if video_url.startswith("//"):
+            video_url = "https:" + video_url
+
+        logger.info(f"[SuperIPAgent] Found video URL: {video_url[:50]}...")
+
+        temp_path = temp_dir / f"douyin_manual_{timestamp}.mp4"
+        download_headers = {
+            'Referer': 'https://www.douyin.com/',
+            'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/131.0.0.0 Safari/537.36',
+        }
+
+        async with httpx.AsyncClient(timeout=60.0) as client:
+            async with client.stream("GET", video_url, headers=download_headers) as dl_resp:
+                if dl_resp.status_code == 200:
+                    with open(temp_path, 'wb') as f:
+                        async for chunk in dl_resp.aiter_bytes(chunk_size=8192):
+                            f.write(chunk)
+
+                    logger.info(f"[SuperIPAgent] Downloaded successfully: {temp_path}")
+                    return temp_path
+                else:
+                    logger.error(f"[SuperIPAgent] Download failed: {dl_resp.status_code}")
+                    return None
+
+    except Exception as e:
+        logger.error(f"[SuperIPAgent] Logic failed: {e}")
+        return None
+
+
+async def _download_bilibili_manual(url: str, temp_dir: Path, timestamp: int) -> Optional[Path]:
+    """手动下载 Bilibili 视频 (Playwright Fallback)"""
+    from playwright.async_api import async_playwright
+
+    logger.info(f"[Playwright] Starting Bilibili download for: {url}")
+
+    playwright = None
+    browser = None
+    try:
+        playwright = await async_playwright().start()
+        browser = await playwright.chromium.launch(headless=True, args=['--no-sandbox', '--disable-setuid-sandbox'])
+
+        context = await browser.new_context(
+            user_agent="Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36"
+        )
+
+        page = await context.new_page()
+
+        logger.info("[Playwright] Navigating to Bilibili...")
+        await page.goto(url, timeout=45000)
+
+        try:
+            await page.wait_for_selector('video', timeout=15000)
+        except:
+            logger.warning("[Playwright] Video selector timeout")
+
+        playinfo = await page.evaluate("window.__playinfo__")
+
+        audio_url = None
+
+        if playinfo and "data" in playinfo and "dash" in playinfo["data"]:
+            dash = playinfo["data"]["dash"]
+            if "audio" in dash and dash["audio"]:
+                audio_url = dash["audio"][0]["baseUrl"]
+                logger.info(f"[Playwright] Found audio stream in __playinfo__: {audio_url[:50]}...")
+
+        if not audio_url:
+            logger.warning("[Playwright] Could not find audio in __playinfo__")
+            return None
+
+        temp_path = temp_dir / f"bilibili_audio_{timestamp}.m4s"
+
+        try:
+            api_request = context.request
+            headers = {
+                "User-Agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
+                "Referer": "https://www.bilibili.com/"
+            }
+
+            logger.info(f"[Playwright] Downloading audio stream...")
+            response = await api_request.get(audio_url, headers=headers)
+
+            if response.status == 200:
+                body = await response.body()
+                with open(temp_path, 'wb') as f:
+                    f.write(body)
+
+                logger.info(f"[Playwright] Downloaded successfully: {temp_path}")
+                return temp_path
+            else:
+                logger.error(f"[Playwright] API Request failed: {response.status}")
+                return None
+
+        except Exception as e:
+            logger.error(f"[Playwright] Download logic error: {e}")
+            return None
+
+    except Exception as e:
+        logger.error(f"[Playwright] Bilibili download failed: {e}")
+        return None
+    finally:
+        if browser:
+            await browser.close()
+        if playwright:
+            await playwright.stop()
--- a/backend/app/modules/videos/router.py
+++ b/backend/app/modules/videos/router.py
@@ -27,13 +27,20 @@ async def generate_video(


@router.get("/tasks/{task_id}")
-async def get_task_status(task_id: str):
-    return success_response(get_task(task_id))
+async def get_task_status(task_id: str, current_user: dict = Depends(get_current_user)):
+    task = get_task(task_id)
+    # 验证任务归属：只能查看自己的任务
+    if task.get("status") != "not_found" and task.get("user_id") != current_user["id"]:
+        return success_response({"status": "not_found"})
+    return success_response(task)


@router.get("/tasks")
-async def list_tasks_view():
-    return success_response({"tasks": list_tasks()})
+async def list_tasks_view(current_user: dict = Depends(get_current_user)):
+    # 只返回当前用户的任务
+    all_tasks = list_tasks()
+    user_tasks = [t for t in all_tasks if t.get("user_id") == current_user["id"]]
+    return success_response({"tasks": user_tasks})


@router.get("/lipsync/health")
--- a/backend/app/modules/videos/schemas.py
+++ b/backend/app/modules/videos/schemas.py
@@ -1,19 +1,32 @@
 from pydantic import BaseModel
-from typing import Optional
+from typing import Optional, List
+
+
+class CustomAssignment(BaseModel):
+    material_path: str
+    start: float           # 音频时间轴起点
+    end: float             # 音频时间轴终点
+    source_start: float = 0.0  # 源视频截取起点


 class GenerateRequest(BaseModel):
    text: str
    voice: str = "zh-CN-YunxiNeural"
    material_path: str
+    material_paths: Optional[List[str]] = None
    tts_mode: str = "edgetts"
    ref_audio_id: Optional[str] = None
    ref_text: Optional[str] = None
+    language: str = "zh-CN"
+    generated_audio_id: Optional[str] = None  # 预生成配音 ID（存在时跳过内联 TTS）
    title: Optional[str] = None
    enable_subtitles: bool = True
    subtitle_style_id: Optional[str] = None
    title_style_id: Optional[str] = None
    subtitle_font_size: Optional[int] = None
    title_font_size: Optional[int] = None
+    title_top_margin: Optional[int] = None
+    subtitle_bottom_margin: Optional[int] = None
    bgm_id: Optional[str] = None
    bgm_volume: Optional[float] = 0.2
+    custom_assignments: Optional[List[CustomAssignment]] = None
--- a/backend/app/modules/videos/workflow.py
+++ b/backend/app/modules/videos/workflow.py
@@ -1,4 +1,4 @@
-from typing import Optional, Any
+from typing import Optional, Any, List
 from pathlib import Path
 import time
 import traceback
@@ -24,6 +24,17 @@ from .schemas import GenerateRequest
 from .task_store import task_store


+def _locale_to_whisper_lang(locale: str) -> str:
+    """'en-US' → 'en', 'zh-CN' → 'zh'"""
+    return locale.split("-")[0] if "-" in locale else locale
+
+
+def _locale_to_qwen_lang(locale: str) -> str:
+    """'zh-CN' → 'Chinese', 'en-US' → 'English', 其他 → 'Auto'"""
+    mapping = {"zh": "Chinese", "en": "English"}
+    return mapping.get(locale.split("-")[0], "Auto")
+
+
 _lipsync_service: Optional[LipSyncService] = None
 _lipsync_ready: Optional[bool] = None
 _lipsync_last_check: float = 0
@@ -79,26 +90,140 @@ def _update_task(task_id: str, **updates: Any) -> None:
    task_store.update(task_id, updates)


+# ── 多素材辅助函数 ──
+
+
+def _split_equal(segments: List[dict], material_paths: List[str]) -> List[dict]:
+    """按素材数量均分音频时长，对齐到最近的 Whisper 字边界。
+
+    Args:
+        segments: Whisper 产出的 segment 列表, 每个包含 words (字级时间戳)
+        material_paths: 素材路径列表
+
+    Returns:
+        [{"material_path": "...", "start": 0.0, "end": 5.2, "index": 0}, ...]
+    """
+    # 展平所有 Whisper 字符
+    all_chars: List[dict] = []
+    for seg in segments:
+        for w in seg.get("words", []):
+            all_chars.append(w)
+
+    n = len(material_paths)
+
+    if not all_chars or n == 0:
+        return [{"material_path": material_paths[0] if material_paths else "",
+                 "start": 0.0, "end": 99999.0, "index": 0}]
+
+    # 素材数不能超过字符数，否则边界会重复
+    if n > len(all_chars):
+        logger.warning(f"[MultiMat] 素材数({n}) > 字符数({len(all_chars)})，裁剪为 {len(all_chars)}")
+        n = len(all_chars)
+
+    total_start = all_chars[0]["start"]
+    total_end = all_chars[-1]["end"]
+    seg_dur = (total_end - total_start) / n
+
+    # 计算 N-1 个分割点，对齐到最近的字边界
+    boundaries = [0]  # 第一段从第 0 个字开始
+    for i in range(1, n):
+        target_time = total_start + i * seg_dur
+        # 找到 start 时间最接近 target_time 的字
+        best_idx = boundaries[-1] + 1  # 至少比上一个边界后移 1
+        best_diff = float("inf")
+        for j in range(boundaries[-1] + 1, len(all_chars)):
+            diff = abs(all_chars[j]["start"] - target_time)
+            if diff < best_diff:
+                best_diff = diff
+                best_idx = j
+            elif diff > best_diff:
+                break  # 时间递增，差值开始变大后可以停了
+        boundaries.append(min(best_idx, len(all_chars) - 1))
+    boundaries.append(len(all_chars))  # 最后一段到末尾
+
+    # 按边界生成分配结果
+    assignments: List[dict] = []
+    for i in range(n):
+        s_idx = boundaries[i]
+        e_idx = boundaries[i + 1]
+        if s_idx >= len(all_chars) or s_idx >= e_idx:
+            continue
+        assignments.append({
+            "material_path": material_paths[i],
+            "start": all_chars[s_idx]["start"],
+            "end": all_chars[e_idx - 1]["end"],
+            "text": "".join(c["word"] for c in all_chars[s_idx:e_idx]),
+            "index": len(assignments),
+        })
+
+    if not assignments:
+        return [{"material_path": material_paths[0], "start": 0.0, "end": 99999.0, "index": 0}]
+
+    logger.info(f"[MultiMat] 均分 {len(all_chars)} 字为 {len(assignments)} 段")
+    for a in assignments:
+        dur = a["end"] - a["start"]
+        logger.info(f"  段{a['index']}: [{a['start']:.2f}-{a['end']:.2f}s] ({dur:.1f}s) {a['text'][:20]}")
+
+    return assignments
+
+
 async def process_video_generation(task_id: str, req: GenerateRequest, user_id: str):
    temp_files = []
    try:
        start_time = time.time()
+
+        # ── 确定素材列表 ──
+        material_paths: List[str] = []
+        if req.material_paths and len(req.material_paths) > 1:
+            material_paths = req.material_paths
+        else:
+            material_paths = [req.material_path]
+
+        is_multi = len(material_paths) > 1
+
        _update_task(task_id, status="processing", progress=5, message="正在下载素材...")

        temp_dir = settings.UPLOAD_DIR / "temp"
        temp_dir.mkdir(parents=True, exist_ok=True)

-        input_material_path = temp_dir / f"{task_id}_input.mp4"
-        temp_files.append(input_material_path)
-
-        await _download_material(req.material_path, input_material_path)
+        # 单素材模式：下载主素材
+        if not is_multi:
+            input_material_path = temp_dir / f"{task_id}_input.mp4"
+            temp_files.append(input_material_path)
+            await _download_material(material_paths[0], input_material_path)

        _update_task(task_id, message="正在生成语音...", progress=10)

        audio_path = temp_dir / f"{task_id}_audio.wav"
        temp_files.append(audio_path)

-        if req.tts_mode == "voiceclone":
+        if req.generated_audio_id:
+            # 新流程：使用预生成的配音
+            _update_task(task_id, message="正在下载配音...", progress=12)
+            audio_url = await storage_service.get_signed_url(
+                bucket="generated-audios",
+                path=req.generated_audio_id,
+            )
+            await _download_material(audio_url, audio_path)
+
+            # 从元数据获取 language
+            meta_path = req.generated_audio_id.replace("_audio.wav", "_audio.json")
+            try:
+                meta_url = await storage_service.get_signed_url(
+                    bucket="generated-audios", path=meta_path,
+                )
+                import httpx as _httpx
+                async with _httpx.AsyncClient(timeout=5.0) as client:
+                    resp = await client.get(meta_url)
+                    if resp.status_code == 200:
+                        meta = resp.json()
+                        req.language = meta.get("language", req.language)
+                        if not req.text.strip():
+                            req.text = meta.get("text", req.text)
+            except Exception as e:
+                logger.warning(f"读取配音元数据失败: {e}")
+
+        elif req.tts_mode == "voiceclone":
            if not req.ref_audio_id or not req.ref_text:
                raise ValueError("声音克隆模式需要提供参考音频和参考文字")

@@ -119,7 +244,7 @@ async def process_video_generation(task_id: str, req: GenerateRequest, user_id:
                ref_audio_path=str(ref_audio_local),
                ref_text=req.ref_text,
                output_path=str(audio_path),
-                language="Chinese"
+                language=_locale_to_qwen_lang(req.language)
            )
        else:
            _update_task(task_id, message="正在生成语音 (EdgeTTS)...")
@@ -128,52 +253,237 @@ async def process_video_generation(task_id: str, req: GenerateRequest, user_id:

        tts_time = time.time() - start_time
        print(f"[Pipeline] TTS completed in {tts_time:.1f}s")
-        _update_task(task_id, progress=25)
-
-        _update_task(task_id, message="正在合成唇形 (LatentSync)...", progress=30)

        lipsync = _get_lipsync_service()
        lipsync_video_path = temp_dir / f"{task_id}_lipsync.mp4"
        temp_files.append(lipsync_video_path)

-        lipsync_start = time.time()
-        is_ready = await _check_lipsync_ready()
-
-        if is_ready:
-            print(f"[LipSync] Starting LatentSync inference...")
-            _update_task(task_id, progress=35, message="正在运行 LatentSync 推理...")
-            await lipsync.generate(str(input_material_path), str(audio_path), str(lipsync_video_path))
-        else:
-            print(f"[LipSync] LatentSync not ready, copying original video")
-            _update_task(task_id, message="唇形同步不可用，使用原始视频...")
-            import shutil
-            shutil.copy(str(input_material_path), lipsync_video_path)
-
-        lipsync_time = time.time() - lipsync_start
-        print(f"[Pipeline] LipSync completed in {lipsync_time:.1f}s")
-        _update_task(task_id, progress=80)
-
+        video = VideoService()
        captions_path = None
-        if req.enable_subtitles:
-            _update_task(task_id, message="正在生成字幕 (Whisper)...", progress=82)

-            captions_path = temp_dir / f"{task_id}_captions.json"
-            temp_files.append(captions_path)
+        if is_multi:
+            # ══════════════════════════════════════
+            # 多素材流水线
+            # ══════════════════════════════════════
+            _update_task(task_id, progress=12, message="正在分配素材...")

-            try:
-                await whisper_service.align(
-                    audio_path=str(audio_path),
-                    text=req.text,
-                    output_path=str(captions_path)
+            if req.custom_assignments:
+                # 用户自定义分配，跳过 Whisper 均分
+                assignments = [
+                    {
+                        "material_path": a.material_path,
+                        "start": a.start,
+                        "end": a.end,
+                        "source_start": a.source_start,
+                        "index": i,
+                    }
+                    for i, a in enumerate(req.custom_assignments)
+                ]
+                # 仍然需要 Whisper 生成字幕（如果启用）
+                captions_path = temp_dir / f"{task_id}_captions.json"
+                temp_files.append(captions_path)
+                if req.enable_subtitles:
+                    _update_task(task_id, message="正在生成字幕 (Whisper)...")
+                    try:
+                        await whisper_service.align(
+                            audio_path=str(audio_path),
+                            text=req.text,
+                            output_path=str(captions_path),
+                            language=_locale_to_whisper_lang(req.language),
+                        )
+                        print(f"[Pipeline] Whisper alignment completed (custom assignments)")
+                    except Exception as e:
+                        logger.warning(f"Whisper alignment failed: {e}")
+                        captions_path = None
+                else:
+                    captions_path = None
+            else:
+                # 原有逻辑：Whisper → _split_equal
+                _update_task(task_id, message="正在生成字幕 (Whisper)...")
+
+                captions_path = temp_dir / f"{task_id}_captions.json"
+                temp_files.append(captions_path)
+
+                try:
+                    captions_data = await whisper_service.align(
+                        audio_path=str(audio_path),
+                        text=req.text,
+                        output_path=str(captions_path),
+                        language=_locale_to_whisper_lang(req.language),
+                    )
+                    print(f"[Pipeline] Whisper alignment completed (multi-material)")
+                except Exception as e:
+                    logger.warning(f"Whisper alignment failed: {e}")
+                    captions_data = None
+                    captions_path = None
+
+                _update_task(task_id, progress=15, message="正在分配素材...")
+
+                if captions_data and captions_data.get("segments"):
+                    assignments = _split_equal(captions_data["segments"], material_paths)
+                else:
+                    # Whisper 失败 → 按时长均分（不依赖字符对齐）
+                    logger.warning("[MultiMat] Whisper 无数据，按时长均分")
+                    audio_dur = video._get_duration(str(audio_path))
+                    if audio_dur <= 0:
+                        audio_dur = 30.0  # 安全兜底
+                    seg_dur = audio_dur / len(material_paths)
+                    assignments = [
+                        {"material_path": material_paths[i], "start": i * seg_dur,
+                         "end": (i + 1) * seg_dur, "index": i}
+                        for i in range(len(material_paths))
+                    ]
+
+            # 扩展段覆盖完整音频范围：首段从0开始，末段到音频结尾
+            audio_duration = video._get_duration(str(audio_path))
+            if assignments and audio_duration > 0:
+                assignments[0]["start"] = 0.0
+                assignments[-1]["end"] = audio_duration
+
+            num_segments = len(assignments)
+            print(f"[Pipeline] Multi-material: {num_segments} segments, {len(material_paths)} materials")
+
+            if num_segments == 0:
+                raise RuntimeError("Multi-material: no valid segments after splitting")
+
+            lipsync_start = time.time()
+
+            # ── 第一步：下载所有素材并检测分辨率 ──
+            material_locals: List[Path] = []
+            resolutions = []
+
+            for i, assignment in enumerate(assignments):
+                material_local = temp_dir / f"{task_id}_material_{i}.mp4"
+                temp_files.append(material_local)
+                await _download_material(assignment["material_path"], material_local)
+                material_locals.append(material_local)
+                resolutions.append(video.get_resolution(str(material_local)))
+
+            # 分辨率不一致时，统一到第一个素材的分辨率
+            base_res = resolutions[0] if resolutions else (0, 0)
+            need_scale = any(r != base_res for r in resolutions) and base_res[0] > 0
+            if need_scale:
+                logger.info(f"[MultiMat] 素材分辨率不一致，统一到 {base_res[0]}x{base_res[1]}")
+
+            # ── 第二步：裁剪每段素材到对应时长 ──
+            prepared_segments: List[Path] = []
+
+            for i, assignment in enumerate(assignments):
+                seg_progress = 15 + int((i / num_segments) * 30)  # 15% → 45%
+                seg_dur = assignment["end"] - assignment["start"]
+                _update_task(
+                    task_id,
+                    progress=seg_progress,
+                    message=f"正在准备素材 {i+1}/{num_segments}..."
                )
-                print(f"[Pipeline] Whisper alignment completed")
-            except Exception as e:
-                logger.warning(f"Whisper alignment failed, skipping subtitles: {e}")
+
+                prepared_path = temp_dir / f"{task_id}_prepared_{i}.mp4"
+                temp_files.append(prepared_path)
+                video.prepare_segment(
+                    str(material_locals[i]), seg_dur, str(prepared_path),
+                    target_resolution=base_res if need_scale else None,
+                    source_start=assignment.get("source_start", 0.0),
+                )
+                prepared_segments.append(prepared_path)
+
+            # ── 第二步：拼接所有素材片段 ──
+            _update_task(task_id, progress=50, message="正在拼接素材片段...")
+            concat_path = temp_dir / f"{task_id}_concat.mp4"
+            temp_files.append(concat_path)
+            video.concat_videos(
+                [str(p) for p in prepared_segments],
+                str(concat_path)
+            )
+
+            # ── 第三步：一次 LatentSync 推理 ──
+            is_ready = await _check_lipsync_ready()
+
+            if is_ready:
+                _update_task(task_id, progress=55, message="正在合成唇形 (LatentSync)...")
+                print(f"[LipSync] Multi-material: single LatentSync on concatenated video")
+                try:
+                    await lipsync.generate(str(concat_path), str(audio_path), str(lipsync_video_path))
+                except Exception as e:
+                    logger.warning(f"[LipSync] Failed, fallback to concat without lipsync: {e}")
+                    import shutil
+                    shutil.copy(str(concat_path), str(lipsync_video_path))
+            else:
+                print(f"[LipSync] Not ready, using concatenated video without lipsync")
+                import shutil
+                shutil.copy(str(concat_path), str(lipsync_video_path))
+
+            lipsync_time = time.time() - lipsync_start
+            print(f"[Pipeline] Multi-material prepare + concat + LipSync completed in {lipsync_time:.1f}s")
+            _update_task(task_id, progress=80)
+
+            # 如果用户关闭了字幕，清除 captions_path（Whisper 仅用于句子切分）
+            if not req.enable_subtitles:
                captions_path = None

+        else:
+            # ══════════════════════════════════════
+            # 单素材流水线（原有逻辑）
+            # ══════════════════════════════════════
+
+            # 单素材 + source_start：先截取片段
+            single_source_start = 0.0
+            if req.custom_assignments and len(req.custom_assignments) == 1:
+                single_source_start = req.custom_assignments[0].source_start
+
+            if single_source_start > 0:
+                _update_task(task_id, progress=20, message="正在截取素材片段...")
+                audio_dur = video._get_duration(str(audio_path))
+                if audio_dur <= 0:
+                    audio_dur = 30.0
+                trimmed_path = temp_dir / f"{task_id}_trimmed.mp4"
+                temp_files.append(trimmed_path)
+                video.prepare_segment(
+                    str(input_material_path), audio_dur, str(trimmed_path),
+                    source_start=single_source_start,
+                )
+                input_material_path = trimmed_path
+
+            _update_task(task_id, progress=25)
+            _update_task(task_id, message="正在合成唇形 (LatentSync)...", progress=30)
+
+            lipsync_start = time.time()
+            is_ready = await _check_lipsync_ready()
+
+            if is_ready:
+                print(f"[LipSync] Starting LatentSync inference...")
+                _update_task(task_id, progress=35, message="正在运行 LatentSync 推理...")
+                await lipsync.generate(str(input_material_path), str(audio_path), str(lipsync_video_path))
+            else:
+                print(f"[LipSync] LatentSync not ready, copying original video")
+                _update_task(task_id, message="唇形同步不可用，使用原始视频...")
+                import shutil
+                shutil.copy(str(input_material_path), lipsync_video_path)
+
+            lipsync_time = time.time() - lipsync_start
+            print(f"[Pipeline] LipSync completed in {lipsync_time:.1f}s")
+            _update_task(task_id, progress=80)
+
+            # 单素材模式：Whisper 在 LatentSync 之后
+            if req.enable_subtitles:
+                _update_task(task_id, message="正在生成字幕 (Whisper)...", progress=82)
+
+                captions_path = temp_dir / f"{task_id}_captions.json"
+                temp_files.append(captions_path)
+
+                try:
+                    await whisper_service.align(
+                        audio_path=str(audio_path),
+                        text=req.text,
+                        output_path=str(captions_path),
+                        language=_locale_to_whisper_lang(req.language),
+                    )
+                    print(f"[Pipeline] Whisper alignment completed")
+                except Exception as e:
+                    logger.warning(f"Whisper alignment failed, skipping subtitles: {e}")
+                    captions_path = None
+
        _update_task(task_id, progress=85)

-        video = VideoService()
        final_audio_path = audio_path
        if req.bgm_id:
            _update_task(task_id, message="正在合成背景音乐...", progress=86)
@@ -216,6 +526,16 @@ async def process_video_generation(task_id: str, req: GenerateRequest, user_id:
                title_style = {}
            title_style["font_size"] = int(req.title_font_size)

+        if req.title_top_margin is not None and req.title:
+            if title_style is None:
+                title_style = {}
+            title_style["top_margin"] = int(req.title_top_margin)
+
+        if req.subtitle_bottom_margin is not None and req.enable_subtitles:
+            if subtitle_style is None:
+                subtitle_style = {}
+            subtitle_style["bottom_margin"] = int(req.subtitle_bottom_margin)
+
        if use_remotion:
            subtitle_style = prepare_style_for_remotion(
                subtitle_style,
@@ -277,14 +597,12 @@ async def process_video_generation(task_id: str, req: GenerateRequest, user_id:
        _update_task(task_id, message="正在上传结果...", progress=95)

        storage_path = f"{user_id}/{task_id}_output.mp4"
-        with open(final_output_local_path, "rb") as f:
-            file_data = f.read()
-            await storage_service.upload_file(
-                bucket=storage_service.BUCKET_OUTPUTS,
-                path=storage_path,
-                file_data=file_data,
-                content_type="video/mp4"
-            )
+        await storage_service.upload_file_from_path(
+            bucket=storage_service.BUCKET_OUTPUTS,
+            storage_path=storage_path,
+            local_file_path=str(final_output_local_path),
+            content_type="video/mp4"
+        )

        signed_url = await storage_service.get_signed_url(
            bucket=storage_service.BUCKET_OUTPUTS,
--- a/backend/app/services/glm_service.py
+++ b/backend/app/services/glm_service.py
@@ -43,6 +43,7 @@ class GLMService:
 要求：
 1. 标题要简洁有力，能吸引观众点击，不超过10个字
 2. 标签要与内容相关，便于搜索和推荐，只要3个
+3. 标题和标签必须使用与口播文案相同的语言（如文案是英文就用英文，日文就用日文）

 请严格按以下JSON格式返回（不要包含其他内容）：
 {{"title": "标题", "tags": ["标签1", "标签2", "标签3"]}}"""
@@ -51,7 +52,10 @@ class GLMService:
            client = self._get_client()
            logger.info(f"Calling GLM API with model: {settings.GLM_MODEL}")
            
-            response = client.chat.completions.create(
+            # 使用 asyncio.to_thread 包装同步 SDK 调用，避免阻塞事件循环
+            import asyncio
+            response = await asyncio.to_thread(
+                client.chat.completions.create,
                model=settings.GLM_MODEL,
                messages=[{"role": "user", "content": prompt}],
                thinking={"type": "disabled"},  # 禁用思考模式，加快响应
@@ -96,7 +100,10 @@ class GLMService:
            client = self._get_client()
            logger.info(f"Using GLM to rewrite script")

-            response = client.chat.completions.create(
+            # 使用 asyncio.to_thread 包装同步 SDK 调用，避免阻塞事件循环
+            import asyncio
+            response = await asyncio.to_thread(
+                client.chat.completions.create,
                model=settings.GLM_MODEL,
                messages=[{"role": "user", "content": prompt}],
                thinking={"type": "disabled"},
@@ -114,6 +121,49 @@ class GLMService:



+    async def translate_text(self, text: str, target_lang: str) -> str:
+        """
+        将文案翻译为指定语言
+
+        Args:
+            text: 原始文案
+            target_lang: 目标语言（如 English, 日本語 等）
+
+        Returns:
+            翻译后的文案
+        """
+        prompt = f"""请将以下文案翻译为{target_lang}。
+
+原文：
+{text}
+
+要求：
+1. 只返回翻译后的文案，不要添加任何解释或说明
+2. 保持原文的语气和风格
+3. 翻译要自然流畅，符合目标语言的表达习惯"""
+
+        try:
+            client = self._get_client()
+            logger.info(f"Using GLM to translate text to {target_lang}")
+
+            import asyncio
+            response = await asyncio.to_thread(
+                client.chat.completions.create,
+                model=settings.GLM_MODEL,
+                messages=[{"role": "user", "content": prompt}],
+                thinking={"type": "disabled"},
+                max_tokens=2000,
+                temperature=0.3
+            )
+
+            content = response.choices[0].message.content
+            logger.info("GLM translation completed")
+            return content.strip()
+
+        except Exception as e:
+            logger.error(f"GLM translate error: {e}")
+            raise Exception(f"AI 翻译失败: {str(e)}")
+
    def _parse_json_response(self, content: str) -> dict:
        """解析 GLM 返回的 JSON 内容"""
        # 尝试直接解析
--- a/backend/app/services/lipsync_service.py
+++ b/backend/app/services/lipsync_service.py
@@ -398,18 +398,23 @@ class LipSyncService:
            raise e

    async def _local_generate_subprocess(self, video_path: str, audio_path: str, output_path: str) -> str:
-        """原有的 subprocess 逻辑提取为独立方法"""
-        logger.info("🔄 调用 LatentSync 推理 (subprocess)...")
-        # ... (此处仅为占位符提示，实际代码需要调整结构以避免重复，
-        # 但鉴于原有 _local_generate 的结构，最简单的方法是在 _local_generate 内部做判断，
-        # 如果 use_server 失败，可以 retry 或者 _local_generate 不做拆分，直接在里面写逻辑)
-        # 为了最小化改动且保持安全，上面的 _call_persistent_server 如果失败，
-        # 最好不要自动回退（可能导致双重资源消耗），而是直接报错让用户检查服务。
-        # 但为了用户体验，我们可以允许回退。
-        # *修正策略*: 
-        # 我将不拆分 _local_generate_subprocess，而是将 subprocess 逻辑保留在 _local_generate 的后半部分。
-        # 如果 self.use_server 为 True，先尝试调用 server，成功则 return，失败则继续往下走。
-        pass 
+        """
+        原有的 subprocess 回退逻辑
+        
+        注意：subprocess 回退已被禁用，原因如下：
+        1. subprocess 模式需要重新加载模型，消耗大量时间和显存
+        2. 如果常驻服务不可用，应该让用户知道并修复服务，而非静默回退
+        3. 避免双重资源消耗导致的 GPU OOM
+        
+        如果常驻服务不可用，请检查：
+        - 服务是否启动: python scripts/server.py (在 models/LatentSync 目录)
+        - 端口是否被占用: lsof -i:8007
+        - GPU 显存是否充足: nvidia-smi
+        """
+        raise RuntimeError(
+            "LatentSync 常驻服务不可用，无法进行唇形同步。"
+            "请确保 LatentSync 服务已启动 (cd models/LatentSync && python scripts/server.py)"
+        )
    
    async def _remote_generate(
        self, 
--- a/backend/app/services/publish_service.py
+++ b/backend/app/services/publish_service.py
@@ -18,18 +18,19 @@ from app.services.storage import storage_service
 from .uploader.bilibili_uploader import BilibiliUploader
 from .uploader.douyin_uploader import DouyinUploader
 from .uploader.xiaohongshu_uploader import XiaohongshuUploader
+from .uploader.weixin_uploader import WeixinUploader


 class PublishService:
    """Social media publishing service (with user isolation)"""

    # 支持的平台配置
-    PLATFORMS: Dict[str, Dict[str, Any]] = {
-        "douyin": {"name": "抖音", "url": "https://creator.douyin.com/", "enabled": True},
-        "weixin": {"name": "微信视频号", "url": "https://channels.weixin.qq.com/", "enabled": False},
-        "bilibili": {"name": "B站", "url": "https://member.bilibili.com/platform/upload/video/frame", "enabled": True},
-        "xiaohongshu": {"name": "小红书", "url": "https://creator.xiaohongshu.com/", "enabled": True},
-    }
+    PLATFORMS: Dict[str, Dict[str, Any]] = {
+        "douyin": {"name": "抖音", "url": "https://creator.douyin.com/", "enabled": True},
+        "weixin": {"name": "微信视频号", "url": "https://channels.weixin.qq.com/", "enabled": True},
+        "bilibili": {"name": "B站", "url": "https://member.bilibili.com/platform/upload/video/frame", "enabled": True},
+        "xiaohongshu": {"name": "小红书", "url": "https://creator.xiaohongshu.com/", "enabled": True},
+    }
    
    def __init__(self) -> None:
        # 存储活跃的登录会话，用于跟踪登录状态
@@ -181,7 +182,8 @@ class PublishService:
                    tags=tags,
                    publish_date=publish_time,
                    account_file=str(account_file),
-                    description=description
+                    description=description,
+                    user_id=user_id,
                )
            elif platform == "xiaohongshu":
                uploader = XiaohongshuUploader(
@@ -192,6 +194,16 @@ class PublishService:
                    account_file=str(account_file),
                    description=description
                )
+            elif platform == "weixin":
+                uploader = WeixinUploader(
+                    title=title,
+                    file_path=local_video_path,
+                    tags=tags,
+                    publish_date=publish_time,
+                    account_file=str(account_file),
+                    description=description,
+                    user_id=user_id,
+                )
            else:
                logger.warning(f"[发布] {platform} 上传功能尚未实现")
                return {
@@ -224,30 +236,38 @@ class PublishService:
    async def login(self, platform: str, user_id: Optional[str] = None) -> Dict[str, Any]:
        """
        启动QR码登录流程
-        
+
        Args:
            platform: 平台 ID
            user_id: 用户 ID (用于 Cookie 隔离)
-        
+
        Returns:
            dict: 包含二维码base64图片
        """
        if platform not in self.PLATFORMS:
            return {"success": False, "message": "不支持的平台"}
-        
+
        try:
            from .qr_login_service import QRLoginService
-            
+
            # 获取用户专属的 Cookie 目录
            cookies_dir = self._get_cookies_dir(user_id)
-            
+
+            # 清理旧的活跃会话（避免残留会话干扰新登录）
+            session_key = self._get_session_key(platform, user_id)
+            if session_key in self.active_login_sessions:
+                old_service = self.active_login_sessions.pop(session_key)
+                try:
+                    await old_service._cleanup()
+                except Exception:
+                    pass
+
            # 创建QR登录服务
            qr_service = QRLoginService(platform, cookies_dir)
-            
+
            # 存储活跃会话 (带用户隔离)
-            session_key = self._get_session_key(platform, user_id)
            self.active_login_sessions[session_key] = qr_service
-            
+
            # 启动登录并获取二维码
            result = await qr_service.start_login()
            
@@ -261,27 +281,28 @@ class PublishService:
            }

    def get_login_session_status(self, platform: str, user_id: Optional[str] = None) -> Dict[str, Any]:
-        """获取活跃登录会话的状态"""
+        """获取活跃登录会话的状态（仅用于扫码轮询）"""
        session_key = self._get_session_key(platform, user_id)
-        
-        # 1. 如果有活跃的扫码会话，优先检查它
+
+        # 只检查活跃的扫码会话，不检查 Cookie 文件
+        # Cookie 文件检查会导致"重新登录"时误判为已登录
        if session_key in self.active_login_sessions:
            qr_service = self.active_login_sessions[session_key]
            status = qr_service.get_login_status()
-            
+
            # 如果登录成功且Cookie已保存，清理会话
            if status["success"] and status["cookies_saved"]:
                del self.active_login_sessions[session_key]
                return {"success": True, "message": "登录成功"}
-            
-            return {"success": False, "message": "等待扫码..."}
-            
-        # 2. 检查本地Cookie文件是否存在
-        cookie_file = self._get_cookie_path(platform, user_id)
-        if cookie_file.exists():
-             return {"success": True, "message": "已登录 (历史状态)"}
-             
-        return {"success": False, "message": "未登录"}
+
+            # 刷脸验证：传递新二维码给前端
+            result: Dict[str, Any] = {"success": False, "message": "等待扫码..."}
+            if status.get("face_verify_qr"):
+                result["face_verify_qr"] = status["face_verify_qr"]
+            return result
+
+        # 没有活跃会话 → 返回 False（前端不应在无会话时轮询）
+        return {"success": False, "message": "无活跃登录会话"}
    
    def logout(self, platform: str, user_id: Optional[str] = None) -> Dict[str, Any]:
        """
--- a/backend/app/services/qr_login_service.py
+++ b/backend/app/services/qr_login_service.py
--- a/backend/app/services/remotion_service.py
+++ b/backend/app/services/remotion_service.py
@@ -5,6 +5,7 @@ Remotion 视频渲染服务

 import asyncio
 import json
+import os
 import subprocess
 from pathlib import Path
 from typing import Optional
@@ -52,13 +53,21 @@ class RemotionService:
            输出视频路径
        """
        # 构建命令参数
-        cmd = [
-            "npx", "ts-node", "render.ts",
+        # 优先使用预编译的 JS 文件（更快），如果不存在则回退到 ts-node
+        compiled_js = self.remotion_dir / "dist" / "render.js"
+        if compiled_js.exists():
+            cmd = ["node", "dist/render.js"]
+            logger.info("Using pre-compiled render.js for faster startup")
+        else:
+            cmd = ["npx", "ts-node", "render.ts"]
+            logger.warning("Using ts-node (slower). Run 'npm run build:render' to compile for faster startup.")
+        
+        cmd.extend([
            "--video", str(video_path),
            "--output", str(output_path),
            "--fps", str(fps),
            "--enableSubtitles", str(enable_subtitles).lower()
-        ]
+        ])

        if captions_path:
            cmd.extend(["--captions", str(captions_path)])
@@ -106,6 +115,16 @@ class RemotionService:
            process.wait()

            if process.returncode != 0:
+                # Remotion 渲染可能在完成输出后进程崩溃 (如 SIGABRT code -6)
+                # 如果输出文件已存在且大小合理，视为成功
+                output_file = Path(output_path)
+                if output_file.exists() and output_file.stat().st_size > 1024:
+                    logger.warning(
+                        f"Remotion process exited with code {process.returncode}, "
+                        f"but output file exists ({output_file.stat().st_size} bytes). Treating as success."
+                    )
+                    return output_path
+
                error_msg = "\n".join(output_lines[-20:])  # 最后 20 行
                raise RuntimeError(f"Remotion render failed (code {process.returncode}):\n{error_msg}")

--- a/backend/app/services/storage.py
+++ b/backend/app/services/storage.py
@@ -7,9 +7,12 @@ from pathlib import Path
 import asyncio
 import functools
 import os
+import shutil

-# Supabase Storage 本地存储根目录
-SUPABASE_STORAGE_LOCAL_PATH = Path("/home/rongye/ProgramFiles/Supabase/volumes/storage/stub/stub")
+# Supabase Storage 本地存储根目录（从环境变量读取，支持不同部署环境）
+# Supabase Storage 本地存储根目录（从环境变量读取，支持不同部署环境）
+_default_storage_path = "/var/lib/supabase/storage" # 生产环境默认路径
+SUPABASE_STORAGE_LOCAL_PATH = Path(os.getenv("SUPABASE_STORAGE_LOCAL_PATH", _default_storage_path))

 class StorageService:
    def __init__(self):
@@ -17,12 +20,13 @@ class StorageService:
        self.BUCKET_MATERIALS = "materials"
        self.BUCKET_OUTPUTS = "outputs"
        self.BUCKET_REF_AUDIOS = "ref-audios"
+        self.BUCKET_GENERATED_AUDIOS = "generated-audios"
        # 确保所有 bucket 存在
        self._ensure_buckets()

    def _ensure_buckets(self):
        """确保所有必需的 bucket 存在"""
-        buckets = [self.BUCKET_MATERIALS, self.BUCKET_OUTPUTS, self.BUCKET_REF_AUDIOS]
+        buckets = [self.BUCKET_MATERIALS, self.BUCKET_OUTPUTS, self.BUCKET_REF_AUDIOS, self.BUCKET_GENERATED_AUDIOS]
        try:
            existing = self.supabase.storage.list_buckets()
            existing_names = {b.name for b in existing} if existing else set()
@@ -100,6 +104,45 @@ class StorageService:
            logger.error(f"Storage upload failed: {e}")
            raise e

+    async def upload_file_from_path(self, bucket: str, storage_path: str, local_file_path: str, content_type: str) -> str:
+        """
+        从本地文件路径上传文件到 Supabase Storage
+        
+        使用分块读取减少内存峰值，避免大文件整读入内存
+        
+        Args:
+            bucket: 存储桶名称
+            storage_path: Storage 中的目标路径
+            local_file_path: 本地文件的绝对路径
+            content_type: MIME 类型
+        """
+        local_file = Path(local_file_path)
+        if not local_file.exists():
+            raise FileNotFoundError(f"本地文件不存在: {local_file_path}")
+        
+        loop = asyncio.get_running_loop()
+        file_size = local_file.stat().st_size
+        
+        # 分块读取文件，避免大文件整读入内存
+        # 虽然最终还是需要拼接成 bytes 传给 SDK，但分块读取可以减少 IO 压力
+        def read_file_chunked():
+            chunks = []
+            chunk_size = 10 * 1024 * 1024  # 10MB per chunk
+            with open(local_file_path, "rb") as f:
+                while True:
+                    chunk = f.read(chunk_size)
+                    if not chunk:
+                        break
+                    chunks.append(chunk)
+            return b"".join(chunks)
+        
+        if file_size > 50 * 1024 * 1024:  # 大于 50MB 记录日志
+            logger.info(f"大文件上传: {file_size / 1024 / 1024:.1f}MB")
+        
+        file_data = await loop.run_in_executor(None, read_file_chunked)
+        
+        return await self.upload_file(bucket, storage_path, file_data, content_type)
+
    async def get_signed_url(self, bucket: str, path: str, expires_in: int = 3600) -> str:
        """异步获取签名访问链接"""
        try:
@@ -139,8 +182,8 @@ class StorageService:
            logger.error(f"Get public URL failed: {e}")
            return ""

-    async def delete_file(self, bucket: str, path: str):
-        """异步删除文件"""
+    async def delete_file(self, bucket: str, path: str):
+        """异步删除文件"""
        try:
            loop = asyncio.get_running_loop()
            await loop.run_in_executor(
@@ -149,21 +192,21 @@ class StorageService:
            )
            logger.info(f"Deleted file: {bucket}/{path}")
        except Exception as e:
-            logger.error(f"Delete file failed: {e}")
-            pass
-
-    async def move_file(self, bucket: str, from_path: str, to_path: str):
-        """异步移动/重命名文件"""
-        try:
-            loop = asyncio.get_running_loop()
-            await loop.run_in_executor(
-                None,
-                lambda: self.supabase.storage.from_(bucket).move(from_path, to_path)
-            )
-            logger.info(f"Moved file: {bucket}/{from_path} -> {to_path}")
-        except Exception as e:
-            logger.error(f"Move file failed: {e}")
-            raise e
+            logger.error(f"Delete file failed: {e}")
+            pass
+
+    async def move_file(self, bucket: str, from_path: str, to_path: str):
+        """异步移动/重命名文件"""
+        try:
+            loop = asyncio.get_running_loop()
+            await loop.run_in_executor(
+                None,
+                lambda: self.supabase.storage.from_(bucket).move(from_path, to_path)
+            )
+            logger.info(f"Moved file: {bucket}/{from_path} -> {to_path}")
+        except Exception as e:
+            logger.error(f"Move file failed: {e}")
+            raise e

    async def list_files(self, bucket: str, path: str) -> List[Any]:
        """异步列出文件"""
--- a/backend/app/services/uploader/init.py
+++ b/backend/app/services/uploader/init.py
@@ -4,6 +4,7 @@ Platform uploader base classes and utilities
 from .base_uploader import BaseUploader
 from .bilibili_uploader import BilibiliUploader
 from .douyin_uploader import DouyinUploader
-from .xiaohongshu_uploader import XiaohongshuUploader
+from .xiaohongshu_uploader import XiaohongshuUploader
+from .weixin_uploader import WeixinUploader

-__all__ = ['BaseUploader', 'BilibiliUploader', 'DouyinUploader', 'XiaohongshuUploader']
+__all__ = ['BaseUploader', 'BilibiliUploader', 'DouyinUploader', 'XiaohongshuUploader', 'WeixinUploader']
--- a/backend/app/services/uploader/douyin_uploader.py
+++ b/backend/app/services/uploader/douyin_uploader.py
--- a/backend/app/services/uploader/weixin_uploader.py
+++ b/backend/app/services/uploader/weixin_uploader.py
--- a/backend/app/services/video_service.py
+++ b/backend/app/services/video_service.py
@@ -1,10 +1,10 @@
 """
 视频合成服务
 """
-import os
-import subprocess
-import json
-import shlex
+import os
+import subprocess
+import json
+import shlex
 from pathlib import Path
 from loguru import logger
 from typing import Optional
@@ -13,18 +13,18 @@ class VideoService:
    def __init__(self):
        pass

-    def _run_ffmpeg(self, cmd: list) -> bool:
-        cmd_str = ' '.join(shlex.quote(str(c)) for c in cmd)
-        logger.debug(f"FFmpeg CMD: {cmd_str}")
-        try:
-            # Synchronous call for BackgroundTasks compatibility
-            result = subprocess.run(
-                cmd,
-                shell=False,
-                capture_output=True,
-                text=True,
-                encoding='utf-8',
-            )
+    def _run_ffmpeg(self, cmd: list) -> bool:
+        cmd_str = ' '.join(shlex.quote(str(c)) for c in cmd)
+        logger.debug(f"FFmpeg CMD: {cmd_str}")
+        try:
+            # Synchronous call for BackgroundTasks compatibility
+            result = subprocess.run(
+                cmd,
+                shell=False,
+                capture_output=True,
+                text=True,
+                encoding='utf-8',
+            )
            if result.returncode != 0:
                logger.error(f"FFmpeg Error: {result.stderr}")
                return False
@@ -33,51 +33,56 @@ class VideoService:
            logger.error(f"FFmpeg Exception: {e}")
            return False

-    def _get_duration(self, file_path: str) -> float:
-        # Synchronous call for BackgroundTasks compatibility
-        cmd = f'ffprobe -v error -show_entries format=duration -of default=noprint_wrappers=1:nokey=1 "{file_path}"'
+    def _get_duration(self, file_path: str) -> float:
+        # Synchronous call for BackgroundTasks compatibility
+        # 使用参数列表形式避免 shell=True 的命令注入风险
+        cmd = [
+            'ffprobe', '-v', 'error',
+            '-show_entries', 'format=duration',
+            '-of', 'default=noprint_wrappers=1:nokey=1',
+            file_path
+        ]
        try:
            result = subprocess.run(
                cmd,
-                shell=True,
                capture_output=True,
                text=True,
            )
            return float(result.stdout.strip())
        except Exception:
-            return 0.0
-
-    def mix_audio(
-        self,
-        voice_path: str,
-        bgm_path: str,
-        output_path: str,
-        bgm_volume: float = 0.2
-    ) -> str:
-        """混合人声与背景音乐"""
-        Path(output_path).parent.mkdir(parents=True, exist_ok=True)
-
-        volume = max(0.0, min(float(bgm_volume), 1.0))
-        filter_complex = (
-            f"[0:a]volume=1.0[a0];"
-            f"[1:a]volume={volume}[a1];"
-            f"[a0][a1]amix=inputs=2:duration=first:dropout_transition=2:normalize=0[aout]"
-        )
-
-        cmd = [
-            "ffmpeg", "-y",
-            "-i", voice_path,
-            "-stream_loop", "-1", "-i", bgm_path,
-            "-filter_complex", filter_complex,
-            "-map", "[aout]",
-            "-c:a", "pcm_s16le",
-            "-shortest",
-            output_path,
-        ]
-
-        if self._run_ffmpeg(cmd):
-            return output_path
-        raise RuntimeError("FFmpeg audio mix failed")
+            return 0.0
+
+    def mix_audio(
+        self,
+        voice_path: str,
+        bgm_path: str,
+        output_path: str,
+        bgm_volume: float = 0.2
+    ) -> str:
+        """混合人声与背景音乐"""
+        Path(output_path).parent.mkdir(parents=True, exist_ok=True)
+
+        volume = max(0.0, min(float(bgm_volume), 1.0))
+        filter_complex = (
+            f"[0:a]volume=1.0[a0];"
+            f"[1:a]volume={volume}[a1];"
+            f"[a0][a1]amix=inputs=2:duration=first:dropout_transition=2:normalize=0[aout]"
+        )
+
+        cmd = [
+            "ffmpeg", "-y",
+            "-i", voice_path,
+            "-stream_loop", "-1", "-i", bgm_path,
+            "-filter_complex", filter_complex,
+            "-map", "[aout]",
+            "-c:a", "pcm_s16le",
+            "-shortest",
+            output_path,
+        ]
+
+        if self._run_ffmpeg(cmd):
+            return output_path
+        raise RuntimeError("FFmpeg audio mix failed")

    async def compose(
        self,
@@ -133,3 +138,145 @@ class VideoService:
            return output_path
        else:
            raise RuntimeError("FFmpeg composition failed")
+
+    def concat_videos(self, video_paths: list, output_path: str) -> str:
+        """使用 FFmpeg concat demuxer 拼接多个视频片段"""
+        if not video_paths:
+            raise ValueError("No video segments to concat")
+
+        Path(output_path).parent.mkdir(parents=True, exist_ok=True)
+
+        # 生成 concat list 文件
+        list_path = Path(output_path).parent / f"{Path(output_path).stem}_concat.txt"
+        with open(list_path, "w", encoding="utf-8") as f:
+            for vp in video_paths:
+                f.write(f"file '{vp}'\n")
+
+        cmd = [
+            "ffmpeg", "-y",
+            "-f", "concat",
+            "-safe", "0",
+            "-i", str(list_path),
+            "-c", "copy",
+            output_path,
+        ]
+
+        try:
+            if self._run_ffmpeg(cmd):
+                return output_path
+            else:
+                raise RuntimeError("FFmpeg concat failed")
+        finally:
+            try:
+                list_path.unlink(missing_ok=True)
+            except Exception:
+                pass
+
+    def split_audio(self, audio_path: str, start: float, end: float, output_path: str) -> str:
+        """用 FFmpeg 按时间范围切分音频"""
+        Path(output_path).parent.mkdir(parents=True, exist_ok=True)
+
+        duration = end - start
+        if duration <= 0:
+            raise ValueError(f"Invalid audio split range: start={start}, end={end}, duration={duration}")
+
+        cmd = [
+            "ffmpeg", "-y",
+            "-ss", str(start),
+            "-t", str(duration),
+            "-i", audio_path,
+            "-c", "copy",
+            output_path,
+        ]
+
+        if self._run_ffmpeg(cmd):
+            return output_path
+        raise RuntimeError(f"FFmpeg audio split failed: {start}-{end}")
+
+    def get_resolution(self, file_path: str) -> tuple:
+        """获取视频分辨率，返回 (width, height)"""
+        cmd = [
+            'ffprobe', '-v', 'error',
+            '-select_streams', 'v:0',
+            '-show_entries', 'stream=width,height',
+            '-of', 'csv=p=0',
+            file_path
+        ]
+        try:
+            result = subprocess.run(cmd, capture_output=True, text=True, timeout=10)
+            parts = result.stdout.strip().split(',')
+            return (int(parts[0]), int(parts[1]))
+        except Exception:
+            return (0, 0)
+
+    def prepare_segment(self, video_path: str, target_duration: float, output_path: str,
+                        target_resolution: tuple = None, source_start: float = 0.0) -> str:
+        """将素材视频裁剪或循环到指定时长（无音频）。
+        target_resolution: (width, height) 如需统一分辨率则传入，否则保持原分辨率。
+        source_start: 源视频截取起点（秒），默认 0。
+        """
+        Path(output_path).parent.mkdir(parents=True, exist_ok=True)
+
+        video_dur = self._get_duration(video_path)
+        if video_dur <= 0:
+            video_dur = target_duration
+
+        # 可用时长 = 从 source_start 到视频结尾
+        available = max(video_dur - source_start, 0.1)
+        needs_loop = target_duration > available
+        needs_scale = target_resolution is not None
+
+        # 当需要循环且有 source_start 时，先裁剪出片段，再循环裁剪后的文件
+        # 避免 stream_loop 循环整个视频（而不是从 source_start 开始的片段）
+        actual_input = video_path
+        trim_temp = None
+        if needs_loop and source_start > 0:
+            trim_temp = str(Path(output_path).parent / (Path(output_path).stem + "_trim_tmp.mp4"))
+            trim_cmd = [
+                "ffmpeg", "-y",
+                "-ss", str(source_start),
+                "-i", video_path,
+                "-t", str(available),
+                "-an",
+                "-c:v", "libx264", "-preset", "fast", "-crf", "18",
+                trim_temp,
+            ]
+            if not self._run_ffmpeg(trim_cmd):
+                raise RuntimeError(f"FFmpeg trim for loop failed: {video_path}")
+            actual_input = trim_temp
+            source_start = 0.0  # 已裁剪，不需要再 seek
+            # 重新计算循环次数（基于裁剪后文件）
+            available = self._get_duration(trim_temp) or available
+
+        loop_count = int(target_duration / available) + 1 if needs_loop else 0
+
+        cmd = ["ffmpeg", "-y"]
+        if needs_loop:
+            cmd.extend(["-stream_loop", str(loop_count)])
+        if source_start > 0:
+            cmd.extend(["-ss", str(source_start)])
+        cmd.extend(["-i", actual_input, "-t", str(target_duration), "-an"])
+
+        if needs_scale:
+            w, h = target_resolution
+            cmd.extend(["-vf", f"scale={w}:{h}:force_original_aspect_ratio=decrease,pad={w}:{h}:(ow-iw)/2:(oh-ih)/2"])
+
+        # 需要循环、缩放或指定起点时必须重编码，否则用 stream copy 保持原画质
+        if needs_loop or needs_scale or source_start > 0:
+            cmd.extend(["-c:v", "libx264", "-preset", "fast", "-crf", "18"])
+        else:
+            cmd.extend(["-c:v", "copy"])
+
+        cmd.append(output_path)
+
+        try:
+            if self._run_ffmpeg(cmd):
+                return output_path
+            raise RuntimeError(f"FFmpeg prepare_segment failed: {video_path}")
+        finally:
+            # 清理裁剪临时文件
+            if trim_temp:
+                try:
+                    Path(trim_temp).unlink(missing_ok=True)
+                except Exception:
+                    pass
--- a/backend/app/services/voice_clone_service.py
+++ b/backend/app/services/voice_clone_service.py
@@ -48,7 +48,7 @@ class VoiceCloneService:
        """
        # 使用锁确保串行执行，避免 GPU 显存溢出
        async with self._lock:
-            logger.info(f"🎤 Voice Clone: {text[:30]}...")
+            logger.info(f"🎤 Voice Clone: {text[:30]}... (language={language})")
            Path(output_path).parent.mkdir(parents=True, exist_ok=True)

            # 读取参考音频
--- a/backend/app/services/whisper_service.py
+++ b/backend/app/services/whisper_service.py
@@ -20,16 +20,23 @@ MAX_CHARS_PER_LINE = 12

 def split_word_to_chars(word: str, start: float, end: float) -> list:
    """
-    将词拆分成单个字符，时间戳线性插值
+    将词拆分成单个字符，时间戳线性插值。
+    保留英文词前的空格（Whisper 输出如 " Hello"），用于正确重建英文字幕。

    Args:
-        word: 词文本
+        word: 词文本（可能含前导空格）
        start: 词开始时间
        end: 词结束时间

    Returns:
        单字符列表，每个包含 word/start/end
    """
+    # 保留前导空格（英文 Whisper 输出常见 " Hello" 形式）
+    leading_space = ""
+    if word and not word[0].strip():
+        leading_space = " "
+        word = word.lstrip()
+
    tokens = []
    ascii_buffer = ""

@@ -54,7 +61,8 @@ def split_word_to_chars(word: str, start: float, end: float) -> list:
        return []

    if len(tokens) == 1:
-        return [{"word": tokens[0], "start": start, "end": end}]
+        w = leading_space + tokens[0] if leading_space else tokens[0]
+        return [{"word": w, "start": start, "end": end}]

    # 线性插值时间戳
    duration = end - start
@@ -64,8 +72,11 @@ def split_word_to_chars(word: str, start: float, end: float) -> list:
    for i, token in enumerate(tokens):
        token_start = start + i * token_duration
        token_end = start + (i + 1) * token_duration
+        w = token
+        if i == 0 and leading_space:
+            w = leading_space + w
        result.append({
-            "word": token,
+            "word": w,
            "start": round(token_start, 3),
            "end": round(token_end, 3)
        })
@@ -108,7 +119,7 @@ def split_segment_to_lines(words: List[dict], max_chars: int = MAX_CHARS_PER_LIN

        if should_break and current_words:
            segments.append({
-                "text": current_text,
+                "text": current_text.strip(),
                "start": current_words[0]["start"],
                "end": current_words[-1]["end"],
                "words": current_words.copy()
@@ -119,7 +130,7 @@ def split_segment_to_lines(words: List[dict], max_chars: int = MAX_CHARS_PER_LIN
    # 处理剩余的字
    if current_words:
        segments.append({
-            "text": current_text,
+            "text": current_text.strip(),
            "start": current_words[0]["start"],
            "end": current_words[-1]["end"],
            "words": current_words.copy()
@@ -162,7 +173,8 @@ class WhisperService:
        self,
        audio_path: str,
        text: str,
-        output_path: Optional[str] = None
+        output_path: Optional[str] = None,
+        language: str = "zh",
    ) -> dict:
        """
        对音频进行转录，生成字级别时间戳
@@ -171,12 +183,16 @@ class WhisperService:
            audio_path: 音频文件路径
            text: 原始文本（用于参考，但实际使用 whisper 转录结果）
            output_path: 可选，输出 JSON 文件路径
+            language: 语言代码 (zh/en 等)

        Returns:
            包含字级别时间戳的字典
        """
        import asyncio

+        # 英文等西文需要更大的每行字数
+        max_chars = 40 if language != "zh" else MAX_CHARS_PER_LINE
+
        def _do_transcribe():
            model = self._load_model()

@@ -185,7 +201,7 @@ class WhisperService:
            # 转录并获取字级别时间戳
            segments_iter, info = model.transcribe(
                audio_path,
-                language="zh",
+                language=language,
                word_timestamps=True,  # 启用字级别时间戳
                vad_filter=True,  # 启用 VAD 过滤静音
            )
@@ -198,9 +214,10 @@ class WhisperService:
                all_words = []
                if segment.words:
                    for word_info in segment.words:
-                        word_text = word_info.word.strip()
-                        if word_text:
+                        word_text = word_info.word
+                        if word_text.strip():
                            # 将词拆分成单字，时间戳线性插值
+                            # 保留前导空格用于英文词间距
                            chars = split_word_to_chars(
                                word_text,
                                word_info.start,
@@ -210,7 +227,7 @@ class WhisperService:

                # 将长段落按标点和字数拆分成多行
                if all_words:
-                    line_segments = split_segment_to_lines(all_words, MAX_CHARS_PER_LINE)
+                    line_segments = split_segment_to_lines(all_words, max_chars)
                    all_segments.extend(line_segments)

            logger.info(f"Generated {len(all_segments)} subtitle segments")
--- a/backend/scripts/watchdog.py
+++ b/backend/scripts/watchdog.py
@@ -23,20 +23,28 @@ SERVICES = [
        "name": "vigent2-qwen-tts",
        "url": "http://localhost:8009/health",
        "failures": 0,
-        "threshold": 3,
+        "threshold": 5,          # 连续5次失败才重启（5×30s = 2.5分钟容忍期）
        "timeout": 10.0,
-        "restart_cmd": ["pm2", "restart", "vigent2-qwen-tts"]
+        "restart_cmd": ["pm2", "restart", "vigent2-qwen-tts"],
+        "cooldown_until": 0,     # 重启后的冷却截止时间戳
+        "cooldown_sec": 120,     # 重启后等待120秒再开始检查
    }
 ]

 async def check_service(service):
    """检查单个服务健康状态"""
+    # 冷却期内跳过检查
+    now = time.time()
+    if now < service.get("cooldown_until", 0):
+        remaining = int(service["cooldown_until"] - now)
+        logger.debug(f"⏳ 服务 {service['name']} 冷却中，剩余 {remaining}s")
+        return True
+
    try:
        timeout = service.get("timeout", 10.0)
        async with httpx.AsyncClient(timeout=timeout) as client:
            response = await client.get(service["url"])
            if response.status_code == 200:
-                # 成功
                if service["failures"] > 0:
                    logger.info(f"✅ 服务 {service['name']} 已恢复正常")
                service["failures"] = 0
@@ -45,35 +53,36 @@ async def check_service(service):
                logger.warning(f"⚠️ 服务 {service['name']} 返回状态码 {response.status_code}")
    except Exception as e:
        logger.warning(f"⚠️ 无法连接服务 {service['name']}: {str(e)}")
-    
+
    # 失败处理
    service["failures"] += 1
    logger.warning(f"❌ 服务 {service['name']} 连续失败 {service['failures']}/{service['threshold']} 次")
-    
+
    if service["failures"] >= service['threshold']:
        logger.error(f"🚨 服务 {service['name']} 已达到失败阈值，正在重启...")
        try:
            subprocess.run(service["restart_cmd"], check=True)
            logger.info(f"♻️ 服务 {service['name']} 重启命令已发送")
-            # 重启后给予一段宽限期 (例如 60秒) 不检查，等待服务启动
-            service["failures"] = 0 # 重置计数
-            return "restarting" 
+            service["failures"] = 0
+            # 设置冷却期，等待服务完成启动和模型加载
+            service["cooldown_until"] = time.time() + service.get("cooldown_sec", 120)
+            return "restarting"
        except Exception as restart_error:
            logger.error(f"💥 重启服务 {service['name']} 失败: {restart_error}")
-            
+
    return False

 async def main():
    logger.info("🛡️ ViGent2 服务看门狗 (Watchdog) 已启动")
-    
+
+    # 启动时给所有服务一个初始冷却期，避免服务还没起来就被判定失败
+    for service in SERVICES:
+        service["cooldown_until"] = time.time() + 60
+
    while True:
-        # 并发检查所有服务
        for service in SERVICES:
-            result = await check_service(service)
-            if result == "restarting":
-                # 如果有服务重启，额外等待包含启动时间
-                pass
-                
+            await check_service(service)
+
        # 每 30 秒检查一次
        await asyncio.sleep(30)

--- a/frontend/package-lock.json
+++ b/frontend/package-lock.json
@@ -8,13 +8,18 @@
      "name": "frontend",
      "version": "0.1.0",
      "dependencies": {
+        "@dnd-kit/core": "^6.3.1",
+        "@dnd-kit/sortable": "^10.0.0",
+        "@dnd-kit/utilities": "^3.2.2",
        "@supabase/supabase-js": "^2.93.1",
        "axios": "^1.13.4",
        "lucide-react": "^0.563.0",
        "next": "16.1.1",
        "react": "19.2.3",
        "react-dom": "19.2.3",
-        "swr": "^2.3.8"
+        "sonner": "^2.0.7",
+        "swr": "^2.3.8",
+        "wavesurfer.js": "^7.12.1"
      },
      "devDependencies": {
        "@tailwindcss/postcss": "^4",
@@ -280,6 +285,59 @@
        "node": ">=6.9.0"
      }
    },
+    "node_modules/@dnd-kit/accessibility": {
+      "version": "3.1.1",
+      "resolved": "https://registry.npmjs.org/@dnd-kit/accessibility/-/accessibility-3.1.1.tgz",
+      "integrity": "sha512-2P+YgaXF+gRsIihwwY1gCsQSYnu9Zyj2py8kY5fFvUM1qm2WA2u639R6YNVfU4GWr+ZM5mqEsfHZZLoRONbemw==",
+      "license": "MIT",
+      "dependencies": {
+        "tslib": "^2.0.0"
+      },
+      "peerDependencies": {
+        "react": ">=16.8.0"
+      }
+    },
+    "node_modules/@dnd-kit/core": {
+      "version": "6.3.1",
+      "resolved": "https://registry.npmjs.org/@dnd-kit/core/-/core-6.3.1.tgz",
+      "integrity": "sha512-xkGBRQQab4RLwgXxoqETICr6S5JlogafbhNsidmrkVv2YRs5MLwpjoF2qpiGjQt8S9AoxtIV603s0GIUpY5eYQ==",
+      "license": "MIT",
+      "dependencies": {
+        "@dnd-kit/accessibility": "^3.1.1",
+        "@dnd-kit/utilities": "^3.2.2",
+        "tslib": "^2.0.0"
+      },
+      "peerDependencies": {
+        "react": ">=16.8.0",
+        "react-dom": ">=16.8.0"
+      }
+    },
+    "node_modules/@dnd-kit/sortable": {
+      "version": "10.0.0",
+      "resolved": "https://registry.npmjs.org/@dnd-kit/sortable/-/sortable-10.0.0.tgz",
+      "integrity": "sha512-+xqhmIIzvAYMGfBYYnbKuNicfSsk4RksY2XdmJhT+HAC01nix6fHCztU68jooFiMUB01Ky3F0FyOvhG/BZrWkg==",
+      "license": "MIT",
+      "dependencies": {
+        "@dnd-kit/utilities": "^3.2.2",
+        "tslib": "^2.0.0"
+      },
+      "peerDependencies": {
+        "@dnd-kit/core": "^6.3.0",
+        "react": ">=16.8.0"
+      }
+    },
+    "node_modules/@dnd-kit/utilities": {
+      "version": "3.2.2",
+      "resolved": "https://registry.npmjs.org/@dnd-kit/utilities/-/utilities-3.2.2.tgz",
+      "integrity": "sha512-+MKAJEOfaBe5SmV6t34p80MMKhjvUz0vRrvVJbPT0WElzaOJ/1xs+D+KDv+tD/NE5ujfrChEcshd4fLn0wpiqg==",
+      "license": "MIT",
+      "dependencies": {
+        "tslib": "^2.0.0"
+      },
+      "peerDependencies": {
+        "react": ">=16.8.0"
+      }
+    },
    "node_modules/@emnapi/core": {
      "version": "1.8.1",
      "resolved": "https://registry.npmjs.org/@emnapi/core/-/core-1.8.1.tgz",
@@ -6006,6 +6064,16 @@
        "url": "https://github.com/sponsors/ljharb"
      }
    },
+    "node_modules/sonner": {
+      "version": "2.0.7",
+      "resolved": "https://registry.npmjs.org/sonner/-/sonner-2.0.7.tgz",
+      "integrity": "sha512-W6ZN4p58k8aDKA4XPcx2hpIQXBRAgyiWVkYhT7CvK6D3iAu7xjvVyhQHg2/iaKJZ1XVJ4r7XuwGL+WGEK37i9w==",
+      "license": "MIT",
+      "peerDependencies": {
+        "react": "^18.0.0 || ^19.0.0 || ^19.0.0-rc",
+        "react-dom": "^18.0.0 || ^19.0.0 || ^19.0.0-rc"
+      }
+    },
    "node_modules/source-map-js": {
      "version": "1.2.1",
      "resolved": "https://registry.npmjs.org/source-map-js/-/source-map-js-1.2.1.tgz",
@@ -6600,6 +6668,12 @@
        "react": "^16.8.0 || ^17.0.0 || ^18.0.0 || ^19.0.0"
      }
    },
+    "node_modules/wavesurfer.js": {
+      "version": "7.12.1",
+      "resolved": "https://registry.npmjs.org/wavesurfer.js/-/wavesurfer.js-7.12.1.tgz",
+      "integrity": "sha512-NswPjVHxk0Q1F/VMRemCPUzSojjuHHisQrBqQiRXg7MVbe3f5vQ6r0rTTXA/a/neC/4hnOEC4YpXca4LpH0SUg==",
+      "license": "BSD-3-Clause"
+    },
    "node_modules/which": {
      "version": "2.0.2",
      "resolved": "https://registry.npmjs.org/which/-/which-2.0.2.tgz",
--- a/frontend/package.json
+++ b/frontend/package.json
@@ -9,13 +9,18 @@
    "lint": "eslint"
  },
  "dependencies": {
+    "@dnd-kit/core": "^6.3.1",
+    "@dnd-kit/sortable": "^10.0.0",
+    "@dnd-kit/utilities": "^3.2.2",
    "@supabase/supabase-js": "^2.93.1",
    "axios": "^1.13.4",
    "lucide-react": "^0.563.0",
    "next": "16.1.1",
    "react": "19.2.3",
    "react-dom": "19.2.3",
-    "swr": "^2.3.8"
+    "sonner": "^2.0.7",
+    "swr": "^2.3.8",
+    "wavesurfer.js": "^7.12.1"
  },
  "devDependencies": {
    "@tailwindcss/postcss": "^4",
--- a/frontend/src/app/admin/page.tsx
+++ b/frontend/src/app/admin/page.tsx
@@ -1,10 +1,12 @@
 'use client';

 import { useState, useEffect } from 'react';
-import { useRouter } from 'next/navigation';
-import { getCurrentUser, User } from "@/shared/lib/auth";
-import api from "@/shared/api/axios";
-import { ApiResponse, unwrap } from "@/shared/api/types";
+import { useRouter } from 'next/navigation';
+import Link from 'next/link';
+import { getCurrentUser, User } from "@/shared/lib/auth";
+import api from "@/shared/api/axios";
+import { ApiResponse, unwrap } from "@/shared/api/types";
+import { toast } from "sonner";

 interface UserListItem {
    id: string;
@@ -18,7 +20,7 @@ interface UserListItem {

 export default function AdminPage() {
    const router = useRouter();
-    const [currentUser, setCurrentUser] = useState<User | null>(null);
+    const [, setCurrentUser] = useState<User | null>(null);
    const [users, setUsers] = useState<UserListItem[]>([]);
    const [loading, setLoading] = useState(true);
    const [error, setError] = useState('');
@@ -28,6 +30,7 @@ export default function AdminPage() {
    useEffect(() => {
        checkAdmin();
        fetchUsers();
+        // eslint-disable-next-line react-hooks/exhaustive-deps
    }, []);

    const checkAdmin = async () => {
@@ -41,9 +44,9 @@ export default function AdminPage() {

    const fetchUsers = async () => {
        try {
-            const { data: res } = await api.get<ApiResponse<UserListItem[]>>('/api/admin/users');
-            setUsers(unwrap(res));
-        } catch (err) {
+            const { data: res } = await api.get<ApiResponse<UserListItem[]>>('/api/admin/users');
+            setUsers(unwrap(res));
+        } catch {
            setError('获取用户列表失败');
        } finally {
            setLoading(false);
@@ -57,7 +60,7 @@ export default function AdminPage() {
                expires_days: expireDays || null
            });
            fetchUsers();
-        } catch (err) {
+        } catch {
            // axios interceptor handles 401/403
        } finally {
            setActivatingId(null);
@@ -70,8 +73,8 @@ export default function AdminPage() {
        try {
            await api.post(`/api/admin/users/${userId}/deactivate`);
            fetchUsers();
-        } catch (err) {
-            alert('操作失败');
+        } catch {
+            toast.error('操作失败');
        }
    };

@@ -106,9 +109,9 @@ export default function AdminPage() {
            <div className="max-w-6xl mx-auto">
                <div className="flex justify-between items-center mb-8">
                    <h1 className="text-3xl font-bold text-white">用户管理</h1>
-                    <a href="/" className="text-purple-300 hover:text-purple-200">
+                    <Link href="/" className="text-purple-300 hover:text-purple-200">
                        ← 返回首页
-                    </a>
+                    </Link>
                </div>

                {error && (
--- a/frontend/src/app/layout.tsx
+++ b/frontend/src/app/layout.tsx
@@ -1,9 +1,10 @@
 import type { Metadata, Viewport } from "next";
 import { Geist, Geist_Mono } from "next/font/google";
 import "./globals.css";
-import { AuthProvider } from "@/contexts/AuthContext";
-import { TaskProvider } from "@/contexts/TaskContext";
-import GlobalTaskIndicator from "@/components/GlobalTaskIndicator";
+import { AuthProvider } from "@/shared/contexts/AuthContext";
+import { TaskProvider } from "@/shared/contexts/TaskContext";
+
+import { Toaster } from "sonner";

 const geistSans = Geist({
  variable: "--font-geist-sans",
@@ -39,10 +40,17 @@ export default function RootLayout({
      >
        <AuthProvider>
          <TaskProvider>
-            <GlobalTaskIndicator />
            {children}
          </TaskProvider>
        </AuthProvider>
+        <Toaster
+          position="top-center"
+          richColors
+          toastOptions={{
+            duration: 3000,
+            className: "text-sm",
+          }}
+        />
      </body>
    </html>
  );
--- a/frontend/src/app/login/page.tsx
+++ b/frontend/src/app/login/page.tsx
@@ -2,7 +2,7 @@

 import { useState } from 'react';
 import { useRouter } from 'next/navigation';
-import { login } from "@/shared/lib/auth";
+import { login } from "@/shared/lib/auth";

 export default function LoginPage() {
    const router = useRouter();
@@ -30,7 +30,7 @@ export default function LoginPage() {
            } else {
                setError(result.message || '登录失败');
            }
-        } catch (err) {
+        } catch {
            setError('网络错误，请稍后重试');
        } finally {
            setLoading(false);
--- a/frontend/src/app/register/page.tsx
+++ b/frontend/src/app/register/page.tsx
@@ -2,10 +2,10 @@

 import { useState } from 'react';
 import { useRouter } from 'next/navigation';
-import { register } from "@/shared/lib/auth";
+import { register } from "@/shared/lib/auth";

 export default function RegisterPage() {
-    const router = useRouter();
+    useRouter(); // 保留以便后续扩展
    const [phone, setPhone] = useState('');
    const [password, setPassword] = useState('');
    const [confirmPassword, setConfirmPassword] = useState('');
@@ -43,7 +43,7 @@ export default function RegisterPage() {
            } else {
                setError(result.message || '注册失败');
            }
-        } catch (err) {
+        } catch {
            setError('网络错误，请稍后重试');
        } finally {
            setLoading(false);
--- a/frontend/src/components/AccountSettingsDropdown.tsx
+++ b/frontend/src/components/AccountSettingsDropdown.tsx
@@ -1,7 +1,7 @@
 "use client";

 import { useState, useEffect, useRef } from "react";
-import { useAuth } from "@/contexts/AuthContext";
+import { useAuth } from "@/shared/contexts/AuthContext";
 import api from "@/shared/api/axios";
 import { ApiResponse } from "@/shared/api/types";

@@ -44,7 +44,7 @@ export default function AccountSettingsDropdown() {
        if (confirm('确定要退出登录吗？')) {
            try {
                await api.post('/api/auth/logout');
-            } catch (e) { }
+            } catch { }
            window.location.href = '/login';
        }
    };
@@ -76,14 +76,15 @@ export default function AccountSettingsDropdown() {
                setTimeout(async () => {
                    try {
                        await api.post('/api/auth/logout');
-                    } catch (e) { }
+                    } catch { }
                    window.location.href = '/login';
                }, 1500);
            } else {
                setError(res.message || '修改失败');
            }
-        } catch (err: any) {
-            setError(err.response?.data?.message || '修改失败，请重试');
+        } catch (err: unknown) {
+            const axiosErr = err as { response?: { data?: { message?: string } } };
+            setError(axiosErr.response?.data?.message || '修改失败，请重试');
        } finally {
            setLoading(false);
        }
--- a/frontend/src/components/GlobalTaskIndicator.tsx
+++ b/frontend/src/components/GlobalTaskIndicator.tsx
@@ -1,12 +1,15 @@
 "use client";

-import { useTask } from "@/contexts/TaskContext";
+import { useTask } from "@/shared/contexts/TaskContext";
 import Link from "next/link";
+import { usePathname } from "next/navigation";

 export default function GlobalTaskIndicator() {
  const { currentTask, isGenerating } = useTask();
+  const pathname = usePathname();

-  if (!isGenerating) return null;
+  // 首页已有专门的进度条展示，因此在首页不显示顶部全局进度条
+  if (!isGenerating || pathname === "/") return null;

  return (
    <div className="fixed top-0 left-0 right-0 z-50 bg-gradient-to-r from-purple-600 to-pink-600 text-white shadow-lg">
--- a/frontend/src/components/ScriptExtractionModal.tsx
+++ b/frontend/src/components/ScriptExtractionModal.tsx
@@ -1,424 +0,0 @@
-"use client";
-
-import { useState, useEffect } from "react";
-import api from "@/shared/api/axios";
-import { ApiResponse, unwrap } from "@/shared/api/types";
-
-interface ScriptExtractionModalProps {
-    isOpen: boolean;
-    onClose: () => void;
-    onApply?: (text: string) => void;
-}
-
-export default function ScriptExtractionModal({
-    isOpen,
-    onClose,
-    onApply
-}: ScriptExtractionModalProps) {
-    const [isLoading, setIsLoading] = useState(false);
-    const [script, setScript] = useState("");
-    const [rewrittenScript, setRewrittenScript] = useState("");
-    const [error, setError] = useState<string | null>(null);
-    const [doRewrite, setDoRewrite] = useState(true);
-    const [step, setStep] = useState<'config' | 'processing' | 'result'>('config');
-    const [dragActive, setDragActive] = useState(false);
-    const [selectedFile, setSelectedFile] = useState<File | null>(null);
-
-    // New state for URL mode
-    const [activeTab, setActiveTab] = useState<'file' | 'url'>('url');
-    const [inputUrl, setInputUrl] = useState("");
-
-    // Reset state when modal opens
-    useEffect(() => {
-        if (isOpen) {
-            setStep('config');
-            setScript("");
-            setRewrittenScript("");
-            setError(null);
-            setIsLoading(false);
-            setSelectedFile(null);
-            setInputUrl("");
-            setActiveTab('url');
-        }
-    }, [isOpen]);
-
-    const handleDrag = (e: React.DragEvent) => {
-        e.preventDefault();
-        e.stopPropagation();
-        if (e.type === "dragenter" || e.type === "dragover") {
-            setDragActive(true);
-        } else if (e.type === "dragleave") {
-            setDragActive(false);
-        }
-    };
-
-    const handleDrop = (e: React.DragEvent) => {
-        e.preventDefault();
-        e.stopPropagation();
-        setDragActive(false);
-        if (e.dataTransfer.files && e.dataTransfer.files[0]) {
-            handleFile(e.dataTransfer.files[0]);
-        }
-    };
-
-    const handleFileChange = (e: React.ChangeEvent<HTMLInputElement>) => {
-        if (e.target.files && e.target.files[0]) {
-            handleFile(e.target.files[0]);
-        }
-    };
-
-    const handleFile = (file: File) => {
-        const validTypes = ['.mp4', '.mov', '.avi', '.mp3', '.wav', '.m4a'];
-        const ext = file.name.toLowerCase().slice(file.name.lastIndexOf('.'));
-        if (!validTypes.includes(ext)) {
-            setError(`不支持的文件格式 ${ext}，请上传视频或音频文件`);
-            return;
-        }
-        setSelectedFile(file);
-        setError(null);
-    };
-
-    const handleExtract = async () => {
-        if (activeTab === 'file' && !selectedFile) {
-            setError("请先上传文件");
-            return;
-        }
-        if (activeTab === 'url' && !inputUrl.trim()) {
-            setError("请先输入视频链接");
-            return;
-        }
-
-        setIsLoading(true);
-        setStep('processing');
-        setError(null);
-
-        try {
-            const formData = new FormData();
-            if (activeTab === 'file' && selectedFile) {
-                formData.append('file', selectedFile);
-            } else if (activeTab === 'url') {
-                formData.append('url', inputUrl.trim());
-            }
-            formData.append('rewrite', doRewrite ? 'true' : 'false');
-
-            const { data: res } = await api.post<ApiResponse<{ original_script: string; rewritten_script?: string }>>(
-                '/api/tools/extract-script',
-                formData,
-                {
-                headers: { 'Content-Type': 'multipart/form-data' },
-                timeout: 180000 // 3 minutes timeout
-            });
-
-            const payload = unwrap(res);
-            setScript(payload.original_script);
-            setRewrittenScript(payload.rewritten_script || "");
-            setStep('result');
-        } catch (err: any) {
-            console.error(err);
-            const msg = err.response?.data?.message || err.message || "请求失败";
-            setError(msg);
-            setStep('config');
-        } finally {
-            setIsLoading(false);
-        }
-    };
-
-    const copyToClipboard = (text: string) => {
-        if (navigator.clipboard && window.isSecureContext) {
-            navigator.clipboard.writeText(text).then(() => {
-                alert("已复制到剪贴板");
-            }).catch(err => {
-                console.error('Async: Could not copy text: ', err);
-                fallbackCopyTextToClipboard(text);
-            });
-        } else {
-            fallbackCopyTextToClipboard(text);
-        }
-    };
-
-    const fallbackCopyTextToClipboard = (text: string) => {
-        var textArea = document.createElement("textarea");
-        textArea.value = text;
-
-        // Avoid scrolling to bottom
-        textArea.style.top = "0";
-        textArea.style.left = "0";
-        textArea.style.position = "fixed";
-        textArea.style.opacity = "0";
-
-        document.body.appendChild(textArea);
-        textArea.focus();
-        textArea.select();
-
-        try {
-            var successful = document.execCommand('copy');
-            var msg = successful ? 'successful' : 'unsuccessful';
-            if (successful) {
-                alert("已复制到剪贴板");
-            } else {
-                alert("复制失败，请手动复制");
-            }
-        } catch (err) {
-            console.error('Fallback: Oops, unable to copy', err);
-            alert("复制失败，请手动复制");
-        }
-
-        document.body.removeChild(textArea);
-    };
-
-    // Close when clicking outside - DISABLED as per user request
-    // const modalRef = useRef<HTMLDivElement>(null);
-    // const handleBackdropClick = (e: React.MouseEvent) => {
-    //     if (modalRef.current && !modalRef.current.contains(e.target as Node)) {
-    //         onClose();
-    //     }
-    // };
-
-    if (!isOpen) return null;
-
-    return (
-        <div
-            className="fixed inset-0 z-50 flex items-center justify-center bg-black/80 backdrop-blur-sm p-4 animate-in fade-in duration-200"
-        >
-            <div
-                // ref={modalRef}
-                className="bg-[#1a1a1a] border border-white/10 rounded-2xl w-full max-w-2xl max-h-[90vh] overflow-hidden flex flex-col shadow-2xl"
-            >
-                {/* Header */}
-                <div className="flex items-center justify-between p-4 border-b border-white/10 bg-white/5">
-                    <h3 className="text-lg font-semibold text-white flex items-center gap-2">
-                        📜 文案提取助手
-                    </h3>
-                    <button
-                        onClick={onClose}
-                        className="text-gray-400 hover:text-white transition-colors text-2xl leading-none"
-                    >
-                        &times;
-                    </button>
-                </div>
-
-                {/* Content */}
-                <div className="flex-1 overflow-y-auto p-6">
-                    {step === 'config' && (
-                        <div className="space-y-6">
-
-                            {/* Tabs */}
-                            <div className="flex p-1 bg-white/5 rounded-xl border border-white/10">
-                                <button
-                                    onClick={() => setActiveTab('url')}
-                                    className={`flex-1 py-2 rounded-lg text-sm font-medium transition-all ${activeTab === 'url'
-                                        ? 'bg-purple-600 text-white shadow-lg'
-                                        : 'text-gray-400 hover:text-white hover:bg-white/5'
-                                        }`}
-                                >
-                                    🔗 粘贴链接
-                                </button>
-                                <button
-                                    onClick={() => setActiveTab('file')}
-                                    className={`flex-1 py-2 rounded-lg text-sm font-medium transition-all ${activeTab === 'file'
-                                        ? 'bg-purple-600 text-white shadow-lg'
-                                        : 'text-gray-400 hover:text-white hover:bg-white/5'
-                                        }`}
-                                >
-                                    📂 上传文件
-                                </button>
-                            </div>
-
-                            {/* URL Input Area */}
-                            {activeTab === 'url' && (
-                                <div className="space-y-2 py-4">
-                                    <div className="relative">
-                                        <input
-                                            type="text"
-                                            value={inputUrl}
-                                            onChange={(e) => setInputUrl(e.target.value)}
-                                            placeholder="请粘贴抖音、B站等主流平台视频链接..."
-                                            className="w-full bg-black/20 border border-white/10 rounded-xl px-4 py-4 text-white placeholder-gray-500 focus:outline-none focus:border-purple-500 transition-colors"
-                                        />
-                                        {inputUrl && (
-                                            <button
-                                                onClick={() => setInputUrl("")}
-                                                className="absolute right-3 top-1/2 -translate-y-1/2 text-gray-500 hover:text-white p-1"
-                                            >
-                                                ✕
-                                            </button>
-                                        )}
-                                    </div>
-                                    <p className="text-xs text-gray-400 px-1">
-                                        支持抖音、B站等主流平台分享链接，自动解析下载并提取文案。
-                                    </p>
-                                </div>
-                            )}
-
-                            {/* File Upload Area */}
-                            {activeTab === 'file' && (
-                                <div
-                                    className={`
-                                        relative border-2 border-dashed rounded-xl p-8 text-center transition-all cursor-pointer
-                                        ${dragActive ? 'border-purple-500 bg-purple-500/10' : 'border-white/20 hover:border-white/40 hover:bg-white/5'}
-                                        ${selectedFile ? 'bg-purple-900/10 border-purple-500/50' : ''}
-                                    `}
-                                    onDragEnter={handleDrag}
-                                    onDragLeave={handleDrag}
-                                    onDragOver={handleDrag}
-                                    onDrop={handleDrop}
-                                >
-                                    <input
-                                        type="file"
-                                        className="absolute inset-0 w-full h-full opacity-0 cursor-pointer"
-                                        onChange={handleFileChange}
-                                        accept=".mp4,.mov,.avi,.mp3,.wav,.m4a"
-                                    />
-
-                                    {selectedFile ? (
-                                        <div className="flex flex-col items-center">
-                                            <div className="text-4xl mb-2">📄</div>
-                                            <div className="font-medium text-white break-all max-w-xs">{selectedFile.name}</div>
-                                            <div className="text-sm text-gray-400 mt-1">{(selectedFile.size / (1024 * 1024)).toFixed(1)} MB</div>
-                                            <div className="mt-4 text-xs text-purple-400">点击更换文件</div>
-                                        </div>
-                                    ) : (
-                                        <div className="flex flex-col items-center">
-                                            <div className="text-4xl mb-2">📤</div>
-                                            <div className="font-medium text-white">点击上传或拖拽文件到此处</div>
-                                            <div className="text-sm text-gray-400 mt-2">支持 MP4, MOV, MP3, WAV 等音视频格式</div>
-                                        </div>
-                                    )}
-                                </div>
-                            )}
-
-                            {/* Options */}
-                            <div className="bg-white/5 rounded-xl p-4 border border-white/10">
-                                <label className="flex items-center gap-3 cursor-pointer">
-                                    <input
-                                        type="checkbox"
-                                        checked={doRewrite}
-                                        onChange={e => setDoRewrite(e.target.checked)}
-                                        className="w-5 h-5 accent-purple-600 rounded"
-                                    />
-                                    <div>
-                                        <div className="text-white font-medium">启用 AI 洗稿</div>
-                                        <div className="text-xs text-gray-400">自动将提取的文案重写为更自然流畅的口播稿</div>
-                                    </div>
-                                </label>
-                            </div>
-
-                            {error && (
-                                <div className="p-3 bg-red-500/20 text-red-200 rounded-lg text-sm text-center">
-                                    ❌ {error}
-                                </div>
-                            )}
-
-                            <div className="flex justify-center pt-2">
-                                <button
-                                    onClick={handleExtract}
-                                    className="w-full sm:w-auto px-10 py-3 bg-gradient-to-r from-purple-600 to-pink-600 text-white rounded-xl font-bold hover:shadow-lg hover:from-purple-500 hover:to-pink-500 transition-all transform hover:-translate-y-0.5 disabled:opacity-50 disabled:cursor-not-allowed"
-                                    disabled={activeTab === 'file' ? !selectedFile : !inputUrl.trim()}
-                                >
-                                    {activeTab === 'url' ? '🔗 解析并提取' : '🚀 开始提取'}
-                                </button>
-                            </div>
-                        </div>
-                    )}
-
-                    {step === 'processing' && (
-                        <div className="flex flex-col items-center justify-center py-20">
-                            <div className="relative w-20 h-20 mb-6">
-                                <div className="absolute inset-0 border-4 border-purple-500/30 rounded-full"></div>
-                                <div className="absolute inset-0 border-4 border-t-purple-500 rounded-full animate-spin"></div>
-                            </div>
-                            <h4 className="text-xl font-medium text-white mb-2">正在处理中...</h4>
-                            <p className="text-sm text-gray-400 text-center max-w-sm px-4">
-                                {activeTab === 'url' && "正在下载视频..."}<br />
-                                {doRewrite ? "正在进行语音识别和 AI 智能改写..." : "正在进行语音识别..."}<br />
-                                <span className="opacity-75">大文件可能需要几分钟，请不要关闭窗口</span>
-                            </p>
-                        </div>
-                    )}
-
-                    {step === 'result' && (
-                        <div className="space-y-6">
-                            {rewrittenScript && (
-                                <div className="space-y-2">
-                                    <div className="flex justify-between items-center">
-                                        <h4 className="font-semibold text-purple-300 flex items-center gap-2">
-                                            ✨ AI 洗稿结果 <span className="text-xs font-normal text-purple-400/70">(推荐)</span>
-                                        </h4>
-                                        {onApply && (
-                                            <button
-                                                onClick={() => {
-                                                    onApply(rewrittenScript);
-                                                    onClose();
-                                                }}
-                                                className="text-xs bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-500 hover:to-pink-500 text-white px-3 py-1.5 rounded-lg transition-colors flex items-center gap-1 shadow-sm"
-                                            >
-                                                📥 填入
-                                            </button>
-                                        )}
-                                        <button
-                                            onClick={() => copyToClipboard(rewrittenScript)}
-                                            className="text-xs bg-purple-600 hover:bg-purple-500 text-white px-3 py-1.5 rounded-lg transition-colors flex items-center gap-1"
-                                        >
-                                            📋 复制内容
-                                        </button>
-                                    </div>
-                                    <div className="bg-purple-900/10 border border-purple-500/20 rounded-xl p-4 max-h-60 overflow-y-auto custom-scrollbar">
-                                        <p className="text-gray-200 text-sm leading-relaxed whitespace-pre-wrap">
-                                            {rewrittenScript}
-                                        </p>
-                                    </div>
-                                </div>
-                            )}
-
-                            <div className="space-y-2">
-                                <div className="flex justify-between items-center">
-                                    <h4 className="font-semibold text-gray-400 flex items-center gap-2">
-                                        🎙️ 原始识别结果
-                                    </h4>
-                                    {onApply && (
-                                        <button
-                                            onClick={() => {
-                                                onApply(script);
-                                                onClose();
-                                            }}
-                                            className="text-xs bg-white/10 hover:bg-white/20 text-white px-3 py-1.5 rounded-lg transition-colors flex items-center gap-1"
-                                        >
-                                            📥 填入
-                                        </button>
-                                    )}
-                                    <button
-                                        onClick={() => copyToClipboard(script)}
-                                        className="text-xs bg-white/10 hover:bg-white/20 text-white px-3 py-1.5 rounded-lg transition-colors"
-                                    >
-                                        复制
-                                    </button>
-                                </div>
-                                <div className="bg-white/5 border border-white/10 rounded-xl p-4 max-h-40 overflow-y-auto custom-scrollbar">
-                                    <p className="text-gray-400 text-sm leading-relaxed whitespace-pre-wrap">
-                                        {script}
-                                    </p>
-                                </div>
-                            </div>
-
-                            <div className="flex justify-center pt-4">
-                                <button
-                                    onClick={() => {
-                                        setStep('config');
-                                        setScript("");
-                                        setRewrittenScript("");
-                                        setSelectedFile(null);
-                                        setInputUrl("");
-                                        // Keep current tab active
-                                    }}
-                                    className="px-6 py-2 bg-white/10 hover:bg-white/20 text-white rounded-lg transition-colors"
-                                >
-                                    提取下一个
-                                </button>
-                            </div>
-                        </div>
-                    )}
-                </div>
-            </div>
-        </div>
-    );
-}
--- a/frontend/src/features/home/model/useBgm.ts
+++ b/frontend/src/features/home/model/useBgm.ts
@@ -9,13 +9,15 @@ export interface BgmItem {
 }

 interface UseBgmOptions {
-  storageKey: string;
+
  selectedBgmId: string;
  setSelectedBgmId: React.Dispatch<React.SetStateAction<string>>;
 }

 export const useBgm = ({
-  storageKey,
+
+  // selectedBgmId 用于参数类型推断，不在此 hook 内部直接使用
+  // eslint-disable-next-line @typescript-eslint/no-unused-vars
  selectedBgmId,
  setSelectedBgmId,
 }: UseBgmOptions) => {
@@ -32,21 +34,20 @@ export const useBgm = ({
      const items: BgmItem[] = Array.isArray(payload.bgm) ? payload.bgm : [];
      setBgmList(items);

-      const savedBgmId = localStorage.getItem(`vigent_${storageKey}_bgmId`);
      setSelectedBgmId((prev) => {
        if (prev && items.some((item) => item.id === prev)) return prev;
-        if (savedBgmId && items.some((item) => item.id === savedBgmId)) return savedBgmId;
        return items[0]?.id || "";
      });
-    } catch (error: any) {
-      const message = error?.response?.data?.message || error?.message || '加载失败';
+    } catch (error: unknown) {
+      const axiosErr = error as { response?: { data?: { message?: string } }; message?: string };
+      const message = axiosErr?.response?.data?.message || axiosErr?.message || '加载失败';
      setBgmError(message);
      setBgmList([]);
      console.error("获取背景音乐失败:", error);
    } finally {
      setBgmLoading(false);
    }
-  }, [setSelectedBgmId, storageKey]);
+  }, [setSelectedBgmId]);

  return {
    bgmList,
--- a/frontend/src/features/home/model/useGeneratedAudios.ts
+++ b/frontend/src/features/home/model/useGeneratedAudios.ts
@@ -0,0 +1,192 @@
+import { useCallback, useEffect, useRef, useState } from "react";
+import api from "@/shared/api/axios";
+import { ApiResponse, unwrap } from "@/shared/api/types";
+import { toast } from "sonner";
+
+export interface GeneratedAudio {
+  id: string;
+  name: string;
+  path: string;
+  duration_sec: number;
+  text: string;
+  tts_mode: string;
+  language: string;
+  created_at: number;
+}
+
+interface AudioTask {
+  status: string;
+  progress?: number;
+  message?: string;
+  output?: GeneratedAudio & { audio_id: string };
+}
+
+interface UseGeneratedAudiosOptions {
+  selectedAudioId: string | null;
+  setSelectedAudioId: React.Dispatch<React.SetStateAction<string | null>>;
+}
+
+export const useGeneratedAudios = ({
+  selectedAudioId,
+  setSelectedAudioId,
+}: UseGeneratedAudiosOptions) => {
+  const [generatedAudios, setGeneratedAudios] = useState<GeneratedAudio[]>([]);
+  const [selectedAudio, setSelectedAudio] = useState<GeneratedAudio | null>(null);
+  const [isGeneratingAudio, setIsGeneratingAudio] = useState(false);
+  const [audioTaskId, setAudioTaskId] = useState<string | null>(null);
+  const [audioTask, setAudioTask] = useState<AudioTask | null>(null);
+  const pollRef = useRef<NodeJS.Timeout | null>(null);
+
+  const fetchGeneratedAudios = useCallback(async (selectId?: string) => {
+    try {
+      const { data: res } = await api.get<ApiResponse<{ items: GeneratedAudio[] }>>(
+        "/api/generated-audios"
+      );
+      const payload = unwrap(res);
+      const items: GeneratedAudio[] = payload.items || [];
+      setGeneratedAudios(items);
+
+      if (selectId && items.length > 0) {
+        if (selectId === "__latest__") {
+          setSelectedAudioId(items[0].id);
+          setSelectedAudio(items[0]);
+        } else {
+          const found = items.find((a) => a.id === selectId);
+          if (found) {
+            setSelectedAudioId(found.id);
+            setSelectedAudio(found);
+          }
+        }
+      }
+    } catch (error) {
+      console.error("获取配音列表失败:", error);
+    }
+  }, [setSelectedAudioId]);
+
+  // Sync selectedAudio when selectedAudioId changes externally (e.g. from persistence)
+  useEffect(() => {
+    if (!selectedAudioId || generatedAudios.length === 0) return;
+    const found = generatedAudios.find((a) => a.id === selectedAudioId);
+    if (found) {
+      setSelectedAudio(found);
+    }
+  }, [selectedAudioId, generatedAudios]);
+
+  const stopPolling = useCallback(() => {
+    if (pollRef.current) {
+      clearInterval(pollRef.current);
+      pollRef.current = null;
+    }
+  }, []);
+
+  const startPolling = useCallback((taskId: string) => {
+    stopPolling();
+    pollRef.current = setInterval(async () => {
+      try {
+        const { data: res } = await api.get<ApiResponse<AudioTask>>(
+          `/api/generated-audios/tasks/${taskId}`
+        );
+        const task = unwrap(res);
+        setAudioTask(task);
+
+        if (task.status === "completed") {
+          stopPolling();
+          setIsGeneratingAudio(false);
+          setAudioTaskId(null);
+          // Refresh list and select the new audio
+          await fetchGeneratedAudios("__latest__");
+          toast.success(task.message || "配音生成完成");
+        } else if (task.status === "failed") {
+          stopPolling();
+          setIsGeneratingAudio(false);
+          setAudioTaskId(null);
+          toast.error(task.message || "配音生成失败");
+        } else if (task.status === "not_found") {
+          stopPolling();
+          setIsGeneratingAudio(false);
+          setAudioTaskId(null);
+          setAudioTask(null);
+          toast.error("任务已丢失（服务可能已重启），请重新生成");
+        }
+      } catch {
+        // Network error, keep polling
+      }
+    }, 1000);
+  }, [stopPolling, fetchGeneratedAudios]);
+
+  // Cleanup on unmount
+  useEffect(() => {
+    return () => stopPolling();
+  }, [stopPolling]);
+
+  const generateAudio = useCallback(async (params: {
+    text: string;
+    tts_mode: string;
+    voice?: string;
+    ref_audio_id?: string;
+    ref_text?: string;
+    language: string;
+  }) => {
+    setIsGeneratingAudio(true);
+    setAudioTask({ status: "pending", progress: 0, message: "正在提交..." });
+
+    try {
+      const { data: res } = await api.post<ApiResponse<{ task_id: string }>>(
+        "/api/generated-audios/generate",
+        params
+      );
+      const { task_id } = unwrap(res);
+      setAudioTaskId(task_id);
+      startPolling(task_id);
+    } catch (err: unknown) {
+      setIsGeneratingAudio(false);
+      setAudioTask(null);
+      const axiosErr = err as { response?: { data?: { message?: string } }; message?: string };
+      const errorMsg = axiosErr.response?.data?.message || axiosErr.message || String(err);
+      toast.error(`配音生成失败: ${errorMsg}`);
+    }
+  }, [startPolling]);
+
+  const deleteAudio = useCallback(async (audioId: string) => {
+    if (!confirm("确定要删除这个配音吗？")) return;
+    try {
+      await api.delete(`/api/generated-audios/${encodeURIComponent(audioId)}`);
+      if (selectedAudioId === audioId) {
+        setSelectedAudioId(null);
+        setSelectedAudio(null);
+      }
+      fetchGeneratedAudios();
+    } catch (error) {
+      toast.error("删除失败: " + error);
+    }
+  }, [fetchGeneratedAudios, selectedAudioId, setSelectedAudioId]);
+
+  const renameAudio = useCallback(async (audioId: string, newName: string) => {
+    try {
+      await api.put(`/api/generated-audios/${encodeURIComponent(audioId)}`, {
+        new_name: newName,
+      });
+      fetchGeneratedAudios();
+    } catch (err: unknown) {
+      toast.error("重命名失败: " + String(err));
+    }
+  }, [fetchGeneratedAudios]);
+
+  const selectAudio = useCallback((audio: GeneratedAudio) => {
+    setSelectedAudioId(audio.id);
+    setSelectedAudio(audio);
+  }, [setSelectedAudioId]);
+
+  return {
+    generatedAudios,
+    selectedAudio,
+    selectedAudioId,
+    isGeneratingAudio,
+    audioTask,
+    fetchGeneratedAudios,
+    generateAudio,
+    deleteAudio,
+    renameAudio,
+    selectAudio,
+  };
+};
--- a/frontend/src/features/home/model/useGeneratedVideos.ts
+++ b/frontend/src/features/home/model/useGeneratedVideos.ts
@@ -1,6 +1,7 @@
-import { useCallback, useState } from "react";
+import { useCallback, useEffect, useState } from "react";
 import api from "@/shared/api/axios";
 import { ApiResponse, unwrap } from "@/shared/api/types";
+import { toast } from "sonner";

 interface GeneratedVideo {
  id: string;
@@ -11,7 +12,7 @@ interface GeneratedVideo {
 }

 interface UseGeneratedVideosOptions {
-  storageKey: string;
+
  selectedVideoId: string | null;
  setSelectedVideoId: React.Dispatch<React.SetStateAction<string | null>>;
  setGeneratedVideo: React.Dispatch<React.SetStateAction<string | null>>;
@@ -19,7 +20,7 @@ interface UseGeneratedVideosOptions {
 }

 export const useGeneratedVideos = ({
-  storageKey,
+
  selectedVideoId,
  setSelectedVideoId,
  setGeneratedVideo,
@@ -36,32 +37,42 @@ export const useGeneratedVideos = ({
      const videos: GeneratedVideo[] = payload.videos || [];
      setGeneratedVideos(videos);

-      const savedSelectedVideoId = localStorage.getItem(`vigent_${storageKey}_selectedVideoId`);
-      const currentId = preferVideoId || selectedVideoId || savedSelectedVideoId || null;
-      let nextId: string | null = null;
-      let nextUrl: string | null = null;
-
-      if (currentId) {
-        const found = videos.find(v => v.id === currentId);
-        if (found) {
-          nextId = found.id;
-          nextUrl = resolveMediaUrl(found.path);
+      // 只在明确指定 preferVideoId 时才自动选中
+      // "__latest__" 表示选中最新的（第一个），用于新视频生成完成后
+      // 其他值表示选中指定 ID 的视频
+      // 不传则不设置选中项，由 useHomePersistence 恢复
+      if (preferVideoId && videos.length > 0) {
+        if (preferVideoId === "__latest__") {
+          setSelectedVideoId(videos[0].id);
+          setGeneratedVideo(resolveMediaUrl(videos[0].path));
+        } else {
+          const found = videos.find(v => v.id === preferVideoId);
+          if (found) {
+            setSelectedVideoId(found.id);
+            setGeneratedVideo(resolveMediaUrl(found.path));
+          }
        }
      }
-
-      if (!nextId && videos.length > 0) {
-        nextId = videos[0].id;
-        nextUrl = resolveMediaUrl(videos[0].path);
-      }
-
-      if (nextId) {
-        setSelectedVideoId(nextId);
-        setGeneratedVideo(nextUrl);
-      }
    } catch (error) {
      console.error("获取历史视频失败:", error);
    }
-  }, [resolveMediaUrl, selectedVideoId, setGeneratedVideo, setSelectedVideoId, storageKey]);
+  }, [resolveMediaUrl, setGeneratedVideo, setSelectedVideoId]);
+
+  // 【核心修复】当 selectedVideoId 变化时（例如从持久化恢复），自动同步 generatedVideo (URL)
+  // 之前的逻辑只在 fetch 时设置，导致外部恢复 ID 后 URL 不同步
+  useEffect(() => {
+    if (!selectedVideoId || generatedVideos.length === 0) {
+      // 如果没有选中 ID，或者列表为空，不要轻易置空 URL，除非明确需要
+      // 这里保持现状，由 fetchGeneratedVideos 或 deleteVideo 处理置空
+      return;
+    }
+
+    const video = generatedVideos.find(v => v.id === selectedVideoId);
+    if (video) {
+      const url = resolveMediaUrl(video.path);
+      setGeneratedVideo(url);
+    }
+  }, [selectedVideoId, generatedVideos, resolveMediaUrl, setGeneratedVideo]);

  const deleteVideo = useCallback(async (videoId: string) => {
    if (!confirm("确定要删除这个视频吗？")) return;
@@ -73,7 +84,7 @@ export const useGeneratedVideos = ({
      }
      fetchGeneratedVideos();
    } catch (error) {
-      alert("删除失败: " + error);
+      toast.error("删除失败: " + error);
    }
  }, [fetchGeneratedVideos, selectedVideoId, setGeneratedVideo, setSelectedVideoId]);

--- a/frontend/src/features/home/model/useHomeController.ts
+++ b/frontend/src/features/home/model/useHomeController.ts
@@ -11,45 +11,83 @@ import {
 } from "@/shared/lib/media";
 import { clampTitle } from "@/shared/lib/title";
 import { useTitleInput } from "@/shared/hooks/useTitleInput";
-import { useAuth } from "@/contexts/AuthContext";
-import { useTask } from "@/contexts/TaskContext";
+import { useAuth } from "@/shared/contexts/AuthContext";
+import { useTask } from "@/shared/contexts/TaskContext";
+import { toast } from "sonner";
+import { usePublishPrefetch } from "@/shared/hooks/usePublishPrefetch";
+import { PublishAccount } from "@/shared/types/publish";
 import { useBgm } from "@/features/home/model/useBgm";
 import { useGeneratedVideos } from "@/features/home/model/useGeneratedVideos";
+import { useGeneratedAudios } from "@/features/home/model/useGeneratedAudios";
 import { useHomePersistence } from "@/features/home/model/useHomePersistence";
 import { useMaterials } from "@/features/home/model/useMaterials";
 import { useMediaPlayers } from "@/features/home/model/useMediaPlayers";
 import { useRefAudios } from "@/features/home/model/useRefAudios";
 import { useTitleSubtitleStyles } from "@/features/home/model/useTitleSubtitleStyles";
+import { useTimelineEditor } from "@/features/home/model/useTimelineEditor";
+import { useSavedScripts } from "@/features/home/model/useSavedScripts";
 import { ApiResponse, unwrap } from "@/shared/api/types";

-const VOICES = [
-  { id: "zh-CN-YunxiNeural", name: "云溪 (男声-年轻)" },
-  { id: "zh-CN-YunjianNeural", name: "云健 (男声-新闻)" },
-  { id: "zh-CN-YunyangNeural", name: "云扬 (男声-专业)" },
-  { id: "zh-CN-XiaoxiaoNeural", name: "晓晓 (女声-活泼)" },
-  { id: "zh-CN-XiaoyiNeural", name: "晓伊 (女声-温柔)" },
-];
+const VOICES: Record<string, { id: string; name: string }[]> = {
+  "zh-CN": [
+    { id: "zh-CN-YunxiNeural", name: "云溪 (男声-年轻)" },
+    { id: "zh-CN-YunjianNeural", name: "云健 (男声-新闻)" },
+    { id: "zh-CN-YunyangNeural", name: "云扬 (男声-专业)" },
+    { id: "zh-CN-XiaoxiaoNeural", name: "晓晓 (女声-活泼)" },
+    { id: "zh-CN-XiaoyiNeural", name: "晓伊 (女声-温柔)" },
+  ],
+  "en-US": [
+    { id: "en-US-GuyNeural", name: "Guy (Male)" },
+    { id: "en-US-JennyNeural", name: "Jenny (Female)" },
+  ],
+  "ja-JP": [
+    { id: "ja-JP-KeitaNeural", name: "圭太 (男声)" },
+    { id: "ja-JP-NanamiNeural", name: "七海 (女声)" },
+  ],
+  "ko-KR": [
+    { id: "ko-KR-InJoonNeural", name: "인준 (男声)" },
+    { id: "ko-KR-SunHiNeural", name: "선히 (女声)" },
+  ],
+  "fr-FR": [
+    { id: "fr-FR-HenriNeural", name: "Henri (Male)" },
+    { id: "fr-FR-DeniseNeural", name: "Denise (Female)" },
+  ],
+  "de-DE": [
+    { id: "de-DE-ConradNeural", name: "Conrad (Male)" },
+    { id: "de-DE-KatjaNeural", name: "Katja (Female)" },
+  ],
+  "es-ES": [
+    { id: "es-ES-AlvaroNeural", name: "Álvaro (Male)" },
+    { id: "es-ES-ElviraNeural", name: "Elvira (Female)" },
+  ],
+  "ru-RU": [
+    { id: "ru-RU-DmitryNeural", name: "Дмитрий (Male)" },
+    { id: "ru-RU-SvetlanaNeural", name: "Светлана (Female)" },
+  ],
+  "it-IT": [
+    { id: "it-IT-DiegoNeural", name: "Diego (Male)" },
+    { id: "it-IT-ElsaNeural", name: "Elsa (Female)" },
+  ],
+  "pt-BR": [
+    { id: "pt-BR-AntonioNeural", name: "Antonio (Male)" },
+    { id: "pt-BR-FranciscaNeural", name: "Francisca (Female)" },
+  ],
+};

-const PUBLISH_PREFETCH_KEY = "vigent_publish_prefetch_v1";
-const PUBLISH_PREFETCH_TTL = 2 * 60 * 1000;
+const LANG_TO_LOCALE: Record<string, string> = {
+  "中文": "zh-CN",
+  "English": "en-US",
+  "日本語": "ja-JP",
+  "한국어": "ko-KR",
+  "Français": "fr-FR",
+  "Deutsch": "de-DE",
+  "Español": "es-ES",
+  "Русский": "ru-RU",
+  "Italiano": "it-IT",
+  "Português": "pt-BR",
+};

-interface PublishAccount {
-  platform: string;
-  name: string;
-  logged_in: boolean;
-  enabled: boolean;
-}

-interface PublishVideo {
-  name: string;
-  path: string;
-}
-
-interface PublishPrefetchCache {
-  ts: number;
-  accounts?: PublishAccount[];
-  videos?: PublishVideo[];
-}

 const FIXED_REF_TEXT =
  "其实生活中有许多美好的瞬间，比如清晨的阳光，或者一杯温热的清茶。希望这次生成的音色能够自然、流畅，完美还原出我最真实的声音状态。";
@@ -86,41 +124,38 @@ interface RefAudio {
  created_at: number;
 }

-interface Material {
-  id: string;
-  name: string;
-  path: string;
-  size_mb: number;
-  scene?: string;
-}
+import type { Material } from "@/shared/types/material";

 export const useHomeController = () => {
  const apiBase = getApiBaseUrl();

-  const [selectedMaterial, setSelectedMaterial] = useState<string>("");
+  const [selectedMaterials, setSelectedMaterials] = useState<string[]>([]);
  const [previewMaterial, setPreviewMaterial] = useState<string | null>(null);

  const [text, setText] = useState<string>("");
  const [voice, setVoice] = useState<string>("zh-CN-YunxiNeural");
+  const [textLang, setTextLang] = useState<string>("zh-CN");

  // 使用全局任务状态
  const { currentTask, isGenerating, startTask } = useTask();
+  const prevIsGenerating = useRef(isGenerating);

  const [generatedVideo, setGeneratedVideo] = useState<string | null>(null);
  const [selectedVideoId, setSelectedVideoId] = useState<string | null>(null);

  // 字幕和标题相关状态
  const [videoTitle, setVideoTitle] = useState<string>("");
-  const [enableSubtitles, setEnableSubtitles] = useState<boolean>(true);
  const [selectedSubtitleStyleId, setSelectedSubtitleStyleId] = useState<string>("");
  const [selectedTitleStyleId, setSelectedTitleStyleId] = useState<string>("");
-  const [subtitleFontSize, setSubtitleFontSize] = useState<number>(60);
-  const [titleFontSize, setTitleFontSize] = useState<number>(90);
+  const [subtitleFontSize, setSubtitleFontSize] = useState<number>(80);
+  const [titleFontSize, setTitleFontSize] = useState<number>(120);
  const [subtitleSizeLocked, setSubtitleSizeLocked] = useState<boolean>(false);
  const [titleSizeLocked, setTitleSizeLocked] = useState<boolean>(false);
+  const [titleTopMargin, setTitleTopMargin] = useState<number>(62);
+  const [subtitleBottomMargin, setSubtitleBottomMargin] = useState<number>(80);
  const [showStylePreview, setShowStylePreview] = useState<boolean>(false);
  const [materialDimensions, setMaterialDimensions] = useState<{ width: number; height: number } | null>(null);
-  const [previewContainerWidth, setPreviewContainerWidth] = useState<number>(0);
+

  // 背景音乐相关状态
  const [selectedBgmId, setSelectedBgmId] = useState<string>("");
@@ -132,6 +167,13 @@ export const useHomeController = () => {
  const [selectedRefAudio, setSelectedRefAudio] = useState<RefAudio | null>(null);
  const [refText, setRefText] = useState(FIXED_REF_TEXT);

+  // 预生成配音选中 ID
+  const [selectedAudioId, setSelectedAudioId] = useState<string | null>(null);
+
+  // ClipTrimmer 模态框状态
+  const [clipTrimmerOpen, setClipTrimmerOpen] = useState(false);
+  const [clipTrimmerSegmentId, setClipTrimmerSegmentId] = useState<string | null>(null);
+
  // 音频预览与重命名状态
  const [editingAudioId, setEditingAudioId] = useState<string | null>(null);
  const [editName, setEditName] = useState("");
@@ -139,7 +181,7 @@ export const useHomeController = () => {
  const [editMaterialName, setEditMaterialName] = useState("");
  const bgmItemRefs = useRef<Record<string, HTMLDivElement | null>>({});
  const bgmListContainerRef = useRef<HTMLDivElement | null>(null);
-  const titlePreviewContainerRef = useRef<HTMLDivElement | null>(null);
+
  const materialItemRefs = useRef<Record<string, HTMLDivElement | null>>({});
  const videoItemRefs = useRef<Record<string, HTMLDivElement | null>>({});

@@ -166,8 +208,8 @@ export const useHomeController = () => {
      await api.put(`/api/ref-audios/${encodeURIComponent(audioId)}`, { new_name: editName });
      setEditingAudioId(null);
      fetchRefAudios(); // 刷新列表
-    } catch (err: any) {
-      alert("重命名失败: " + err);
+    } catch (err: unknown) {
+      toast.error("重命名失败: " + String(err));
    }
  };

@@ -194,21 +236,26 @@ export const useHomeController = () => {
        { new_name: editMaterialName.trim() }
      );
      const payload = unwrap(res);
-      if (selectedMaterial === materialId && payload?.id) {
-        setSelectedMaterial(payload.id);
+      if (selectedMaterials.includes(materialId) && payload?.id) {
+        setSelectedMaterials((prev) => prev.map((x) => (x === materialId ? payload.id : x)));
      }
      setEditingMaterialId(null);
      setEditMaterialName("");
      fetchMaterials();
-    } catch (err: any) {
-      const errorMsg = err.response?.data?.message || err.message || String(err);
-      alert(`重命名失败: ${errorMsg}`);
+    } catch (err: unknown) {
+      const axiosErr = err as { response?: { data?: { message?: string } }; message?: string };
+      const errorMsg = axiosErr.response?.data?.message || axiosErr.message || String(err);
+      toast.error(`重命名失败: ${errorMsg}`);
    }
  };

  // AI 生成标题标签
  const [isGeneratingMeta, setIsGeneratingMeta] = useState(false);

+  // AI 多语言翻译
+  const [isTranslating, setIsTranslating] = useState(false);
+  const [originalText, setOriginalText] = useState<string | null>(null);
+
  // 在线录音相关
  const [isRecording, setIsRecording] = useState(false);
  const [recordedBlob, setRecordedBlob] = useState<Blob | null>(null);
@@ -225,26 +272,8 @@ export const useHomeController = () => {
  // 获取存储 key 的前缀（登录用户使用 userId，未登录使用 guest）
  const storageKey = userId || "guest";

-  const readPublishPrefetch = () => {
-    if (typeof window === "undefined") return null;
-    const raw = sessionStorage.getItem(PUBLISH_PREFETCH_KEY);
-    if (!raw) return null;
-    try {
-      const cache = JSON.parse(raw) as PublishPrefetchCache;
-      if (!cache?.ts) return null;
-      if (Date.now() - cache.ts > PUBLISH_PREFETCH_TTL) return null;
-      return cache;
-    } catch {
-      return null;
-    }
-  };
-
-  const updatePublishPrefetch = (patch: Partial<PublishPrefetchCache>) => {
-    if (typeof window === "undefined") return;
-    const existing = readPublishPrefetch() || { ts: Date.now() };
-    const next = { ...existing, ...patch, ts: Date.now() };
-    sessionStorage.setItem(PUBLISH_PREFETCH_KEY, JSON.stringify(next));
-  };
+  // 使用共用的发布预加载 hook
+  const { updatePrefetch: updatePublishPrefetch } = usePublishPrefetch();

  const {
    materials,
@@ -256,11 +285,13 @@ export const useHomeController = () => {
    uploadError,
    setUploadError,
    fetchMaterials,
+    toggleMaterial,
+    reorderMaterials,
    deleteMaterial,
    handleUpload,
  } = useMaterials({
-    selectedMaterial,
-    setSelectedMaterial,
+    selectedMaterials,
+    setSelectedMaterials,
  });

  const {
@@ -270,7 +301,7 @@ export const useHomeController = () => {
    refreshTitleStyles,
  } = useTitleSubtitleStyles({
    isAuthLoading,
-    storageKey,
+
    setSelectedSubtitleStyleId,
    setSelectedTitleStyleId,
  });
@@ -296,7 +327,7 @@ export const useHomeController = () => {
    bgmError,
    fetchBgmList,
  } = useBgm({
-    storageKey,
+
    selectedBgmId,
    setSelectedBgmId,
  });
@@ -319,13 +350,40 @@ export const useHomeController = () => {
    fetchGeneratedVideos,
    deleteVideo,
  } = useGeneratedVideos({
-    storageKey,
+
    selectedVideoId,
    setSelectedVideoId,
    setGeneratedVideo,
    resolveMediaUrl,
  });

+  const {
+    generatedAudios,
+    selectedAudio,
+    isGeneratingAudio,
+    audioTask,
+    fetchGeneratedAudios,
+    generateAudio,
+    deleteAudio,
+    renameAudio,
+    selectAudio,
+  } = useGeneratedAudios({
+    selectedAudioId,
+    setSelectedAudioId,
+  });
+
+  const {
+    segments: timelineSegments,
+    reorderSegments,
+    setSourceRange,
+    toCustomAssignments,
+  } = useTimelineEditor({
+    audioDuration: selectedAudio?.duration_sec ?? 0,
+    materials,
+    selectedMaterials,
+    storageKey,
+  });
+
  useEffect(() => {
    if (isAuthLoading || !userId) return;
    let active = true;
@@ -347,15 +405,18 @@ export const useHomeController = () => {
    return () => {
      active = false;
    };
+    // eslint-disable-next-line react-hooks/exhaustive-deps
  }, [isAuthLoading, userId]);

  useEffect(() => {
    if (generatedVideos.length === 0) return;
    const prefetched = generatedVideos.map((video) => ({
+      id: video.id,
      name: formatDate(video.created_at) + ` (${video.size_mb.toFixed(1)}MB)`,
      path: video.path.startsWith("/") ? video.path.slice(1) : video.path,
    }));
    updatePublishPrefetch({ videos: prefetched });
+    // eslint-disable-next-line react-hooks/exhaustive-deps
  }, [generatedVideos]);

  const { isRestored } = useHomePersistence({
@@ -365,14 +426,14 @@ export const useHomeController = () => {
    setText,
    videoTitle,
    setVideoTitle,
-    enableSubtitles,
-    setEnableSubtitles,
    ttsMode,
    setTtsMode,
    voice,
    setVoice,
-    selectedMaterial,
-    setSelectedMaterial,
+    textLang,
+    setTextLang,
+    selectedMaterials,
+    setSelectedMaterials,
    selectedSubtitleStyleId,
    setSelectedSubtitleStyleId,
    selectedTitleStyleId,
@@ -383,6 +444,10 @@ export const useHomeController = () => {
    setTitleFontSize,
    setSubtitleSizeLocked,
    setTitleSizeLocked,
+    titleTopMargin,
+    setTitleTopMargin,
+    subtitleBottomMargin,
+    setSubtitleBottomMargin,
    selectedBgmId,
    setSelectedBgmId,
    bgmVolume,
@@ -392,8 +457,18 @@ export const useHomeController = () => {
    selectedVideoId,
    setSelectedVideoId,
    selectedRefAudio,
+    selectedAudioId,
+    setSelectedAudioId,
  });

+  const { savedScripts, saveScript, deleteScript: deleteSavedScript } = useSavedScripts(storageKey);
+
+  const handleSaveScript = () => {
+    if (!text.trim()) return;
+    saveScript(text);
+    toast.success("文案已保存");
+  };
+
  const syncTitleToPublish = (value: string) => {
    if (typeof window !== "undefined") {
      localStorage.setItem(`vigent_${storageKey}_publish_title`, value);
@@ -413,14 +488,29 @@ export const useHomeController = () => {
      fetchMaterials(),
      fetchGeneratedVideos(),
      fetchRefAudios(),
+      fetchGeneratedAudios(),
      refreshSubtitleStyles(),
      refreshTitleStyles(),
      fetchBgmList(),
    ]);
+    // eslint-disable-next-line react-hooks/exhaustive-deps
  }, [isAuthLoading]);

+  // 监听任务完成，自动刷新视频列表并选中最新
  useEffect(() => {
-    const material = materials.find((item) => item.id === selectedMaterial);
+    if (prevIsGenerating.current && !isGenerating) {
+      if (currentTask?.status === "completed") {
+        void fetchGeneratedVideos("__latest__");
+      } else {
+        void fetchGeneratedVideos();
+      }
+    }
+    prevIsGenerating.current = isGenerating;
+  }, [isGenerating, currentTask, fetchGeneratedVideos]);
+
+  useEffect(() => {
+    const firstSelected = selectedMaterials[0];
+    const material = materials.find((item) => item.id === firstSelected);
    if (!material?.path) {
      setMaterialDimensions(null);
      return;
@@ -460,27 +550,8 @@ export const useHomeController = () => {
      video.removeEventListener("loadedmetadata", handleLoaded);
      video.removeEventListener("error", handleError);
    };
-  }, [materials, selectedMaterial]);
+  }, [materials, selectedMaterials]);

-  useEffect(() => {
-    if (!showStylePreview) return;
-    const container = titlePreviewContainerRef.current;
-    if (!container) return;
-
-    setPreviewContainerWidth(container.getBoundingClientRect().width);
-
-    const resizeObserver = new ResizeObserver((entries) => {
-      for (const entry of entries) {
-        setPreviewContainerWidth(entry.contentRect.width);
-      }
-    });
-
-    resizeObserver.observe(container);
-
-    return () => {
-      resizeObserver.disconnect();
-    };
-  }, [showStylePreview]);

  useEffect(() => {
    if (subtitleSizeLocked || subtitleStyles.length === 0) return;
@@ -502,16 +573,8 @@ export const useHomeController = () => {
    }
  }, [titleStyles, selectedTitleStyleId, titleSizeLocked]);

-  useEffect(() => {
-    if (!enableBgm || selectedBgmId || bgmList.length === 0) return;
-    const savedBgmId = localStorage.getItem(`vigent_${storageKey}_bgmId`);
-    const savedItem = savedBgmId && bgmList.find((item) => item.id === savedBgmId);
-    if (savedItem) {
-      setSelectedBgmId(savedBgmId);
-      return;
-    }
-    setSelectedBgmId(bgmList[0].id);
-  }, [enableBgm, selectedBgmId, bgmList, storageKey, setSelectedBgmId]);
+  // 移除重复的 BGM 持久化恢复逻辑 (已统一移动到 useHomePersistence 中)
+  // useEffect(() => { ... })

  useEffect(() => {
    if (!selectedBgmId) return;
@@ -522,13 +585,39 @@ export const useHomeController = () => {
    }
  }, [selectedBgmId, bgmList]);

+  // 素材列表滚动：跳过首次恢复，仅用户主动操作时滚动
+  const materialScrollReady = useRef(false);
  useEffect(() => {
-    if (!selectedMaterial) return;
-    const target = materialItemRefs.current[selectedMaterial];
+    const firstSelected = selectedMaterials[0];
+    if (!firstSelected) return;
+    if (!materialScrollReady.current) {
+      // 首次有选中素材时标记就绪，但不滚动（避免刷新后整页跳动）
+      materialScrollReady.current = true;
+      return;
+    }
+    const target = materialItemRefs.current[firstSelected];
    if (target) {
      target.scrollIntoView({ block: "nearest", behavior: "smooth" });
    }
-  }, [selectedMaterial, materials]);
+    // eslint-disable-next-line react-hooks/exhaustive-deps
+  }, [selectedMaterials.length]);
+
+  // 【修复】历史视频默认选中逻辑
+  // 当持久化恢复完成，且列表加载完毕，如果没选中任何视频，默认选中第一个
+  useEffect(() => {
+    if (isRestored && generatedVideos.length > 0 && !selectedVideoId) {
+      const firstId = generatedVideos[0].id;
+      setSelectedVideoId(firstId);
+      setGeneratedVideo(resolveMediaUrl(generatedVideos[0].path));
+    }
+  }, [isRestored, generatedVideos, selectedVideoId, setSelectedVideoId, setGeneratedVideo, resolveMediaUrl]);
+
+  // 【修复】BGM 默认选中逻辑
+  useEffect(() => {
+    if (isRestored && bgmList.length > 0 && !selectedBgmId && enableBgm) {
+      setSelectedBgmId(bgmList[0].id);
+    }
+  }, [isRestored, bgmList, selectedBgmId, enableBgm, setSelectedBgmId]);

  useEffect(() => {
    if (!selectedVideoId) return;
@@ -593,7 +682,7 @@ export const useHomeController = () => {
        setRecordingTime((prev) => prev + 1);
      }, 1000);
    } catch (err) {
-      alert("无法访问麦克风，请检查权限设置");
+      toast.error("无法访问麦克风，请检查权限设置");
      console.error(err);
    }
  };
@@ -631,7 +720,7 @@ export const useHomeController = () => {
  // AI 生成标题和标签
  const handleGenerateMeta = async () => {
    if (!text.trim()) {
-      alert("请先输入口播文案");
+      toast.error("请先输入口播文案");
      return;
    }

@@ -649,32 +738,101 @@ export const useHomeController = () => {

      // 同步到发布页 localStorage
      localStorage.setItem(`vigent_${storageKey}_publish_tags`, JSON.stringify(payload.tags || []));
-    } catch (err: any) {
+    } catch (err: unknown) {
      console.error("AI generate meta failed:", err);
-      const errorMsg = err.response?.data?.message || err.message || String(err);
-      alert(`AI 生成失败: ${errorMsg}`);
+      const axiosErr = err as { response?: { data?: { message?: string } }; message?: string };
+      const errorMsg = axiosErr.response?.data?.message || axiosErr.message || String(err);
+      toast.error(`AI 生成失败: ${errorMsg}`);
    } finally {
      setIsGeneratingMeta(false);
    }
  };

-  // 生成视频
-  const handleGenerate = async () => {
-    if (!selectedMaterial || !text.trim()) {
-      alert("请先选择素材并填写文案");
+  // AI 多语言翻译
+  const handleTranslate = async (targetLang: string) => {
+    if (!text.trim()) {
+      toast.error("请先输入口播文案");
      return;
    }

-    // 声音克隆模式校验
-    if (ttsMode === "voiceclone") {
-      if (!selectedRefAudio) {
-        alert("请选择或上传参考音频");
-        return;
+    // 首次翻译时保存原文
+    if (originalText === null) {
+      setOriginalText(text);
+    }
+
+    setIsTranslating(true);
+    try {
+      const { data: res } = await api.post<ApiResponse<{ translated_text: string }>>(
+        "/api/ai/translate",
+        { text: text.trim(), target_lang: targetLang }
+      );
+      const payload = unwrap(res);
+      setText(payload.translated_text || "");
+
+      // 根据翻译目标语言更新 textLang 并自动切换声音
+      const locale = LANG_TO_LOCALE[targetLang] || "zh-CN";
+      setTextLang(locale);
+      if (ttsMode === "edgetts") {
+        const langVoices = VOICES[locale] || VOICES["zh-CN"];
+        setVoice(langVoices[0].id);
      }
+    } catch (err: unknown) {
+      console.error("AI translate failed:", err);
+      const axiosErr = err as { response?: { data?: { message?: string } }; message?: string };
+      const errorMsg = axiosErr.response?.data?.message || axiosErr.message || String(err);
+      toast.error(`AI 翻译失败: ${errorMsg}`);
+    } finally {
+      setIsTranslating(false);
+    }
+  };
+
+  const handleRestoreOriginal = () => {
+    if (originalText !== null) {
+      setText(originalText);
+      setOriginalText(null);
+      setTextLang("zh-CN");
+      if (ttsMode === "edgetts") {
+        setVoice(VOICES["zh-CN"][0].id);
+      }
+    }
+  };
+
+  // 生成配音
+  const handleGenerateAudio = async () => {
+    if (!text.trim()) {
+      toast.error("请先输入文案");
+      return;
+    }
+    if (ttsMode === "voiceclone" && !selectedRefAudio) {
+      toast.error("请选择参考音频");
+      return;
+    }
+
+    const params = {
+      text: text.trim(),
+      tts_mode: ttsMode,
+      voice: ttsMode === "edgetts" ? voice : undefined,
+      ref_audio_id: ttsMode === "voiceclone" ? selectedRefAudio!.id : undefined,
+      ref_text: ttsMode === "voiceclone" ? refText : undefined,
+      language: textLang,
+    };
+    await generateAudio(params);
+  };
+
+  // 生成视频
+  const handleGenerate = async () => {
+    if (selectedMaterials.length === 0 || !text.trim()) {
+      toast.error("请先选择素材并填写文案");
+      return;
+    }
+
+    if (!selectedAudio) {
+      toast.error("请先生成并选中配音");
+      return;
    }

    if (enableBgm && !selectedBgmId) {
-      alert("请选择背景音乐");
+      toast.error("请选择背景音乐");
      return;
    }

@@ -682,26 +840,44 @@ export const useHomeController = () => {

    try {
      // 查找选中的素材对象以获取路径
-      const materialObj = materials.find((m) => m.id === selectedMaterial);
-      if (!materialObj) {
-        alert("素材数据异常");
+      const firstMaterialObj = materials.find((m) => m.id === selectedMaterials[0]);
+      if (!firstMaterialObj) {
+        toast.error("素材数据异常");
        return;
      }

-      // 构建请求参数
-      const payload: Record<string, any> = {
-        material_path: materialObj.path,
-        text: text,
-        tts_mode: ttsMode,
+      // 构建请求参数 - 使用预生成配音
+      const payload: Record<string, unknown> = {
+        material_path: firstMaterialObj.path,
+        text: selectedAudio.text || text,
+        generated_audio_id: selectedAudio.id,
+        language: selectedAudio.language || textLang,
        title: videoTitle.trim() || undefined,
-        enable_subtitles: enableSubtitles,
+        enable_subtitles: true,
      };

-      if (enableSubtitles && selectedSubtitleStyleId) {
+      // 多素材
+      if (selectedMaterials.length > 1) {
+        payload.material_paths = selectedMaterials
+          .map((id) => materials.find((x) => x.id === id)?.path)
+          .filter((path): path is string => !!path);
+        // 发送自定义时间轴分配
+        const assignments = toCustomAssignments();
+        if (assignments.length > 0) {
+          payload.custom_assignments = assignments;
+        }
+      }
+
+      // 单素材 + 截取起点
+      if (selectedMaterials.length === 1 && timelineSegments[0]?.sourceStart > 0) {
+        payload.custom_assignments = toCustomAssignments();
+      }
+
+      if (selectedSubtitleStyleId) {
        payload.subtitle_style_id = selectedSubtitleStyleId;
      }

-      if (enableSubtitles && subtitleFontSize) {
+      if (subtitleFontSize) {
        payload.subtitle_font_size = Math.round(subtitleFontSize);
      }

@@ -713,18 +889,17 @@ export const useHomeController = () => {
        payload.title_font_size = Math.round(titleFontSize);
      }

+      if (videoTitle.trim()) {
+        payload.title_top_margin = Math.round(titleTopMargin);
+      }
+
+      payload.subtitle_bottom_margin = Math.round(subtitleBottomMargin);
+
      if (enableBgm && selectedBgmId) {
        payload.bgm_id = selectedBgmId;
        payload.bgm_volume = bgmVolume;
      }

-      if (ttsMode === "edgetts") {
-        payload.voice = voice;
-      } else {
-        payload.ref_audio_id = selectedRefAudio!.id;
-        payload.ref_text = refText;
-      }
-
      // 创建生成任务
      const { data: res } = await api.post<ApiResponse<{ task_id: string }>>(
        "/api/videos/generate",
@@ -785,8 +960,8 @@ export const useHomeController = () => {
    fetchMaterials,
    deleteMaterial,
    handleUpload,
-    selectedMaterial,
-    setSelectedMaterial,
+    selectedMaterials,
+    toggleMaterial,
    handlePreviewMaterial,
    editingMaterialId,
    editMaterialName,
@@ -800,6 +975,13 @@ export const useHomeController = () => {
    setExtractModalOpen,
    handleGenerateMeta,
    isGeneratingMeta,
+    handleTranslate,
+    isTranslating,
+    originalText,
+    handleRestoreOriginal,
+    savedScripts,
+    handleSaveScript,
+    deleteSavedScript,
    showStylePreview,
    setShowStylePreview,
    videoTitle,
@@ -816,19 +998,20 @@ export const useHomeController = () => {
    subtitleFontSize,
    setSubtitleFontSize,
    setSubtitleSizeLocked,
-    enableSubtitles,
-    setEnableSubtitles,
+    titleTopMargin,
+    setTitleTopMargin,
+    subtitleBottomMargin,
+    setSubtitleBottomMargin,
    resolveAssetUrl,
    getFontFormat,
    buildTextShadow,
-    previewContainerWidth,
    materialDimensions,
-    titlePreviewContainerRef,
    ttsMode,
    setTtsMode,
-    voices: VOICES,
+    voices: VOICES[textLang] || VOICES["zh-CN"],
    voice,
    setVoice,
+    textLang,
    refAudios,
    selectedRefAudio,
    handleSelectRefAudio,
@@ -879,5 +1062,22 @@ export const useHomeController = () => {
    fetchGeneratedVideos,
    registerVideoRef,
    formatDate,
+    generatedAudios,
+    selectedAudio,
+    selectedAudioId,
+    isGeneratingAudio,
+    audioTask,
+    fetchGeneratedAudios,
+    handleGenerateAudio,
+    deleteAudio,
+    renameAudio,
+    selectAudio,
+    timelineSegments,
+    reorderSegments,
+    setSourceRange,
+    clipTrimmerOpen,
+    setClipTrimmerOpen,
+    clipTrimmerSegmentId,
+    setClipTrimmerSegmentId,
  };
 };
--- a/frontend/src/features/home/model/useHomePersistence.ts
+++ b/frontend/src/features/home/model/useHomePersistence.ts
@@ -17,14 +17,14 @@ interface UseHomePersistenceOptions {
  setText: React.Dispatch<React.SetStateAction<string>>;
  videoTitle: string;
  setVideoTitle: React.Dispatch<React.SetStateAction<string>>;
-  enableSubtitles: boolean;
-  setEnableSubtitles: React.Dispatch<React.SetStateAction<boolean>>;
  ttsMode: 'edgetts' | 'voiceclone';
  setTtsMode: React.Dispatch<React.SetStateAction<'edgetts' | 'voiceclone'>>;
  voice: string;
  setVoice: React.Dispatch<React.SetStateAction<string>>;
-  selectedMaterial: string;
-  setSelectedMaterial: React.Dispatch<React.SetStateAction<string>>;
+  textLang: string;
+  setTextLang: React.Dispatch<React.SetStateAction<string>>;
+  selectedMaterials: string[];
+  setSelectedMaterials: React.Dispatch<React.SetStateAction<string[]>>;
  selectedSubtitleStyleId: string;
  setSelectedSubtitleStyleId: React.Dispatch<React.SetStateAction<string>>;
  selectedTitleStyleId: string;
@@ -35,6 +35,10 @@ interface UseHomePersistenceOptions {
  setTitleFontSize: React.Dispatch<React.SetStateAction<number>>;
  setSubtitleSizeLocked: React.Dispatch<React.SetStateAction<boolean>>;
  setTitleSizeLocked: React.Dispatch<React.SetStateAction<boolean>>;
+  titleTopMargin: number;
+  setTitleTopMargin: React.Dispatch<React.SetStateAction<number>>;
+  subtitleBottomMargin: number;
+  setSubtitleBottomMargin: React.Dispatch<React.SetStateAction<number>>;
  selectedBgmId: string;
  setSelectedBgmId: React.Dispatch<React.SetStateAction<string>>;
  bgmVolume: number;
@@ -44,6 +48,8 @@ interface UseHomePersistenceOptions {
  selectedVideoId: string | null;
  setSelectedVideoId: React.Dispatch<React.SetStateAction<string | null>>;
  selectedRefAudio: RefAudio | null;
+  selectedAudioId: string | null;
+  setSelectedAudioId: React.Dispatch<React.SetStateAction<string | null>>;
 }

 export const useHomePersistence = ({
@@ -53,14 +59,14 @@ export const useHomePersistence = ({
  setText,
  videoTitle,
  setVideoTitle,
-  enableSubtitles,
-  setEnableSubtitles,
  ttsMode,
  setTtsMode,
  voice,
  setVoice,
-  selectedMaterial,
-  setSelectedMaterial,
+  textLang,
+  setTextLang,
+  selectedMaterials,
+  setSelectedMaterials,
  selectedSubtitleStyleId,
  setSelectedSubtitleStyleId,
  selectedTitleStyleId,
@@ -71,6 +77,10 @@ export const useHomePersistence = ({
  setTitleFontSize,
  setSubtitleSizeLocked,
  setTitleSizeLocked,
+  titleTopMargin,
+  setTitleTopMargin,
+  subtitleBottomMargin,
+  setSubtitleBottomMargin,
  selectedBgmId,
  setSelectedBgmId,
  bgmVolume,
@@ -80,6 +90,8 @@ export const useHomePersistence = ({
  selectedVideoId,
  setSelectedVideoId,
  selectedRefAudio,
+  selectedAudioId,
+  setSelectedAudioId,
 }: UseHomePersistenceOptions) => {
  const [isRestored, setIsRestored] = useState(false);

@@ -88,9 +100,9 @@ export const useHomePersistence = ({

    const savedText = localStorage.getItem(`vigent_${storageKey}_text`);
    const savedTitle = localStorage.getItem(`vigent_${storageKey}_title`);
-    const savedSubtitles = localStorage.getItem(`vigent_${storageKey}_subtitles`);
    const savedTtsMode = localStorage.getItem(`vigent_${storageKey}_ttsMode`);
    const savedVoice = localStorage.getItem(`vigent_${storageKey}_voice`);
+    const savedTextLang = localStorage.getItem(`vigent_${storageKey}_textLang`);
    const savedMaterial = localStorage.getItem(`vigent_${storageKey}_material`);
    const savedSubtitleStyle = localStorage.getItem(`vigent_${storageKey}_subtitleStyle`);
    const savedTitleStyle = localStorage.getItem(`vigent_${storageKey}_titleStyle`);
@@ -98,16 +110,31 @@ export const useHomePersistence = ({
    const savedTitleFontSize = localStorage.getItem(`vigent_${storageKey}_titleFontSize`);
    const savedBgmId = localStorage.getItem(`vigent_${storageKey}_bgmId`);
    const savedSelectedVideoId = localStorage.getItem(`vigent_${storageKey}_selectedVideoId`);
+    const savedSelectedAudioId = localStorage.getItem(`vigent_${storageKey}_selectedAudioId`);
    const savedBgmVolume = localStorage.getItem(`vigent_${storageKey}_bgmVolume`);
    const savedEnableBgm = localStorage.getItem(`vigent_${storageKey}_enableBgm`);
+    const savedTitleTopMargin = localStorage.getItem(`vigent_${storageKey}_titleTopMargin`);
+    const savedSubtitleBottomMargin = localStorage.getItem(`vigent_${storageKey}_subtitleBottomMargin`);

    setText(savedText || "大家好，欢迎来到我的频道，今天给大家分享一些有趣的内容。");
    setVideoTitle(savedTitle ? clampTitle(savedTitle) : "");
-    setEnableSubtitles(savedSubtitles !== null ? savedSubtitles === 'true' : true);
    setTtsMode((savedTtsMode as 'edgetts' | 'voiceclone') || 'edgetts');
    setVoice(savedVoice || "zh-CN-YunxiNeural");
+    if (savedTextLang) setTextLang(savedTextLang);

-    if (savedMaterial) setSelectedMaterial(savedMaterial);
+    if (savedMaterial) {
+      try {
+        const parsed = JSON.parse(savedMaterial);
+        if (Array.isArray(parsed)) {
+          setSelectedMaterials(parsed);
+        } else {
+          setSelectedMaterials([savedMaterial]);
+        }
+      } catch {
+        // 旧格式: 单字符串
+        setSelectedMaterials([savedMaterial]);
+      }
+    }
    if (savedSubtitleStyle) setSelectedSubtitleStyleId(savedSubtitleStyle);
    if (savedTitleStyle) setSelectedTitleStyleId(savedTitleStyle);

@@ -131,23 +158,37 @@ export const useHomePersistence = ({
    if (savedBgmVolume) setBgmVolume(parseFloat(savedBgmVolume));
    if (savedEnableBgm !== null) setEnableBgm(savedEnableBgm === 'true');
    if (savedSelectedVideoId) setSelectedVideoId(savedSelectedVideoId);
+    if (savedSelectedAudioId) setSelectedAudioId(savedSelectedAudioId);

+    if (savedTitleTopMargin) {
+      const parsed = parseInt(savedTitleTopMargin, 10);
+      if (!Number.isNaN(parsed)) setTitleTopMargin(parsed);
+    }
+    if (savedSubtitleBottomMargin) {
+      const parsed = parseInt(savedSubtitleBottomMargin, 10);
+      if (!Number.isNaN(parsed)) setSubtitleBottomMargin(parsed);
+    }
+
+    // eslint-disable-next-line react-hooks/set-state-in-effect
    setIsRestored(true);
  }, [
    isAuthLoading,
    setBgmVolume,
    setEnableBgm,
-    setEnableSubtitles,
    setSelectedBgmId,
-    setSelectedMaterial,
+    setSelectedMaterials,
    setSelectedSubtitleStyleId,
    setSelectedTitleStyleId,
    setSelectedVideoId,
+    setSelectedAudioId,
    setSubtitleFontSize,
    setSubtitleSizeLocked,
    setText,
+    setTextLang,
    setTitleFontSize,
    setTitleSizeLocked,
+    setTitleTopMargin,
+    setSubtitleBottomMargin,
    setTtsMode,
    setVideoTitle,
    setVoice,
@@ -170,10 +211,6 @@ export const useHomePersistence = ({
    return () => clearTimeout(timeout);
  }, [videoTitle, storageKey, isRestored]);

-  useEffect(() => {
-    if (isRestored) localStorage.setItem(`vigent_${storageKey}_subtitles`, String(enableSubtitles));
-  }, [enableSubtitles, storageKey, isRestored]);
-
  useEffect(() => {
    if (isRestored) localStorage.setItem(`vigent_${storageKey}_ttsMode`, ttsMode);
  }, [ttsMode, storageKey, isRestored]);
@@ -183,10 +220,14 @@ export const useHomePersistence = ({
  }, [voice, storageKey, isRestored]);

  useEffect(() => {
-    if (isRestored && selectedMaterial) {
-      localStorage.setItem(`vigent_${storageKey}_material`, selectedMaterial);
+    if (isRestored) localStorage.setItem(`vigent_${storageKey}_textLang`, textLang);
+  }, [textLang, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (isRestored && selectedMaterials.length > 0) {
+      localStorage.setItem(`vigent_${storageKey}_material`, JSON.stringify(selectedMaterials));
    }
-  }, [selectedMaterial, storageKey, isRestored]);
+  }, [selectedMaterials, storageKey, isRestored]);

  useEffect(() => {
    if (isRestored && selectedSubtitleStyleId) {
@@ -212,6 +253,18 @@ export const useHomePersistence = ({
    }
  }, [titleFontSize, storageKey, isRestored]);

+  useEffect(() => {
+    if (isRestored) {
+      localStorage.setItem(`vigent_${storageKey}_titleTopMargin`, String(titleTopMargin));
+    }
+  }, [titleTopMargin, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (isRestored) {
+      localStorage.setItem(`vigent_${storageKey}_subtitleBottomMargin`, String(subtitleBottomMargin));
+    }
+  }, [subtitleBottomMargin, storageKey, isRestored]);
+
  useEffect(() => {
    if (isRestored) {
      localStorage.setItem(`vigent_${storageKey}_bgmId`, selectedBgmId);
@@ -241,6 +294,15 @@ export const useHomePersistence = ({
    }
  }, [selectedVideoId, storageKey, isRestored]);

+  useEffect(() => {
+    if (!isRestored) return;
+    if (selectedAudioId) {
+      localStorage.setItem(`vigent_${storageKey}_selectedAudioId`, selectedAudioId);
+    } else {
+      localStorage.removeItem(`vigent_${storageKey}_selectedAudioId`);
+    }
+  }, [selectedAudioId, storageKey, isRestored]);
+
  useEffect(() => {
    if (isRestored && selectedRefAudio) {
      localStorage.setItem(`vigent_${storageKey}_refAudioId`, selectedRefAudio.id);
--- a/frontend/src/features/home/model/useMaterials.ts
+++ b/frontend/src/features/home/model/useMaterials.ts
@@ -1,23 +1,45 @@
 import { useCallback, useState } from "react";
 import api from "@/shared/api/axios";
 import { ApiResponse, unwrap } from "@/shared/api/types";
+import { toast } from "sonner";
+import { resolveMediaUrl } from "@/shared/lib/media";
+import type { Material } from "@/shared/types/material";

-interface Material {
-  id: string;
-  name: string;
-  scene: string;
-  size_mb: number;
-  path: string;
+/** Probe video duration from a URL using <video> element */
+function probeVideoDuration(url: string): Promise<number> {
+  return new Promise((resolve) => {
+    const video = document.createElement("video");
+    video.preload = "metadata";
+    video.crossOrigin = "anonymous";
+    const cleanup = () => {
+      video.removeEventListener("loadedmetadata", onMeta);
+      video.removeEventListener("error", onError);
+      video.src = "";
+    };
+    const onMeta = () => {
+      const dur = video.duration;
+      cleanup();
+      resolve(Number.isFinite(dur) ? dur : 0);
+    };
+    const onError = () => {
+      cleanup();
+      resolve(0);
+    };
+    video.addEventListener("loadedmetadata", onMeta);
+    video.addEventListener("error", onError);
+    video.src = url;
+    video.load();
+  });
 }

 interface UseMaterialsOptions {
-  selectedMaterial: string;
-  setSelectedMaterial: React.Dispatch<React.SetStateAction<string>>;
+  selectedMaterials: string[];
+  setSelectedMaterials: React.Dispatch<React.SetStateAction<string[]>>;
 }

 export const useMaterials = ({
-  selectedMaterial,
-  setSelectedMaterial,
+  selectedMaterials,
+  setSelectedMaterials,
 }: UseMaterialsOptions) => {
  const [materials, setMaterials] = useState<Material[]>([]);
  const [fetchError, setFetchError] = useState<string | null>(null);
@@ -40,41 +62,84 @@ export const useMaterials = ({
      setMaterials(nextMaterials);
      setLastMaterialCount(nextMaterials.length);

-      const nextSelected = nextMaterials.find((item: Material) => item.id === selectedMaterial)?.id
-        || nextMaterials[0]?.id
-        || "";
-      if (nextSelected !== selectedMaterial) {
-        setSelectedMaterial(nextSelected);
+      // Probe video durations in background
+      if (nextMaterials.length > 0) {
+        Promise.all(
+          nextMaterials.map(async (m) => {
+            const url = resolveMediaUrl(m.path);
+            if (!url) return m;
+            const dur = await probeVideoDuration(url);
+            return { ...m, duration_sec: dur };
+          })
+        ).then((enriched) => setMaterials(enriched));
      }
+
+      setSelectedMaterials((prev) => {
+        // 保留已选中且仍存在的
+        const existingIds = new Set(nextMaterials.map((m) => m.id));
+        const kept = prev.filter((id) => existingIds.has(id));
+        if (kept.length > 0) return kept;
+        // 否则默认选中第一个
+        return nextMaterials[0]?.id ? [nextMaterials[0].id] : [];
+      });
    } catch (error) {
      console.error("获取素材失败:", error);
      setFetchError(String(error));
    } finally {
      setIsFetching(false);
    }
-  }, [selectedMaterial, setSelectedMaterial]);
+  }, [setSelectedMaterials]);
+
+  const MAX_MATERIALS = 4;
+
+  const toggleMaterial = useCallback((id: string) => {
+    setSelectedMaterials((prev) => {
+      if (prev.includes(id)) {
+        // 不能取消最后一个
+        if (prev.length <= 1) return prev;
+        return prev.filter((x) => x !== id);
+      }
+      if (prev.length >= MAX_MATERIALS) return prev;
+      return [...prev, id];
+    });
+  }, [setSelectedMaterials]);
+
+  const reorderMaterials = useCallback((activeId: string, overId: string) => {
+    setSelectedMaterials((prev) => {
+      const oldIndex = prev.indexOf(activeId);
+      const newIndex = prev.indexOf(overId);
+      if (oldIndex === -1 || newIndex === -1) return prev;
+      const next = [...prev];
+      next.splice(oldIndex, 1);
+      next.splice(newIndex, 0, activeId);
+      return next;
+    });
+  }, [setSelectedMaterials]);

  const deleteMaterial = useCallback(async (materialId: string) => {
    if (!confirm("确定要删除这个素材吗？")) return;
    try {
      await api.delete(`/api/materials/${materialId}`);
      fetchMaterials();
-      if (selectedMaterial === materialId) {
-        setSelectedMaterial("");
+      if (selectedMaterials.includes(materialId)) {
+        setSelectedMaterials((prev) => {
+          const next = prev.filter((id) => id !== materialId);
+          return next.length > 0 ? next : [];
+        });
      }
    } catch (error) {
-      alert("删除失败: " + error);
+      toast.error("删除失败: " + error);
    }
-  }, [fetchMaterials, selectedMaterial, setSelectedMaterial]);
+  }, [fetchMaterials, selectedMaterials, setSelectedMaterials]);

  const handleUpload = useCallback(async (e: React.ChangeEvent<HTMLInputElement>) => {
    const file = e.target.files?.[0];
    if (!file) return;

-    const validTypes = ['.mp4', '.mov', '.avi'];
+    const validTypes = ['.mp4', '.mov', '.avi', '.mkv', '.webm', '.flv', '.wmv', '.m4v', '.ts', '.mts'];
    const ext = file.name.toLowerCase().slice(file.name.lastIndexOf('.'));
    if (!validTypes.includes(ext)) {
-      setUploadError('仅支持 MP4、MOV、AVI 格式');
+      setUploadError('不支持的视频格式');
      return;
    }

@@ -98,11 +163,39 @@ export const useMaterials = ({

      setUploadProgress(100);
      setIsUploading(false);
-      fetchMaterials();
-    } catch (err: any) {
+
+      // 上传后重新拉列表并自动选中新素材
+      const { data: res } = await api.get<ApiResponse<{ materials: Material[] }>>(
+        `/api/materials?t=${new Date().getTime()}`
+      );
+      const payload = unwrap(res);
+      const nextMaterials = payload.materials || [];
+      setMaterials(nextMaterials);
+      setLastMaterialCount(nextMaterials.length);
+
+      // Probe video durations in background
+      if (nextMaterials.length > 0) {
+        Promise.all(
+          nextMaterials.map(async (m) => {
+            const url = resolveMediaUrl(m.path);
+            if (!url) return m;
+            const dur = await probeVideoDuration(url);
+            return { ...m, duration_sec: dur };
+          })
+        ).then((enriched) => setMaterials(enriched));
+      }
+
+      // 找出新增的素材 ID 并自动选中
+      const oldIds = new Set(materials.map((m) => m.id));
+      const newIds = nextMaterials.filter((m) => !oldIds.has(m.id)).map((m) => m.id);
+      if (newIds.length > 0) {
+        setSelectedMaterials((prev) => [...prev, ...newIds]);
+      }
+    } catch (err: unknown) {
      console.error("Upload failed:", err);
      setIsUploading(false);
-      const errorMsg = err.response?.data?.message || err.message || String(err);
+      const axiosErr = err as { response?: { data?: { message?: string } }; message?: string };
+      const errorMsg = axiosErr.response?.data?.message || axiosErr.message || String(err);
      setUploadError(`上传失败: ${errorMsg}`);
    }

@@ -119,6 +212,8 @@ export const useMaterials = ({
    uploadError,
    setUploadError,
    fetchMaterials,
+    toggleMaterial,
+    reorderMaterials,
    deleteMaterial,
    handleUpload,
  };
--- a/frontend/src/features/home/model/useMediaPlayers.ts
+++ b/frontend/src/features/home/model/useMediaPlayers.ts
@@ -1,5 +1,6 @@
 import { useCallback, useEffect, useRef, useState } from "react";
 import type { BgmItem } from "@/features/home/model/useBgm";
+import { toast } from "sonner";

 interface RefAudio {
  id: string;
@@ -64,12 +65,12 @@ export const useMediaPlayers = ({

    const audioUrl = resolveMediaUrl(audio.path) || audio.path;
    if (!audioUrl) {
-      alert("无法播放该参考音频");
+      toast.error("无法播放该参考音频");
      return;
    }
    const player = new Audio(audioUrl);
    player.onended = () => setPlayingAudioId(null);
-    player.play().catch((err) => alert("播放失败: " + err));
+    player.play().catch((err) => toast.error("播放失败: " + err));
    audioPlayerRef.current = player;
    setPlayingAudioId(audio.id);
  }, [playingAudioId, resolveMediaUrl, stopAudio, stopBgm]);
@@ -81,7 +82,7 @@ export const useMediaPlayers = ({

    const bgmUrl = resolveBgmUrl(bgm.id);
    if (!bgmUrl) {
-      alert("无法播放该背景音乐");
+      toast.error("无法播放该背景音乐");
      return;
    }

@@ -96,7 +97,7 @@ export const useMediaPlayers = ({
    const player = new Audio(bgmUrl);
    player.volume = Math.max(0, Math.min(bgmVolume, 1));
    player.onended = () => setPlayingBgmId(null);
-    player.play().catch((err) => alert("播放失败: " + err));
+    player.play().catch((err) => toast.error("播放失败: " + err));
    bgmPlayerRef.current = player;
    setPlayingBgmId(bgm.id);
  }, [bgmVolume, playingBgmId, resolveBgmUrl, setEnableBgm, setSelectedBgmId, stopAudio, stopBgm]);
--- a/frontend/src/features/home/model/useRefAudios.ts
+++ b/frontend/src/features/home/model/useRefAudios.ts
@@ -1,6 +1,7 @@
 import { useCallback, useState } from "react";
 import api from "@/shared/api/axios";
 import { ApiResponse, unwrap } from "@/shared/api/types";
+import { toast } from "sonner";

 interface RefAudio {
  id: string;
@@ -60,10 +61,11 @@ export const useRefAudios = ({
      setSelectedRefAudio(payload);
      setRefText(payload.ref_text);
      setIsUploadingRef(false);
-    } catch (err: any) {
+    } catch (err: unknown) {
      console.error("Upload ref audio failed:", err);
      setIsUploadingRef(false);
-      const errorMsg = err.response?.data?.message || err.message || String(err);
+      const axiosErr = err as { response?: { data?: { message?: string } }; message?: string };
+      const errorMsg = axiosErr.response?.data?.message || axiosErr.message || String(err);
      setUploadRefError(`上传失败: ${errorMsg}`);
    }
  }, [fetchRefAudios, fixedRefText, setRefText, setSelectedRefAudio]);
@@ -78,7 +80,7 @@ export const useRefAudios = ({
        setRefText('');
      }
    } catch (error) {
-      alert("删除失败: " + error);
+      toast.error("删除失败: " + error);
    }
  }, [fetchRefAudios, selectedRefAudio, setRefText, setSelectedRefAudio]);

--- a/frontend/src/features/home/model/useSavedScripts.ts
+++ b/frontend/src/features/home/model/useSavedScripts.ts
@@ -0,0 +1,51 @@
+import { useState, useEffect, useRef } from "react";
+
+export interface SavedScript {
+  id: string;
+  name: string;
+  content: string;
+  savedAt: number;
+}
+
+export function useSavedScripts(storageKey: string) {
+  const lsKey = `vigent_${storageKey}_savedScripts`;
+  const lsKeyRef = useRef(lsKey);
+  lsKeyRef.current = lsKey;
+
+  const [savedScripts, setSavedScripts] = useState<SavedScript[]>([]);
+
+  // Re-read from localStorage whenever lsKey changes (e.g. guest → userId)
+  useEffect(() => {
+    try {
+      const raw = localStorage.getItem(lsKey);
+      setSavedScripts(raw ? JSON.parse(raw) : []);
+    } catch {
+      setSavedScripts([]);
+    }
+  }, [lsKey]);
+
+  const saveScript = (content: string) => {
+    const name = content.slice(0, 15).replace(/\n/g, " ") || "未命名";
+    const entry: SavedScript = {
+      id: Date.now().toString(36) + Math.random().toString(36).slice(2, 6),
+      name,
+      content,
+      savedAt: Date.now(),
+    };
+    setSavedScripts((prev) => {
+      const next = [entry, ...prev];
+      localStorage.setItem(lsKeyRef.current, JSON.stringify(next));
+      return next;
+    });
+  };
+
+  const deleteScript = (id: string) => {
+    setSavedScripts((prev) => {
+      const next = prev.filter((s) => s.id !== id);
+      localStorage.setItem(lsKeyRef.current, JSON.stringify(next));
+      return next;
+    });
+  };
+
+  return { savedScripts, saveScript, deleteScript };
+}
--- a/frontend/src/features/home/model/useTimelineEditor.ts
+++ b/frontend/src/features/home/model/useTimelineEditor.ts
@@ -0,0 +1,246 @@
+import { useCallback, useEffect, useRef, useState } from "react";
+import type { Material } from "@/shared/types/material";
+
+export interface TimelineSegment {
+  id: string;
+  materialId: string;
+  materialName: string;
+  start: number;
+  end: number;
+  sourceStart: number;
+  sourceEnd: number;
+  color: string;
+}
+
+export interface CustomAssignment {
+  material_path: string;
+  start: number;
+  end: number;
+  source_start: number;
+}
+
+const COLORS = ["#8b5cf6", "#ec4899", "#06b6d4", "#f59e0b", "#10b981", "#f97316"];
+
+/** Serializable subset for localStorage */
+interface SegmentSnapshot {
+  materialId: string;
+  start: number;
+  end: number;
+  sourceStart: number;
+  sourceEnd: number;
+}
+
+/** Get effective duration of a segment (clipped range or full material duration) */
+function getEffectiveDuration(
+  seg: { sourceStart: number; sourceEnd: number; materialId: string },
+  mats: Material[]
+): number {
+  if (seg.sourceEnd > seg.sourceStart) return seg.sourceEnd - seg.sourceStart;
+  const mat = mats.find((m) => m.id === seg.materialId);
+  return mat?.duration_sec ?? 0;
+}
+
+/**
+ * Recalculate segment start/end positions based on effective durations.
+ * - Segments placed sequentially by effective duration
+ * - Segments exceeding audioDuration keep their positions (overflow, start >= duration)
+ * - Last visible segment is capped/extended to exactly audioDuration (loop fill)
+ */
+function recalcPositions(
+  segs: TimelineSegment[],
+  mats: Material[],
+  duration: number
+): TimelineSegment[] {
+  if (segs.length === 0 || duration <= 0) return segs;
+
+  const fallbackDur = duration / segs.length;
+  let cursor = 0;
+  const result = segs.map((seg) => {
+    const effDur = getEffectiveDuration(seg, mats);
+    const dur = effDur > 0 ? effDur : fallbackDur;
+    const newSeg = { ...seg, start: cursor, end: cursor + dur };
+    cursor += dur;
+    return newSeg;
+  });
+
+  // Find last segment that starts before audioDuration
+  let lastVisibleIdx = -1;
+  for (let i = result.length - 1; i >= 0; i--) {
+    if (result[i].start < duration) {
+      lastVisibleIdx = i;
+      break;
+    }
+  }
+
+  // Cap/extend last visible segment to exactly audioDuration
+  if (lastVisibleIdx >= 0) {
+    result[lastVisibleIdx] = { ...result[lastVisibleIdx], end: duration };
+  }
+
+  return result;
+}
+
+interface UseTimelineEditorOptions {
+  audioDuration: number;
+  materials: Material[];
+  selectedMaterials: string[];
+  storageKey?: string;
+}
+
+export const useTimelineEditor = ({
+  audioDuration,
+  materials,
+  selectedMaterials,
+  storageKey,
+}: UseTimelineEditorOptions) => {
+  const [segments, setSegments] = useState<TimelineSegment[]>([]);
+  const prevKey = useRef("");
+  const restoredRef = useRef(false);
+
+  // Refs for stable callbacks (avoid recreating on every materials/duration change)
+  const materialsRef = useRef(materials);
+  materialsRef.current = materials;
+  const audioDurationRef = useRef(audioDuration);
+  audioDurationRef.current = audioDuration;
+
+  // Build a durationsKey so segments re-init when material durations become available
+  const durationsKey = selectedMaterials
+    .map((id) => materials.find((m) => m.id === id)?.duration_sec ?? 0)
+    .join(",");
+
+  // Build a cache key from materials + duration
+  const cacheKey = `${selectedMaterials.join(",")}_${audioDuration.toFixed(1)}`;
+  const lsKey = storageKey ? `vigent_${storageKey}_timeline` : null;
+
+  const initSegments = useCallback(() => {
+    if (selectedMaterials.length === 0 || audioDuration <= 0) {
+      setSegments([]);
+      return;
+    }
+
+    // Try restore from localStorage
+    if (lsKey) {
+      try {
+        const raw = localStorage.getItem(lsKey);
+        if (raw) {
+          const saved = JSON.parse(raw) as { key: string; segments: SegmentSnapshot[] };
+          if (saved.key === cacheKey && saved.segments.length === selectedMaterials.length) {
+            const allMatch = saved.segments.every(
+              (s, i) => s.materialId === selectedMaterials[i] || saved.segments.some((ss) => ss.materialId === selectedMaterials[i])
+            );
+            if (allMatch) {
+              const restored: TimelineSegment[] = saved.segments.map((s, i) => {
+                const mat = materials.find((m) => m.id === s.materialId);
+                return {
+                  id: `seg-${i}-${Date.now()}`,
+                  materialId: s.materialId,
+                  materialName: mat?.scene || mat?.name || s.materialId,
+                  start: 0,
+                  end: 0,
+                  sourceStart: s.sourceStart,
+                  sourceEnd: s.sourceEnd,
+                  color: COLORS[i % COLORS.length],
+                };
+              });
+              setSegments(recalcPositions(restored, materials, audioDuration));
+              restoredRef.current = true;
+              return;
+            }
+          }
+        }
+      } catch {
+        // ignore parse errors
+      }
+    }
+
+    // Create fresh segments — positions derived by recalcPositions
+    const newSegments: TimelineSegment[] = selectedMaterials.map((matId, i) => {
+      const mat = materials.find((m) => m.id === matId);
+      return {
+        id: `seg-${i}-${Date.now()}`,
+        materialId: matId,
+        materialName: mat?.scene || mat?.name || matId,
+        start: 0,
+        end: 0,
+        sourceStart: 0,
+        sourceEnd: 0,
+        color: COLORS[i % COLORS.length],
+      };
+    });
+
+    setSegments(recalcPositions(newSegments, materials, audioDuration));
+  }, [audioDuration, materials, selectedMaterials, lsKey, cacheKey]);
+
+  // Auto-init when selectedMaterials, audioDuration, or material durations change
+  useEffect(() => {
+    const key = `${selectedMaterials.join(",")}_${audioDuration}_${durationsKey}`;
+    if (key !== prevKey.current) {
+      prevKey.current = key;
+      initSegments();
+    }
+  }, [selectedMaterials, audioDuration, durationsKey, initSegments]);
+
+  // Persist segments to localStorage on change (debounced)
+  useEffect(() => {
+    if (!lsKey || segments.length === 0) return;
+    const timeout = setTimeout(() => {
+      const snapshots: SegmentSnapshot[] = segments.map((s) => ({
+        materialId: s.materialId,
+        start: s.start,
+        end: s.end,
+        sourceStart: s.sourceStart,
+        sourceEnd: s.sourceEnd,
+      }));
+      localStorage.setItem(lsKey, JSON.stringify({ key: cacheKey, segments: snapshots }));
+    }, 300);
+    return () => clearTimeout(timeout);
+  }, [segments, lsKey, cacheKey]);
+
+  const reorderSegments = useCallback(
+    (fromIdx: number, toIdx: number) => {
+      setSegments((prev) => {
+        if (fromIdx < 0 || toIdx < 0 || fromIdx >= prev.length || toIdx >= prev.length) return prev;
+        if (fromIdx === toIdx) return prev;
+        const next = [...prev];
+        // Move the segment: remove from old position, insert at new position
+        const [moved] = next.splice(fromIdx, 1);
+        next.splice(toIdx, 0, moved);
+        return recalcPositions(next, materialsRef.current, audioDurationRef.current);
+      });
+    },
+    []
+  );
+
+  const setSourceRange = useCallback(
+    (id: string, sourceStart: number, sourceEnd: number) => {
+      setSegments((prev) => {
+        const updated = prev.map((s) => (s.id === id ? { ...s, sourceStart, sourceEnd } : s));
+        return recalcPositions(updated, materialsRef.current, audioDurationRef.current);
+      });
+    },
+    []
+  );
+
+  const toCustomAssignments = useCallback((): CustomAssignment[] => {
+    const duration = audioDurationRef.current;
+    return segments
+      .filter((seg) => seg.start < duration)
+      .map((seg) => {
+        const mat = materialsRef.current.find((m) => m.id === seg.materialId);
+        return {
+          material_path: mat?.path || seg.materialId,
+          start: seg.start,
+          end: seg.end,
+          source_start: seg.sourceStart,
+        };
+      });
+  }, [segments]);
+
+  return {
+    segments,
+    initSegments,
+    reorderSegments,
+    setSourceRange,
+    toCustomAssignments,
+  };
+};
--- a/frontend/src/features/home/model/useTitleSubtitleStyles.ts
+++ b/frontend/src/features/home/model/useTitleSubtitleStyles.ts
@@ -34,14 +34,14 @@ export interface TitleStyleOption {

 interface UseTitleSubtitleStylesOptions {
  isAuthLoading: boolean;
-  storageKey: string;
+
  setSelectedSubtitleStyleId: React.Dispatch<React.SetStateAction<string>>;
  setSelectedTitleStyleId: React.Dispatch<React.SetStateAction<string>>;
 }

 export const useTitleSubtitleStyles = ({
  isAuthLoading,
-  storageKey,
+
  setSelectedSubtitleStyleId,
  setSelectedTitleStyleId,
 }: UseTitleSubtitleStylesOptions) => {
@@ -57,17 +57,15 @@ export const useTitleSubtitleStyles = ({
      const styles: SubtitleStyleOption[] = payload.styles || [];
      setSubtitleStyles(styles);

-      const savedStyleId = localStorage.getItem(`vigent_${storageKey}_subtitleStyle`);
      setSelectedSubtitleStyleId((prev) => {
        if (prev && styles.some((s) => s.id === prev)) return prev;
-        if (savedStyleId && styles.some((s) => s.id === savedStyleId)) return savedStyleId;
        const defaultStyle = styles.find((s) => s.is_default) || styles[0];
        return defaultStyle?.id || "";
      });
    } catch (error) {
      console.error("获取字幕样式失败:", error);
    }
-  }, [setSelectedSubtitleStyleId, storageKey]);
+  }, [setSelectedSubtitleStyleId]);

  const refreshTitleStyles = useCallback(async () => {
    try {
@@ -78,21 +76,21 @@ export const useTitleSubtitleStyles = ({
      const styles: TitleStyleOption[] = payload.styles || [];
      setTitleStyles(styles);

-      const savedStyleId = localStorage.getItem(`vigent_${storageKey}_titleStyle`);
      setSelectedTitleStyleId((prev) => {
        if (prev && styles.some((s) => s.id === prev)) return prev;
-        if (savedStyleId && styles.some((s) => s.id === savedStyleId)) return savedStyleId;
        const defaultStyle = styles.find((s) => s.is_default) || styles[0];
        return defaultStyle?.id || "";
      });
    } catch (error) {
      console.error("获取标题样式失败:", error);
    }
-  }, [setSelectedTitleStyleId, storageKey]);
+  }, [setSelectedTitleStyleId]);

  useEffect(() => {
    if (isAuthLoading) return;
+    // eslint-disable-next-line react-hooks/set-state-in-effect
    refreshSubtitleStyles();
+    // eslint-disable-next-line react-hooks/set-state-in-effect
    refreshTitleStyles();
  }, [isAuthLoading, refreshSubtitleStyles, refreshTitleStyles]);

--- a/frontend/src/features/home/ui/ClipTrimmer.tsx
+++ b/frontend/src/features/home/ui/ClipTrimmer.tsx
@@ -0,0 +1,293 @@
+import { useCallback, useEffect, useRef, useState } from "react";
+import { X, Play, Pause } from "lucide-react";
+import type { TimelineSegment } from "@/features/home/model/useTimelineEditor";
+
+interface ClipTrimmerProps {
+  isOpen: boolean;
+  segment: TimelineSegment | null;
+  materialUrl: string | null;
+  onConfirm: (sourceStart: number, sourceEnd: number) => void;
+  onClose: () => void;
+}
+
+function formatSec(sec: number): string {
+  const m = Math.floor(sec / 60);
+  const s = sec % 60;
+  return `${String(m).padStart(2, "0")}:${s.toFixed(1).padStart(4, "0")}`;
+}
+
+export function ClipTrimmer({
+  isOpen,
+  segment,
+  materialUrl,
+  onConfirm,
+  onClose,
+}: ClipTrimmerProps) {
+  const videoRef = useRef<HTMLVideoElement>(null);
+  const trackRef = useRef<HTMLDivElement>(null);
+  const [duration, setDuration] = useState(0);
+  const [sourceStart, setSourceStart] = useState(0);
+  const [sourceEnd, setSourceEnd] = useState(0);
+  const [currentTime, setCurrentTime] = useState(0);
+  const [isPlaying, setIsPlaying] = useState(false);
+  const [dragging, setDragging] = useState<"start" | "end" | null>(null);
+  const animRef = useRef<number>(0);
+
+  // Reset state when segment changes
+  useEffect(() => {
+    if (segment && isOpen) {
+      setSourceStart(segment.sourceStart);
+      setSourceEnd(segment.sourceEnd);
+      setCurrentTime(segment.sourceStart);
+      setIsPlaying(false);
+    }
+  }, [segment, isOpen]);
+
+  // Track currentTime during playback
+  useEffect(() => {
+    if (!isPlaying || !videoRef.current) return;
+
+    const tick = () => {
+      if (!videoRef.current) return;
+      const t = videoRef.current.currentTime;
+      const end = sourceEnd || duration;
+      if (t >= end) {
+        videoRef.current.pause();
+        videoRef.current.currentTime = sourceStart;
+        setCurrentTime(sourceStart);
+        setIsPlaying(false);
+        return;
+      }
+      setCurrentTime(t);
+      animRef.current = requestAnimationFrame(tick);
+    };
+    animRef.current = requestAnimationFrame(tick);
+    return () => cancelAnimationFrame(animRef.current);
+  }, [isPlaying, sourceStart, sourceEnd, duration]);
+
+  // Seek video when not playing and currentTime changes
+  useEffect(() => {
+    if (videoRef.current && !isPlaying) {
+      videoRef.current.currentTime = currentTime;
+    }
+  }, [currentTime, isPlaying]);
+
+  const handleLoadedMetadata = useCallback(() => {
+    if (videoRef.current) {
+      const dur = videoRef.current.duration;
+      setDuration(dur);
+      if (sourceEnd === 0) {
+        setSourceEnd(dur);
+      }
+    }
+  }, [sourceEnd]);
+
+  const togglePlay = useCallback(() => {
+    if (!videoRef.current || duration === 0) return;
+    if (isPlaying) {
+      videoRef.current.pause();
+      setIsPlaying(false);
+    } else {
+      const end = sourceEnd || duration;
+      if (videoRef.current.currentTime >= end || videoRef.current.currentTime < sourceStart) {
+        videoRef.current.currentTime = sourceStart;
+        setCurrentTime(sourceStart);
+      }
+      videoRef.current.play().catch(() => {});
+      setIsPlaying(true);
+    }
+  }, [isPlaying, sourceStart, sourceEnd, duration]);
+
+  // --- Dual-handle slider logic ---
+  const getPositionFromEvent = useCallback(
+    (clientX: number) => {
+      if (!trackRef.current || duration === 0) return 0;
+      const rect = trackRef.current.getBoundingClientRect();
+      const ratio = Math.max(0, Math.min(1, (clientX - rect.left) / rect.width));
+      return ratio * duration;
+    },
+    [duration]
+  );
+
+  const handleThumbPointerDown = useCallback(
+    (which: "start" | "end", e: React.PointerEvent) => {
+      e.preventDefault();
+      e.stopPropagation();
+      setDragging(which);
+      (e.target as HTMLElement).setPointerCapture(e.pointerId);
+    },
+    []
+  );
+
+  const handleTrackPointerMove = useCallback(
+    (e: React.PointerEvent) => {
+      if (!dragging) return;
+      const pos = getPositionFromEvent(e.clientX);
+      const minGap = 0.5;
+      if (dragging === "start") {
+        const clamped = Math.max(0, Math.min(pos, (sourceEnd || duration) - minGap));
+        setSourceStart(clamped);
+        setCurrentTime(clamped);
+      } else {
+        const clamped = Math.min(duration, Math.max(pos, sourceStart + minGap));
+        setSourceEnd(clamped);
+      }
+    },
+    [dragging, getPositionFromEvent, sourceStart, sourceEnd, duration]
+  );
+
+  const handleTrackPointerUp = useCallback(() => {
+    setDragging(null);
+  }, []);
+
+  const handleConfirm = () => {
+    onConfirm(sourceStart, sourceEnd >= duration ? 0 : sourceEnd);
+  };
+
+  if (!isOpen || !segment) return null;
+
+  const assignedDur = segment.end - segment.start;
+  const effectiveEnd = sourceEnd || duration;
+  const clipDur = effectiveEnd - sourceStart;
+  const startPct = duration > 0 ? (sourceStart / duration) * 100 : 0;
+  const endPct = duration > 0 ? (effectiveEnd / duration) * 100 : 100;
+  const playheadPct = duration > 0 ? (currentTime / duration) * 100 : 0;
+
+  return (
+    <div className="fixed inset-0 z-50 flex items-center justify-center bg-black/60 backdrop-blur-sm" onClick={onClose}>
+      <div
+        className="bg-gray-900 border border-white/10 rounded-2xl w-full max-w-lg mx-4 overflow-hidden"
+        onClick={(e) => e.stopPropagation()}
+      >
+        {/* Header */}
+        <div className="flex items-center justify-between px-5 py-3 border-b border-white/10">
+          <h3 className="text-white font-semibold text-sm">
+            截取设置 - {segment.materialName}
+          </h3>
+          <button onClick={onClose} className="text-gray-400 hover:text-white">
+            <X className="h-4 w-4" />
+          </button>
+        </div>
+
+        {/* Video preview */}
+        <div className="px-5 pt-4">
+          <div className="relative bg-black rounded-lg overflow-hidden aspect-video group">
+            {materialUrl ? (
+              <video
+                ref={videoRef}
+                src={materialUrl}
+                className="w-full h-full object-contain"
+                onLoadedMetadata={handleLoadedMetadata}
+                onEnded={() => setIsPlaying(false)}
+                preload="auto"
+                muted
+              />
+            ) : (
+              <div className="flex items-center justify-center h-full text-gray-500 text-sm">
+                无法加载视频
+              </div>
+            )}
+            {/* Play/Pause overlay */}
+            {materialUrl && (
+              <button
+                onClick={togglePlay}
+                className="absolute inset-0 flex items-center justify-center bg-black/0 hover:bg-black/30 transition-colors"
+              >
+                <div className={`p-3 rounded-full bg-black/60 text-white transition-opacity ${isPlaying ? "opacity-0 group-hover:opacity-100" : "opacity-100"}`}>
+                  {isPlaying ? <Pause className="h-6 w-6" /> : <Play className="h-6 w-6" />}
+                </div>
+              </button>
+            )}
+            <div className="absolute bottom-2 right-2 bg-black/70 text-white text-[10px] px-2 py-0.5 rounded pointer-events-none">
+              {formatSec(currentTime)}
+            </div>
+          </div>
+        </div>
+
+        {/* Dual-handle range slider */}
+        <div className="px-5 py-4 space-y-3">
+          <div className="text-xs text-gray-400 flex justify-between">
+            <span>源视频时长: {duration > 0 ? formatSec(duration) : "加载中..."}</span>
+          </div>
+
+          {/* Custom range track */}
+          <div
+            ref={trackRef}
+            className="relative h-8 cursor-pointer select-none touch-none"
+            onPointerMove={handleTrackPointerMove}
+            onPointerUp={handleTrackPointerUp}
+            onPointerLeave={handleTrackPointerUp}
+          >
+            {/* Background track */}
+            <div className="absolute top-1/2 -translate-y-1/2 left-0 right-0 h-2 bg-white/10 rounded-full" />
+
+            {/* Selected range */}
+            <div
+              className="absolute top-1/2 -translate-y-1/2 h-2 rounded-full"
+              style={{
+                left: `${startPct}%`,
+                width: `${endPct - startPct}%`,
+                backgroundColor: segment.color + "88",
+              }}
+            />
+
+            {/* Playhead indicator */}
+            {duration > 0 && (
+              <div
+                className="absolute top-1/2 -translate-y-1/2 w-0.5 h-4 bg-white/60 rounded-full pointer-events-none"
+                style={{ left: `${playheadPct}%` }}
+              />
+            )}
+
+            {/* Start thumb */}
+            <div
+              onPointerDown={(e) => handleThumbPointerDown("start", e)}
+              className="absolute top-1/2 -translate-y-1/2 -translate-x-1/2 w-4 h-4 rounded-full bg-purple-500 border-2 border-white shadow-lg cursor-grab active:cursor-grabbing hover:scale-110 transition-transform z-10"
+              style={{ left: `${startPct}%` }}
+              title={`起点: ${formatSec(sourceStart)}`}
+            />
+
+            {/* End thumb */}
+            <div
+              onPointerDown={(e) => handleThumbPointerDown("end", e)}
+              className="absolute top-1/2 -translate-y-1/2 -translate-x-1/2 w-4 h-4 rounded-full bg-pink-500 border-2 border-white shadow-lg cursor-grab active:cursor-grabbing hover:scale-110 transition-transform z-10"
+              style={{ left: `${endPct}%` }}
+              title={`终点: ${formatSec(effectiveEnd)}`}
+            />
+          </div>
+
+          {/* Time labels */}
+          <div className="flex justify-between text-xs text-gray-400">
+            <span className="text-purple-400">{formatSec(sourceStart)}</span>
+            <span className="text-pink-400">{formatSec(effectiveEnd)}</span>
+          </div>
+
+          {/* Info */}
+          <div className="text-[11px] text-gray-500 flex items-center gap-2 flex-wrap">
+            <span>截取: {clipDur.toFixed(1)}s</span>
+            <span className="text-gray-600">|</span>
+            <span>分配: {assignedDur.toFixed(1)}s</span>
+            {clipDur < assignedDur && <span className="text-amber-500">(将循环补足)</span>}
+            {clipDur > assignedDur && <span className="text-cyan-500">(将截断)</span>}
+          </div>
+        </div>
+
+        {/* Actions */}
+        <div className="flex justify-end gap-2 px-5 pb-4">
+          <button
+            onClick={onClose}
+            className="px-4 py-1.5 text-xs bg-white/10 hover:bg-white/20 rounded-lg text-gray-300 transition-colors"
+          >
+            取消
+          </button>
+          <button
+            onClick={handleConfirm}
+            className="px-4 py-1.5 text-xs bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-700 hover:to-pink-700 text-white rounded-lg transition-colors"
+          >
+            确定
+          </button>
+        </div>
+      </div>
+    </div>
+  );
+}
--- a/frontend/src/features/home/ui/FloatingStylePreview.tsx
+++ b/frontend/src/features/home/ui/FloatingStylePreview.tsx
@@ -0,0 +1,226 @@
+import { useEffect } from "react";
+import { createPortal } from "react-dom";
+import { X } from "lucide-react";
+
+interface SubtitleStyleOption {
+  id: string;
+  label: string;
+  font_family?: string;
+  font_file?: string;
+  font_size?: number;
+  highlight_color?: string;
+  normal_color?: string;
+  stroke_color?: string;
+  stroke_size?: number;
+  letter_spacing?: number;
+  bottom_margin?: number;
+  is_default?: boolean;
+}
+
+interface TitleStyleOption {
+  id: string;
+  label: string;
+  font_family?: string;
+  font_file?: string;
+  font_size?: number;
+  color?: string;
+  stroke_color?: string;
+  stroke_size?: number;
+  letter_spacing?: number;
+  font_weight?: number;
+  top_margin?: number;
+  is_default?: boolean;
+}
+
+interface FloatingStylePreviewProps {
+  onClose: () => void;
+  videoTitle: string;
+  titleStyles: TitleStyleOption[];
+  selectedTitleStyleId: string;
+  titleFontSize: number;
+  subtitleStyles: SubtitleStyleOption[];
+  selectedSubtitleStyleId: string;
+  subtitleFontSize: number;
+  titleTopMargin: number;
+  subtitleBottomMargin: number;
+  enableSubtitles: boolean;
+  resolveAssetUrl: (path?: string | null) => string | null;
+  getFontFormat: (fontFile?: string) => string;
+  buildTextShadow: (color: string, size: number) => string;
+  previewBaseWidth: number;
+  previewBaseHeight: number;
+}
+
+const DESKTOP_WIDTH = 280;
+
+export function FloatingStylePreview({
+  onClose,
+  videoTitle,
+  titleStyles,
+  selectedTitleStyleId,
+  titleFontSize,
+  subtitleStyles,
+  selectedSubtitleStyleId,
+  subtitleFontSize,
+  titleTopMargin,
+  subtitleBottomMargin,
+  enableSubtitles,
+  resolveAssetUrl,
+  getFontFormat,
+  buildTextShadow,
+  previewBaseWidth,
+  previewBaseHeight,
+}: FloatingStylePreviewProps) {
+  const isMobile = typeof window !== "undefined" && window.innerWidth < 640;
+  const windowWidth = isMobile
+    ? Math.min(window.innerWidth - 32, 360)
+    : DESKTOP_WIDTH;
+
+  useEffect(() => {
+    const handleKeyDown = (e: KeyboardEvent) => {
+      if (e.key === "Escape") onClose();
+    };
+    window.addEventListener("keydown", handleKeyDown);
+    return () => window.removeEventListener("keydown", handleKeyDown);
+  }, [onClose]);
+
+  const previewScale = windowWidth / previewBaseWidth;
+  const previewHeight = previewBaseHeight * previewScale;
+
+  const activeSubtitleStyle = subtitleStyles.find((s) => s.id === selectedSubtitleStyleId)
+    || subtitleStyles.find((s) => s.is_default)
+    || subtitleStyles[0];
+
+  const activeTitleStyle = titleStyles.find((s) => s.id === selectedTitleStyleId)
+    || titleStyles.find((s) => s.is_default)
+    || titleStyles[0];
+
+  const previewTitleText = videoTitle.trim() || "这里是标题预览";
+  const subtitleHighlightText = "最近，一个叫Cloudbot";
+  const subtitleNormalText = "的开源项目在GitHub上彻底火了";
+
+  const subtitleHighlightColor = activeSubtitleStyle?.highlight_color || "#FFE600";
+  const subtitleNormalColor = activeSubtitleStyle?.normal_color || "#FFFFFF";
+  const subtitleStrokeColor = activeSubtitleStyle?.stroke_color || "#000000";
+  const subtitleStrokeSize = activeSubtitleStyle?.stroke_size ?? 3;
+  const subtitleLetterSpacing = activeSubtitleStyle?.letter_spacing ?? 2;
+  const subtitleFontFamilyName = `SubtitlePreview-${activeSubtitleStyle?.id || "default"}`;
+  const subtitleFontUrl = activeSubtitleStyle?.font_file
+    ? resolveAssetUrl(`fonts/${activeSubtitleStyle.font_file}`)
+    : null;
+
+  const titleColor = activeTitleStyle?.color || "#FFFFFF";
+  const titleStrokeColor = activeTitleStyle?.stroke_color || "#000000";
+  const titleStrokeSize = activeTitleStyle?.stroke_size ?? 8;
+  const titleLetterSpacing = activeTitleStyle?.letter_spacing ?? 4;
+  const titleFontWeight = activeTitleStyle?.font_weight ?? 900;
+  const titleFontFamilyName = `TitlePreview-${activeTitleStyle?.id || "default"}`;
+  const titleFontUrl = activeTitleStyle?.font_file
+    ? resolveAssetUrl(`fonts/${activeTitleStyle.font_file}`)
+    : null;
+
+  const content = (
+    <div
+      style={{
+        position: "fixed",
+        left: "16px",
+        top: "16px",
+        width: `${windowWidth}px`,
+        zIndex: 150,
+        maxHeight: "calc(100dvh - 32px)",
+        overflow: "hidden",
+      }}
+      className="rounded-xl border border-white/20 bg-gray-900/95 backdrop-blur-md shadow-2xl"
+    >
+      {/* 标题栏 */}
+      <div
+        className="flex items-center justify-between px-3 py-2 border-b border-white/10 select-none"
+      >
+        <div className="flex items-center gap-2 text-sm text-gray-300">
+          <span>样式预览</span>
+        </div>
+        <button
+          onClick={onClose}
+          className="p-1 rounded hover:bg-white/10 text-gray-400 hover:text-white transition-colors"
+        >
+          <X className="h-4 w-4" />
+        </button>
+      </div>
+
+      {/* 预览内容 */}
+      <div
+        className="relative overflow-hidden rounded-b-xl"
+        style={{ height: `${previewHeight}px` }}
+      >
+        {(titleFontUrl || subtitleFontUrl) && (
+          <style>{`
+            ${titleFontUrl ? `@font-face { font-family: '${titleFontFamilyName}'; src: url('${titleFontUrl}') format('${getFontFormat(activeTitleStyle?.font_file)}'); font-weight: 400; font-style: normal; }` : ''}
+            ${subtitleFontUrl ? `@font-face { font-family: '${subtitleFontFamilyName}'; src: url('${subtitleFontUrl}') format('${getFontFormat(activeSubtitleStyle?.font_file)}'); font-weight: 400; font-style: normal; }` : ''}
+          `}</style>
+        )}
+        <div className="absolute inset-0 opacity-20 bg-gradient-to-br from-purple-500/40 via-transparent to-pink-500/30" />
+        <div
+          className="absolute top-0 left-0"
+          style={{
+            width: `${previewBaseWidth}px`,
+            height: `${previewBaseHeight}px`,
+            transform: `scale(${previewScale})`,
+            transformOrigin: 'top left',
+          }}
+        >
+          <div
+            className="w-full text-center"
+            style={{
+              position: 'absolute',
+              top: `${titleTopMargin}px`,
+              left: 0,
+              right: 0,
+              color: titleColor,
+              fontSize: `${titleFontSize}px`,
+              fontWeight: titleFontWeight,
+              fontFamily: titleFontUrl
+                ? `'${titleFontFamilyName}', "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans SC", sans-serif`
+                : '"PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans SC", sans-serif',
+              textShadow: buildTextShadow(titleStrokeColor, titleStrokeSize),
+              letterSpacing: `${titleLetterSpacing}px`,
+              lineHeight: 1.2,
+              opacity: videoTitle.trim() ? 1 : 0.7,
+              padding: '0 5%',
+            }}
+          >
+            {previewTitleText}
+          </div>
+
+          <div
+            className="w-full text-center"
+            style={{
+              position: 'absolute',
+              bottom: `${subtitleBottomMargin}px`,
+              left: 0,
+              right: 0,
+              fontSize: `${subtitleFontSize}px`,
+              fontFamily: subtitleFontUrl
+                ? `'${subtitleFontFamilyName}', "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans SC", sans-serif`
+                : '"PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans SC", sans-serif',
+              textShadow: buildTextShadow(subtitleStrokeColor, subtitleStrokeSize),
+              letterSpacing: `${subtitleLetterSpacing}px`,
+              lineHeight: 1.35,
+              padding: '0 6%',
+            }}
+          >
+            {enableSubtitles ? (
+              <>
+                <span style={{ color: subtitleHighlightColor }}>{subtitleHighlightText}</span>
+                <span style={{ color: subtitleNormalColor }}>{subtitleNormalText}</span>
+              </>
+            ) : (
+              <span className="text-gray-400 text-sm">字幕已关闭</span>
+            )}
+          </div>
+        </div>
+      </div>
+    </div>
+  );
+
+  return createPortal(content, document.body);
+}
--- a/frontend/src/features/home/ui/GenerateActionBar.tsx
+++ b/frontend/src/features/home/ui/GenerateActionBar.tsx
@@ -4,6 +4,7 @@ interface GenerateActionBarProps {
  isGenerating: boolean;
  progress: number;
  disabled: boolean;
+  materialCount?: number;
  onGenerate: () => void;
 }

@@ -11,43 +12,51 @@ export function GenerateActionBar({
  isGenerating,
  progress,
  disabled,
+  materialCount = 1,
  onGenerate,
 }: GenerateActionBarProps) {
  return (
-    <button
-      onClick={onGenerate}
-      disabled={disabled}
-      className={`w-full py-4 rounded-xl font-bold text-lg transition-all ${disabled
-        ? "bg-gray-600 cursor-not-allowed text-gray-400"
-        : "bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-700 hover:to-pink-700 text-white shadow-lg hover:shadow-purple-500/25"
-        }`}
-    >
-      {isGenerating ? (
-        <span className="flex items-center justify-center gap-3">
-          <svg className="animate-spin h-5 w-5" viewBox="0 0 24 24">
-            <circle
-              className="opacity-25"
-              cx="12"
-              cy="12"
-              r="10"
-              stroke="currentColor"
-              strokeWidth="4"
-              fill="none"
-            />
-            <path
-              className="opacity-75"
-              fill="currentColor"
-              d="M4 12a8 8 0 018-8V0C5.373 0 0 5.373 0 12h4z"
-            />
-          </svg>
-          生成中... {progress}%
-        </span>
-      ) : (
-        <span className="flex items-center justify-center gap-2">
-          <Rocket className="h-5 w-5" />
-          生成视频
-        </span>
+    <div>
+      <button
+        onClick={onGenerate}
+        disabled={disabled}
+        className={`w-full py-4 rounded-xl font-bold text-lg transition-all ${disabled
+          ? "bg-gray-600 cursor-not-allowed text-gray-400"
+          : "bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-700 hover:to-pink-700 text-white shadow-lg hover:shadow-purple-500/25"
+          }`}
+      >
+        {isGenerating ? (
+          <span className="flex items-center justify-center gap-3">
+            <svg className="animate-spin h-5 w-5" viewBox="0 0 24 24">
+              <circle
+                className="opacity-25"
+                cx="12"
+                cy="12"
+                r="10"
+                stroke="currentColor"
+                strokeWidth="4"
+                fill="none"
+              />
+              <path
+                className="opacity-75"
+                fill="currentColor"
+                d="M4 12a8 8 0 018-8V0C5.373 0 0 5.373 0 12h4z"
+              />
+            </svg>
+            生成中... {progress}%
+          </span>
+        ) : (
+          <span className="flex items-center justify-center gap-2">
+            <Rocket className="h-5 w-5" />
+            生成视频
+          </span>
+        )}
+      </button>
+      {!isGenerating && materialCount >= 2 && (
+        <p className="text-xs text-gray-400 text-center mt-1.5">
+          多素材模式 ({materialCount} 个机位)，生成耗时较长
+        </p>
      )}
-    </button>
+    </div>
  );
 }
--- a/frontend/src/features/home/ui/GeneratedAudiosPanel.tsx
+++ b/frontend/src/features/home/ui/GeneratedAudiosPanel.tsx
@@ -0,0 +1,224 @@
+import { useState, useRef, useCallback, useEffect } from "react";
+import { Play, Pause, Pencil, Trash2, Check, X, RefreshCw, Mic } from "lucide-react";
+import type { GeneratedAudio } from "@/features/home/model/useGeneratedAudios";
+
+interface AudioTask {
+  status: string;
+  progress?: number;
+  message?: string;
+}
+
+interface GeneratedAudiosPanelProps {
+  generatedAudios: GeneratedAudio[];
+  selectedAudioId: string | null;
+  isGeneratingAudio: boolean;
+  audioTask: AudioTask | null;
+  onGenerateAudio: () => void;
+  onRefresh: () => void;
+  onSelectAudio: (audio: GeneratedAudio) => void;
+  onDeleteAudio: (id: string) => void;
+  onRenameAudio: (id: string, newName: string) => void;
+  hasText: boolean;
+}
+
+export function GeneratedAudiosPanel({
+  generatedAudios,
+  selectedAudioId,
+  isGeneratingAudio,
+  audioTask,
+  onGenerateAudio,
+  onRefresh,
+  onSelectAudio,
+  onDeleteAudio,
+  onRenameAudio,
+  hasText,
+}: GeneratedAudiosPanelProps) {
+  const [editingId, setEditingId] = useState<string | null>(null);
+  const [editName, setEditName] = useState("");
+  const [playingId, setPlayingId] = useState<string | null>(null);
+  const audioRef = useRef<HTMLAudioElement | null>(null);
+
+  const stopPlaying = useCallback(() => {
+    if (audioRef.current) {
+      audioRef.current.pause();
+      audioRef.current.currentTime = 0;
+      audioRef.current = null;
+    }
+    setPlayingId(null);
+  }, []);
+
+  // Cleanup on unmount
+  useEffect(() => {
+    return () => {
+      if (audioRef.current) {
+        audioRef.current.pause();
+        audioRef.current = null;
+      }
+    };
+  }, []);
+
+  const togglePlay = (audio: GeneratedAudio, e: React.MouseEvent) => {
+    e.stopPropagation();
+    if (playingId === audio.id) {
+      stopPlaying();
+      return;
+    }
+    stopPlaying();
+    const player = new Audio(audio.path);
+    player.onended = () => setPlayingId(null);
+    player.play().catch(() => {});
+    audioRef.current = player;
+    setPlayingId(audio.id);
+  };
+
+  const startEditing = (audio: GeneratedAudio, e: React.MouseEvent) => {
+    e.stopPropagation();
+    setEditingId(audio.id);
+    setEditName(audio.name);
+  };
+
+  const saveEditing = (audioId: string, e: React.MouseEvent) => {
+    e.stopPropagation();
+    if (!editName.trim()) return;
+    onRenameAudio(audioId, editName.trim());
+    setEditingId(null);
+    setEditName("");
+  };
+
+  const cancelEditing = (e: React.MouseEvent) => {
+    e.stopPropagation();
+    setEditingId(null);
+    setEditName("");
+  };
+
+  return (
+    <div className="bg-white/5 rounded-2xl p-4 sm:p-6 border border-white/10 backdrop-blur-sm">
+      <div className="flex justify-between items-center gap-2 mb-4">
+        <h2 className="text-base sm:text-lg font-semibold text-white flex items-center gap-2 whitespace-nowrap">
+          <Mic className="h-4 w-4 text-purple-400" />
+          配音列表
+        </h2>
+        <div className="flex gap-1.5">
+          <button
+            onClick={onGenerateAudio}
+            disabled={isGeneratingAudio || !hasText}
+            className={`px-2 py-1 text-xs rounded transition-all whitespace-nowrap flex items-center gap-1 ${
+              isGeneratingAudio || !hasText
+                ? "bg-gray-600 cursor-not-allowed text-gray-400"
+                : "bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-700 hover:to-pink-700 text-white"
+            }`}
+          >
+            <Mic className="h-3.5 w-3.5" />
+            生成配音
+          </button>
+          <button
+            onClick={onRefresh}
+            className="px-2 py-1 text-xs bg-white/10 hover:bg-white/20 rounded text-gray-300 whitespace-nowrap flex items-center gap-1"
+          >
+            <RefreshCw className="h-3.5 w-3.5" />
+          </button>
+        </div>
+      </div>
+
+      {/* 生成进度 */}
+      {isGeneratingAudio && audioTask && (
+        <div className="mb-4 p-3 bg-purple-500/10 rounded-xl border border-purple-500/30">
+          <div className="flex justify-between text-sm text-purple-300 mb-2">
+            <span>{audioTask.message || "生成中..."}</span>
+            <span>{audioTask.progress || 0}%</span>
+          </div>
+          <div className="h-2 bg-black/30 rounded-full overflow-hidden">
+            <div
+              className="h-full bg-gradient-to-r from-purple-500 to-pink-500 transition-all duration-300"
+              style={{ width: `${audioTask.progress || 0}%` }}
+            />
+          </div>
+        </div>
+      )}
+
+      {/* 配音列表 */}
+      {generatedAudios.length === 0 ? (
+        <div className="text-center py-6 text-gray-400">
+          <p className="text-sm">暂无配音</p>
+          <p className="text-xs mt-1 text-gray-500">点击「生成配音」创建</p>
+        </div>
+      ) : (
+        <div className="space-y-2 max-h-48 sm:max-h-56 overflow-y-auto hide-scrollbar">
+          {generatedAudios.map((audio) => {
+            const isSelected = selectedAudioId === audio.id;
+            return (
+              <div
+                key={audio.id}
+                onClick={() => onSelectAudio(audio)}
+                className={`p-3 rounded-lg border transition-all cursor-pointer flex items-center justify-between group ${
+                  isSelected
+                    ? "border-purple-500 bg-purple-500/20"
+                    : "border-white/10 bg-white/5 hover:border-white/30"
+                }`}
+              >
+                {editingId === audio.id ? (
+                  <div className="flex-1 flex items-center gap-2" onClick={(e) => e.stopPropagation()}>
+                    <input
+                      value={editName}
+                      onChange={(e) => setEditName(e.target.value)}
+                      className="flex-1 bg-black/40 border border-white/20 rounded-md px-2 py-1 text-xs text-white"
+                      autoFocus
+                      onKeyDown={(e) => {
+                        if (e.key === "Enter") saveEditing(audio.id, e as unknown as React.MouseEvent);
+                        if (e.key === "Escape") cancelEditing(e as unknown as React.MouseEvent);
+                      }}
+                    />
+                    <button onClick={(e) => saveEditing(audio.id, e)} className="p-1 text-green-400 hover:text-green-300" title="保存">
+                      <Check className="h-4 w-4" />
+                    </button>
+                    <button onClick={cancelEditing} className="p-1 text-gray-400 hover:text-white" title="取消">
+                      <X className="h-4 w-4" />
+                    </button>
+                  </div>
+                ) : (
+                  <>
+                    <div className="min-w-0 flex-1">
+                      <div className="text-white text-sm truncate">{audio.name}</div>
+                      <div className="text-gray-400 text-xs">{audio.duration_sec.toFixed(1)}s</div>
+                    </div>
+                    <div className="flex items-center gap-1 pl-2 opacity-0 group-hover:opacity-100 transition-opacity">
+                      <button
+                        onClick={(e) => togglePlay(audio, e)}
+                        className="p-1 text-gray-500 hover:text-purple-400 transition-colors"
+                        title={playingId === audio.id ? "暂停" : "播放"}
+                      >
+                        {playingId === audio.id ? (
+                          <Pause className="h-3.5 w-3.5" />
+                        ) : (
+                          <Play className="h-3.5 w-3.5" />
+                        )}
+                      </button>
+                      <button
+                        onClick={(e) => startEditing(audio, e)}
+                        className="p-1 text-gray-500 hover:text-white transition-colors"
+                        title="重命名"
+                      >
+                        <Pencil className="h-3.5 w-3.5" />
+                      </button>
+                      <button
+                        onClick={(e) => {
+                          e.stopPropagation();
+                          onDeleteAudio(audio.id);
+                        }}
+                        className="p-1 text-gray-500 hover:text-red-400 transition-colors"
+                        title="删除"
+                      >
+                        <Trash2 className="h-3.5 w-3.5" />
+                      </button>
+                    </div>
+                  </>
+                )}
+              </div>
+            );
+          })}
+        </div>
+      )}
+
+    </div>
+  );
+}
--- a/frontend/src/features/home/ui/HomePage.tsx
+++ b/frontend/src/features/home/ui/HomePage.tsx
@@ -1,20 +1,24 @@
 "use client";

-import { useEffect } from "react";
+import { useEffect, useMemo } from "react";
 import { useRouter } from "next/navigation";
 import VideoPreviewModal from "@/components/VideoPreviewModal";
-import ScriptExtractionModal from "@/components/ScriptExtractionModal";
+import ScriptExtractionModal from "./ScriptExtractionModal";
 import { useHomeController } from "@/features/home/model/useHomeController";
+import { resolveMediaUrl } from "@/shared/lib/media";
 import { BgmPanel } from "@/features/home/ui/BgmPanel";
 import { GenerateActionBar } from "@/features/home/ui/GenerateActionBar";
 import { HistoryList } from "@/features/home/ui/HistoryList";
 import { HomeHeader } from "@/features/home/ui/HomeHeader";
 import { MaterialSelector } from "@/features/home/ui/MaterialSelector";
+import { TimelineEditor } from "@/features/home/ui/TimelineEditor";
+import { ClipTrimmer } from "@/features/home/ui/ClipTrimmer";
 import { PreviewPanel } from "@/features/home/ui/PreviewPanel";
 import { RefAudioPanel } from "@/features/home/ui/RefAudioPanel";
 import { ScriptEditor } from "@/features/home/ui/ScriptEditor";
 import { TitleSubtitlePanel } from "@/features/home/ui/TitleSubtitlePanel";
 import { VoiceSelector } from "@/features/home/ui/VoiceSelector";
+import { GeneratedAudiosPanel } from "@/features/home/ui/GeneratedAudiosPanel";

 export function HomePage() {
  const router = useRouter();
@@ -34,8 +38,8 @@ export function HomePage() {
    fetchMaterials,
    deleteMaterial,
    handleUpload,
-    selectedMaterial,
-    setSelectedMaterial,
+    selectedMaterials,
+    toggleMaterial,
    handlePreviewMaterial,
    editingMaterialId,
    editMaterialName,
@@ -49,6 +53,13 @@ export function HomePage() {
    setExtractModalOpen,
    handleGenerateMeta,
    isGeneratingMeta,
+    handleTranslate,
+    isTranslating,
+    originalText,
+    handleRestoreOriginal,
+    savedScripts,
+    handleSaveScript,
+    deleteSavedScript,
    showStylePreview,
    setShowStylePreview,
    videoTitle,
@@ -65,14 +76,14 @@ export function HomePage() {
    subtitleFontSize,
    setSubtitleFontSize,
    setSubtitleSizeLocked,
-    enableSubtitles,
-    setEnableSubtitles,
+    titleTopMargin,
+    setTitleTopMargin,
+    subtitleBottomMargin,
+    setSubtitleBottomMargin,
    resolveAssetUrl,
    getFontFormat,
    buildTextShadow,
-    previewContainerWidth,
    materialDimensions,
-    titlePreviewContainerRef,
    ttsMode,
    setTtsMode,
    voices,
@@ -128,12 +139,40 @@ export function HomePage() {
    fetchGeneratedVideos,
    registerVideoRef,
    formatDate,
+    generatedAudios,
+    selectedAudio,
+    selectedAudioId,
+    isGeneratingAudio,
+    audioTask,
+    fetchGeneratedAudios,
+    handleGenerateAudio,
+    deleteAudio,
+    renameAudio,
+    selectAudio,
+    timelineSegments,
+    reorderSegments,
+    setSourceRange,
+    clipTrimmerOpen,
+    setClipTrimmerOpen,
+    clipTrimmerSegmentId,
+    setClipTrimmerSegmentId,
  } = useHomeController();

  useEffect(() => {
    router.prefetch("/publish");
  }, [router]);

+  const clipTrimmerSegment = useMemo(
+    () => timelineSegments.find((s) => s.id === clipTrimmerSegmentId) ?? null,
+    [timelineSegments, clipTrimmerSegmentId]
+  );
+
+  const clipTrimmerMaterialUrl = useMemo(() => {
+    if (!clipTrimmerSegment) return null;
+    const mat = materials.find((m) => m.id === clipTrimmerSegment.materialId);
+    return mat?.path ? resolveMediaUrl(mat.path) : null;
+  }, [clipTrimmerSegment, materials]);
+
  return (
    <div className="min-h-dvh">
      <HomeHeader />
@@ -142,42 +181,24 @@ export function HomePage() {
        <div className="grid grid-cols-1 lg:grid-cols-2 gap-8">
          {/* 左侧: 输入区域 */}
          <div className="space-y-6">
-            {/* 素材选择 */}
-            <MaterialSelector
-              materials={materials}
-              selectedMaterial={selectedMaterial}
-              isFetching={isFetching}
-              lastMaterialCount={lastMaterialCount}
-              editingMaterialId={editingMaterialId}
-              editMaterialName={editMaterialName}
-              isUploading={isUploading}
-              uploadProgress={uploadProgress}
-              uploadError={uploadError}
-              fetchError={fetchError}
-              apiBase={apiBase}
-              onUploadChange={handleUpload}
-              onRefresh={fetchMaterials}
-              onSelectMaterial={setSelectedMaterial}
-              onPreviewMaterial={handlePreviewMaterial}
-              onStartEditing={startMaterialEditing}
-              onEditNameChange={setEditMaterialName}
-              onSaveEditing={saveMaterialEditing}
-              onCancelEditing={cancelMaterialEditing}
-              onDeleteMaterial={deleteMaterial}
-              onClearUploadError={() => setUploadError(null)}
-              registerMaterialRef={registerMaterialRef}
-            />
-
-            {/* 文案输入 */}
+            {/* 1. 文案输入 */}
            <ScriptEditor
              text={text}
              onChangeText={setText}
              onOpenExtractModal={() => setExtractModalOpen(true)}
              onGenerateMeta={handleGenerateMeta}
              isGeneratingMeta={isGeneratingMeta}
+              onTranslate={handleTranslate}
+              isTranslating={isTranslating}
+              hasOriginalText={originalText !== null}
+              onRestoreOriginal={handleRestoreOriginal}
+              savedScripts={savedScripts}
+              onSaveScript={handleSaveScript}
+              onLoadScript={setText}
+              onDeleteScript={deleteSavedScript}
            />

-            {/* 标题和字幕设置 */}
+            {/* 2. 标题和字幕设置 */}
            <TitleSubtitlePanel
              showStylePreview={showStylePreview}
              onTogglePreview={() => setShowStylePreview((prev) => !prev)}
@@ -201,23 +222,18 @@ export function HomePage() {
                setSubtitleFontSize(value);
                setSubtitleSizeLocked(true);
              }}
-              enableSubtitles={enableSubtitles}
-              onToggleSubtitles={setEnableSubtitles}
+              titleTopMargin={titleTopMargin}
+              onTitleTopMarginChange={setTitleTopMargin}
+              subtitleBottomMargin={subtitleBottomMargin}
+              onSubtitleBottomMarginChange={setSubtitleBottomMargin}
              resolveAssetUrl={resolveAssetUrl}
              getFontFormat={getFontFormat}
              buildTextShadow={buildTextShadow}
-              previewScale={previewContainerWidth && (materialDimensions?.width || 1280)
-                ? previewContainerWidth / (materialDimensions?.width || 1280)
-                : 1}
-              previewAspectRatio={materialDimensions
-                ? `${materialDimensions.width} / ${materialDimensions.height}`
-                : "16 / 9"}
-              previewBaseWidth={materialDimensions?.width || 1280}
-              previewBaseHeight={materialDimensions?.height || 720}
-              previewContainerRef={titlePreviewContainerRef}
+              previewBaseWidth={materialDimensions?.width || 1080}
+              previewBaseHeight={materialDimensions?.height || 1920}
            />

-            {/* 配音方式选择 */}
+            {/* 3. 配音方式选择 */}
            <VoiceSelector
              ttsMode={ttsMode}
              onSelectTtsMode={setTtsMode}
@@ -255,7 +271,69 @@ export function HomePage() {
              )}
            />

-            {/* 背景音乐 */}
+            {/* 4. 配音列表 */}
+            <GeneratedAudiosPanel
+              generatedAudios={generatedAudios}
+              selectedAudioId={selectedAudioId}
+              isGeneratingAudio={isGeneratingAudio}
+              audioTask={audioTask}
+              onGenerateAudio={handleGenerateAudio}
+              onRefresh={() => fetchGeneratedAudios()}
+              onSelectAudio={selectAudio}
+              onDeleteAudio={deleteAudio}
+              onRenameAudio={renameAudio}
+              hasText={!!text.trim()}
+            />
+
+            {/* 5. 视频素材 */}
+            <MaterialSelector
+                materials={materials}
+                selectedMaterials={selectedMaterials}
+                isFetching={isFetching}
+                lastMaterialCount={lastMaterialCount}
+                editingMaterialId={editingMaterialId}
+                editMaterialName={editMaterialName}
+                isUploading={isUploading}
+                uploadProgress={uploadProgress}
+                uploadError={uploadError}
+                fetchError={fetchError}
+                apiBase={apiBase}
+                onUploadChange={handleUpload}
+                onRefresh={fetchMaterials}
+                onToggleMaterial={toggleMaterial}
+                onPreviewMaterial={handlePreviewMaterial}
+                onStartEditing={startMaterialEditing}
+                onEditNameChange={setEditMaterialName}
+                onSaveEditing={saveMaterialEditing}
+                onCancelEditing={cancelMaterialEditing}
+                onDeleteMaterial={deleteMaterial}
+                onClearUploadError={() => setUploadError(null)}
+                registerMaterialRef={registerMaterialRef}
+              />
+
+            {/* 5.5 时间轴编辑器 — 未选配音/素材时模糊遮挡 */}
+            <div className="relative">
+              {(!selectedAudio || selectedMaterials.length === 0) && (
+                <div className="absolute inset-0 bg-black/50 backdrop-blur-sm rounded-2xl flex items-center justify-center z-10">
+                  <p className="text-gray-400">
+                    {!selectedAudio ? "请先生成并选中配音" : "请先选择素材"}
+                  </p>
+                </div>
+              )}
+              <TimelineEditor
+                audioDuration={selectedAudio?.duration_sec ?? 0}
+                audioUrl={selectedAudio ? (resolveMediaUrl(selectedAudio.path) || "") : ""}
+                segments={timelineSegments}
+                materials={materials}
+                onReorderSegment={reorderSegments}
+                onClickSegment={(seg) => {
+                  setClipTrimmerSegmentId(seg.id);
+                  setClipTrimmerOpen(true);
+                }}
+              />
+            </div>
+
+            {/* 6. 背景音乐 */}
            <BgmPanel
              bgmList={bgmList}
              bgmLoading={bgmLoading}
@@ -273,11 +351,12 @@ export function HomePage() {
              registerBgmItemRef={registerBgmItemRef}
            />

-            {/* 生成按钮 */}
+            {/* 7. 生成按钮 */}
            <GenerateActionBar
              isGenerating={isGenerating}
              progress={currentTask?.progress || 0}
-              disabled={isGenerating || !selectedMaterial || (ttsMode === "voiceclone" && !selectedRefAudio)}
+              materialCount={selectedMaterials.length}
+              disabled={isGenerating || selectedMaterials.length === 0 || !selectedAudio}
              onGenerate={handleGenerate}
            />
          </div>
@@ -313,6 +392,19 @@ export function HomePage() {
        onClose={() => setExtractModalOpen(false)}
        onApply={(nextText) => setText(nextText)}
      />
+
+      <ClipTrimmer
+        isOpen={clipTrimmerOpen}
+        segment={clipTrimmerSegment}
+        materialUrl={clipTrimmerMaterialUrl}
+        onConfirm={(sourceStart, sourceEnd) => {
+          if (clipTrimmerSegmentId) {
+            setSourceRange(clipTrimmerSegmentId, sourceStart, sourceEnd);
+          }
+          setClipTrimmerOpen(false);
+        }}
+        onClose={() => setClipTrimmerOpen(false)}
+      />
    </div>
  );
 }
--- a/frontend/src/features/home/ui/MaterialSelector.tsx
+++ b/frontend/src/features/home/ui/MaterialSelector.tsx
@@ -1,17 +1,10 @@
-import type { ChangeEvent, MouseEvent } from "react";
+import { type ChangeEvent, type MouseEvent } from "react";
 import { Upload, RefreshCw, Eye, Trash2, X, Pencil, Check } from "lucide-react";
-
-interface Material {
-  id: string;
-  name: string;
-  scene: string;
-  size_mb: number;
-  path: string;
-}
+import type { Material } from "@/shared/types/material";

 interface MaterialSelectorProps {
  materials: Material[];
-  selectedMaterial: string;
+  selectedMaterials: string[];
  isFetching: boolean;
  lastMaterialCount: number;
  editingMaterialId: string | null;
@@ -23,7 +16,7 @@ interface MaterialSelectorProps {
  apiBase: string;
  onUploadChange: (event: ChangeEvent<HTMLInputElement>) => void;
  onRefresh: () => void;
-  onSelectMaterial: (id: string) => void;
+  onToggleMaterial: (id: string) => void;
  onPreviewMaterial: (path: string) => void;
  onStartEditing: (material: Material, event: MouseEvent) => void;
  onEditNameChange: (value: string) => void;
@@ -36,7 +29,7 @@ interface MaterialSelectorProps {

 export function MaterialSelector({
  materials,
-  selectedMaterial,
+  selectedMaterials,
  isFetching,
  lastMaterialCount,
  editingMaterialId,
@@ -48,7 +41,7 @@ export function MaterialSelector({
  apiBase,
  onUploadChange,
  onRefresh,
-  onSelectMaterial,
+  onToggleMaterial,
  onPreviewMaterial,
  onStartEditing,
  onEditNameChange,
@@ -58,20 +51,23 @@ export function MaterialSelector({
  onClearUploadError,
  registerMaterialRef,
 }: MaterialSelectorProps) {
+  const selectedSet = new Set(selectedMaterials);
+  const isFull = selectedMaterials.length >= 4;
+
  return (
    <div className="bg-white/5 rounded-2xl p-4 sm:p-6 border border-white/10 backdrop-blur-sm">
      <div className="flex justify-between items-center gap-2 mb-4">
        <h2 className="text-base sm:text-lg font-semibold text-white flex items-center gap-2 whitespace-nowrap">
          📹 视频素材
          <span className="ml-1 text-[11px] sm:text-xs text-gray-400/90 font-normal">
-            (上传自拍视频)
+            (可多选，最多4个)
          </span>
        </h2>
        <div className="flex gap-1.5">
          <input
            type="file"
            id="video-upload"
-            accept=".mp4,.mov,.avi"
+            accept="video/*"
            onChange={onUploadChange}
            className="hidden"
          />
@@ -126,7 +122,7 @@ export function MaterialSelector({
          API: {apiBase}/api/materials/
        </div>
      ) : isFetching && materials.length === 0 ? (
-        <div className="space-y-2 max-h-64 overflow-y-auto hide-scrollbar" style={{ contentVisibility: 'auto' }}>
+        <div className="space-y-2 max-h-48 sm:max-h-64 overflow-y-auto hide-scrollbar" style={{ contentVisibility: 'auto' }}>
          {Array.from({ length: Math.min(4, Math.max(1, lastMaterialCount || 1)) }).map((_, index) => (
            <div
              key={`material-skeleton-${index}`}
@@ -147,82 +143,98 @@ export function MaterialSelector({
        </div>
      ) : (
        <div
-          className="space-y-2 max-h-64 overflow-y-auto hide-scrollbar"
+          className="space-y-2 max-h-48 sm:max-h-64 overflow-y-auto hide-scrollbar"
          style={{ contentVisibility: 'auto' }}
        >
-          {materials.map((m) => (
-            <div
-              key={m.id}
-              ref={(el) => registerMaterialRef(m.id, el)}
-              className={`p-3 rounded-lg border transition-all flex items-center justify-between group ${selectedMaterial === m.id
-                ? "border-purple-500 bg-purple-500/20"
-                : "border-white/10 bg-white/5 hover:border-white/30"
-                }`}
-            >
-              {editingMaterialId === m.id ? (
-                <div className="flex-1 flex items-center gap-2" onClick={(e) => e.stopPropagation()}>
-                  <input
-                    value={editMaterialName}
-                    onChange={(e) => onEditNameChange(e.target.value)}
-                    className="flex-1 bg-black/40 border border-white/20 rounded-md px-2 py-1 text-xs text-white"
-                    autoFocus
-                  />
-                  <button
-                    onClick={(e) => onSaveEditing(m.id, e)}
-                    className="p-1 text-green-400 hover:text-green-300"
-                    title="保存"
-                  >
-                    <Check className="h-4 w-4" />
-                  </button>
-                  <button
-                    onClick={onCancelEditing}
-                    className="p-1 text-gray-400 hover:text-white"
-                    title="取消"
-                  >
-                    <X className="h-4 w-4" />
-                  </button>
-                </div>
-              ) : (
-                <button onClick={() => onSelectMaterial(m.id)} className="flex-1 text-left">
-                  <div className="text-white text-sm truncate">{m.scene || m.name}</div>
-                  <div className="text-gray-400 text-xs">{m.size_mb.toFixed(1)} MB</div>
-                </button>
-              )}
-              <div className="flex items-center gap-2 pl-2">
-                <button
-                  onClick={(e) => {
-                    e.stopPropagation();
-                    if (m.path) {
-                      onPreviewMaterial(m.path);
-                    }
-                  }}
-                  className="p-1 text-gray-500 hover:text-white opacity-0 group-hover:opacity-100 transition-opacity"
-                  title="预览视频"
-                >
-                  <Eye className="h-4 w-4" />
-                </button>
-                {editingMaterialId !== m.id && (
-                  <button
-                    onClick={(e) => onStartEditing(m, e)}
-                    className="p-1 text-gray-500 hover:text-white opacity-0 group-hover:opacity-100 transition-opacity"
-                    title="重命名"
-                  >
-                    <Pencil className="h-4 w-4" />
+          {materials.map((m) => {
+            const isSelected = selectedSet.has(m.id);
+            return (
+              <div
+                key={m.id}
+                ref={(el) => registerMaterialRef(m.id, el)}
+                className={`p-3 rounded-lg border transition-all flex items-center justify-between group ${isSelected
+                  ? "border-purple-500 bg-purple-500/20"
+                  : isFull
+                    ? "border-white/5 bg-white/[0.02] opacity-50 cursor-not-allowed"
+                    : "border-white/10 bg-white/5 hover:border-white/30"
+                  }`}
+              >
+                {editingMaterialId === m.id ? (
+                  <div className="flex-1 flex items-center gap-2" onClick={(e) => e.stopPropagation()}>
+                    <input
+                      value={editMaterialName}
+                      onChange={(e) => onEditNameChange(e.target.value)}
+                      className="flex-1 bg-black/40 border border-white/20 rounded-md px-2 py-1 text-xs text-white"
+                      autoFocus
+                    />
+                    <button
+                      onClick={(e) => onSaveEditing(m.id, e)}
+                      className="p-1 text-green-400 hover:text-green-300"
+                      title="保存"
+                    >
+                      <Check className="h-4 w-4" />
+                    </button>
+                    <button
+                      onClick={onCancelEditing}
+                      className="p-1 text-gray-400 hover:text-white"
+                      title="取消"
+                    >
+                      <X className="h-4 w-4" />
+                    </button>
+                  </div>
+                ) : (
+                  <button onClick={() => onToggleMaterial(m.id)} className="flex-1 text-left flex items-center gap-2">
+                    {/* 复选框 */}
+                    <span
+                      className={`flex-shrink-0 w-4 h-4 rounded border flex items-center justify-center text-[10px] ${isSelected
+                        ? "border-purple-500 bg-purple-500 text-white"
+                        : "border-white/30 text-transparent"
+                        }`}
+                    >
+                      {isSelected ? "✓" : ""}
+                    </span>
+                    <div className="min-w-0">
+                      <div className="text-white text-sm truncate">{m.scene || m.name}</div>
+                      <div className="text-gray-400 text-xs">{m.size_mb.toFixed(1)} MB</div>
+                    </div>
                  </button>
                )}
-                <button
-                  onClick={(e) => {
-                    e.stopPropagation();
-                    onDeleteMaterial(m.id);
-                  }}
-                  className="p-1 text-gray-500 hover:text-red-400 opacity-0 group-hover:opacity-100 transition-opacity"
-                  title="删除素材"
-                >
-                  <Trash2 className="h-4 w-4" />
-                </button>
+                <div className="flex items-center gap-2 pl-2">
+                  <button
+                    onClick={(e) => {
+                      e.stopPropagation();
+                      if (m.path) {
+                        onPreviewMaterial(m.path);
+                      }
+                    }}
+                    className="p-1 text-gray-500 hover:text-white opacity-0 group-hover:opacity-100 transition-opacity"
+                    title="预览视频"
+                  >
+                    <Eye className="h-4 w-4" />
+                  </button>
+                  {editingMaterialId !== m.id && (
+                    <button
+                      onClick={(e) => onStartEditing(m, e)}
+                      className="p-1 text-gray-500 hover:text-white opacity-0 group-hover:opacity-100 transition-opacity"
+                      title="重命名"
+                    >
+                      <Pencil className="h-4 w-4" />
+                    </button>
+                  )}
+                  <button
+                    onClick={(e) => {
+                      e.stopPropagation();
+                      onDeleteMaterial(m.id);
+                    }}
+                    className="p-1 text-gray-500 hover:text-red-400 opacity-0 group-hover:opacity-100 transition-opacity"
+                    title="删除素材"
+                  >
+                    <Trash2 className="h-4 w-4" />
+                  </button>
+                </div>
              </div>
-            </div>
-          ))}
+            );
+          })}
        </div>
      )}
    </div>
--- a/frontend/src/features/home/ui/RefAudioPanel.tsx
+++ b/frontend/src/features/home/ui/RefAudioPanel.tsx
@@ -70,6 +70,7 @@ export function RefAudioPanel({

  useEffect(() => {
    if (!recordedBlob) {
+      // eslint-disable-next-line react-hooks/set-state-in-effect
      setRecordedUrl(null);
      return;
    }
@@ -162,8 +163,8 @@ export function RefAudioPanel({
                      className="w-full bg-black/50 text-white text-xs px-1 py-0.5 rounded border border-purple-500 focus:outline-none"
                      autoFocus
                      onKeyDown={(e) => {
-                        if (e.key === 'Enter') onSaveEditing(audio.id, e as any);
-                        if (e.key === 'Escape') onCancelEditing(e as any);
+                        if (e.key === 'Enter') onSaveEditing(audio.id, e as unknown as MouseEvent);
+                        if (e.key === 'Escape') onCancelEditing(e as unknown as MouseEvent);
                      }}
                    />
                    <button onClick={(e) => onSaveEditing(audio.id, e)} className="text-green-400 hover:text-green-300 text-xs">
--- a/frontend/src/features/home/ui/ScriptEditor.tsx
+++ b/frontend/src/features/home/ui/ScriptEditor.tsx
@@ -1,4 +1,18 @@
-import { FileText, Loader2, Sparkles } from "lucide-react";
+import { useEffect, useRef, useState } from "react";
+import { FileText, History, Languages, Loader2, RotateCcw, Save, Sparkles, Trash2 } from "lucide-react";
+import type { SavedScript } from "@/features/home/model/useSavedScripts";
+
+const LANGUAGES = [
+  { code: "English", label: "英语 English" },
+  { code: "日本語", label: "日语 日本語" },
+  { code: "한국어", label: "韩语 한국어" },
+  { code: "Français", label: "法语 Français" },
+  { code: "Deutsch", label: "德语 Deutsch" },
+  { code: "Español", label: "西班牙语 Español" },
+  { code: "Русский", label: "俄语 Русский" },
+  { code: "Italiano", label: "意大利语 Italiano" },
+  { code: "Português", label: "葡萄牙语 Português" },
+];

 interface ScriptEditorProps {
  text: string;
@@ -6,6 +20,14 @@ interface ScriptEditorProps {
  onOpenExtractModal: () => void;
  onGenerateMeta: () => void;
  isGeneratingMeta: boolean;
+  onTranslate: (targetLang: string) => void;
+  isTranslating: boolean;
+  hasOriginalText: boolean;
+  onRestoreOriginal: () => void;
+  savedScripts: SavedScript[];
+  onSaveScript: () => void;
+  onLoadScript: (content: string) => void;
+  onDeleteScript: (id: string) => void;
 }

 export function ScriptEditor({
@@ -14,39 +36,176 @@ export function ScriptEditor({
  onOpenExtractModal,
  onGenerateMeta,
  isGeneratingMeta,
+  onTranslate,
+  isTranslating,
+  hasOriginalText,
+  onRestoreOriginal,
+  savedScripts,
+  onSaveScript,
+  onLoadScript,
+  onDeleteScript,
 }: ScriptEditorProps) {
+  const [showLangMenu, setShowLangMenu] = useState(false);
+  const langMenuRef = useRef<HTMLDivElement>(null);
+  const [showHistoryMenu, setShowHistoryMenu] = useState(false);
+  const historyMenuRef = useRef<HTMLDivElement>(null);
+
+  useEffect(() => {
+    if (!showLangMenu) return;
+    const handleClickOutside = (e: MouseEvent) => {
+      if (langMenuRef.current && !langMenuRef.current.contains(e.target as Node)) {
+        setShowLangMenu(false);
+      }
+    };
+    document.addEventListener("mousedown", handleClickOutside);
+    return () => document.removeEventListener("mousedown", handleClickOutside);
+  }, [showLangMenu]);
+
+  useEffect(() => {
+    if (!showHistoryMenu) return;
+    const handleClickOutside = (e: MouseEvent) => {
+      if (historyMenuRef.current && !historyMenuRef.current.contains(e.target as Node)) {
+        setShowHistoryMenu(false);
+      }
+    };
+    document.addEventListener("mousedown", handleClickOutside);
+    return () => document.removeEventListener("mousedown", handleClickOutside);
+  }, [showHistoryMenu]);
+
+  const handleSelectLang = (langCode: string) => {
+    setShowLangMenu(false);
+    onTranslate(langCode);
+  };
+
+  const formatDate = (ts: number) => {
+    const d = new Date(ts);
+    return `${(d.getMonth() + 1).toString().padStart(2, "0")}-${d.getDate().toString().padStart(2, "0")} ${d.getHours().toString().padStart(2, "0")}:${d.getMinutes().toString().padStart(2, "0")}`;
+  };
+
  return (
-    <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
-      <div className="flex justify-between items-center gap-2 mb-4">
-        <h2 className="text-base sm:text-lg font-semibold text-white flex items-center gap-2 whitespace-nowrap">
+    <div className="relative z-10 bg-white/5 rounded-2xl p-4 sm:p-6 border border-white/10 backdrop-blur-sm">
+      <div className="mb-4 space-y-3">
+        <h2 className="text-base sm:text-lg font-semibold text-white flex items-center gap-2">
          ✍️ 文案提取与编辑
        </h2>
-        <div className="flex gap-2">
+        <div className="flex gap-2 flex-wrap justify-end items-center">
+          {/* 历史文案 */}
+          <div className="relative" ref={historyMenuRef}>
+            <button
+              onClick={() => setShowHistoryMenu((prev) => !prev)}
+              className="h-7 px-2.5 text-xs rounded transition-all whitespace-nowrap bg-gray-600 hover:bg-gray-500 text-white inline-flex items-center gap-1"
+            >
+              <History className="h-3.5 w-3.5" />
+              历史文案
+            </button>
+            {showHistoryMenu && (
+              <div className="absolute left-0 top-full mt-1 z-50 bg-gray-800 border border-white/10 rounded-lg shadow-xl py-1 min-w-[220px] max-h-[280px] overflow-y-auto">
+                {savedScripts.length === 0 ? (
+                  <div className="px-3 py-3 text-xs text-gray-500 text-center">暂无保存的文案</div>
+                ) : (
+                  savedScripts.map((script) => (
+                    <div
+                      key={script.id}
+                      className="flex items-center gap-1 px-3 py-1.5 hover:bg-white/10 transition-colors group"
+                    >
+                      <button
+                        onClick={() => {
+                          onLoadScript(script.content);
+                          setShowHistoryMenu(false);
+                        }}
+                        className="flex-1 text-left min-w-0"
+                      >
+                        <div className="text-xs text-gray-200 truncate">{script.name}</div>
+                        <div className="text-[10px] text-gray-500">{formatDate(script.savedAt)}</div>
+                      </button>
+                      <button
+                        onClick={(e) => {
+                          e.stopPropagation();
+                          onDeleteScript(script.id);
+                        }}
+                        className="opacity-0 group-hover:opacity-100 p-1 text-gray-500 hover:text-red-400 transition-all shrink-0"
+                      >
+                        <Trash2 className="h-3 w-3" />
+                      </button>
+                    </div>
+                  ))
+                )}
+              </div>
+            )}
+          </div>
          <button
            onClick={onOpenExtractModal}
-            className="px-2 py-1 text-xs rounded transition-all whitespace-nowrap bg-purple-600 hover:bg-purple-700 text-white flex items-center gap-1"
+            className="h-7 px-2.5 text-xs rounded transition-all whitespace-nowrap bg-purple-600 hover:bg-purple-700 text-white inline-flex items-center gap-1"
          >
            <FileText className="h-3.5 w-3.5" />
            文案提取助手
          </button>
+          <div className="relative" ref={langMenuRef}>
+            <button
+              onClick={() => setShowLangMenu((prev) => !prev)}
+              disabled={isTranslating || !text.trim()}
+              className={`h-7 px-2.5 text-xs rounded transition-all whitespace-nowrap inline-flex items-center gap-1 ${
+                isTranslating || !text.trim()
+                  ? "bg-gray-600 cursor-not-allowed text-gray-400"
+                  : "bg-gradient-to-r from-emerald-600 to-teal-600 hover:from-emerald-700 hover:to-teal-700 text-white"
+              }`}
+            >
+              {isTranslating ? (
+                <>
+                  <Loader2 className="h-3.5 w-3.5 animate-spin" />
+                  翻译中...
+                </>
+              ) : (
+                <>
+                  <Languages className="h-3.5 w-3.5" />
+                  AI多语言
+                </>
+              )}
+            </button>
+            {showLangMenu && (
+              <div className="absolute right-0 top-full mt-1 z-50 bg-gray-800 border border-white/10 rounded-lg shadow-xl py-1 min-w-[160px]">
+                {hasOriginalText && (
+                  <>
+                    <button
+                      onClick={() => { setShowLangMenu(false); onRestoreOriginal(); }}
+                      className="w-full text-left px-3 py-1.5 text-xs text-amber-400 hover:bg-white/10 transition-colors flex items-center gap-1"
+                    >
+                      <RotateCcw className="h-3 w-3" />
+                      还原原文
+                    </button>
+                    <div className="border-t border-white/10 my-1" />
+                  </>
+                )}
+                {LANGUAGES.map((lang) => (
+                  <button
+                    key={lang.code}
+                    onClick={() => handleSelectLang(lang.code)}
+                    className="w-full text-left px-3 py-1.5 text-xs text-gray-200 hover:bg-white/10 transition-colors"
+                  >
+                    {lang.label}
+                  </button>
+                ))}
+              </div>
+            )}
+          </div>
          <button
            onClick={onGenerateMeta}
            disabled={isGeneratingMeta || !text.trim()}
-            className={`px-2 py-1 text-xs rounded transition-all whitespace-nowrap ${isGeneratingMeta || !text.trim()
+            className={`h-7 px-2.5 text-xs rounded transition-all whitespace-nowrap inline-flex items-center gap-1 ${isGeneratingMeta || !text.trim()
              ? "bg-gray-600 cursor-not-allowed text-gray-400"
              : "bg-gradient-to-r from-blue-600 to-cyan-600 hover:from-blue-700 hover:to-cyan-700 text-white"
              }`}
          >
            {isGeneratingMeta ? (
-              <span className="flex items-center gap-1">
+              <>
                <Loader2 className="h-3.5 w-3.5 animate-spin" />
                生成中...
-              </span>
+              </>
            ) : (
-              <span className="flex items-center gap-1">
+              <>
                <Sparkles className="h-3.5 w-3.5" />
                AI生成标题标签
-              </span>
+              </>
            )}
          </button>
        </div>
@@ -57,9 +216,20 @@ export function ScriptEditor({
        placeholder="请输入你想说的话..."
        className="w-full h-40 bg-black/30 border border-white/10 rounded-xl p-4 text-white placeholder-gray-500 resize-none focus:outline-none focus:border-purple-500 transition-colors hide-scrollbar"
      />
-      <div className="flex justify-between mt-2 text-sm text-gray-400">
+      <div className="flex items-center justify-between mt-2 text-sm text-gray-400">
        <span>{text.length} 字</span>
-        <span>预计时长: ~{Math.ceil(text.length / 4)} 秒</span>
+        <button
+          onClick={onSaveScript}
+          disabled={!text.trim()}
+          className={`px-2.5 py-1 text-xs rounded transition-all flex items-center gap-1 ${
+            !text.trim()
+              ? "bg-gray-700 cursor-not-allowed text-gray-500"
+              : "bg-amber-600/80 hover:bg-amber-600 text-white"
+          }`}
+        >
+          <Save className="h-3 w-3" />
+          保存文案
+        </button>
      </div>
    </div>
  );
--- a/frontend/src/features/home/ui/ScriptExtractionModal.tsx
+++ b/frontend/src/features/home/ui/ScriptExtractionModal.tsx
@@ -0,0 +1,333 @@
+"use client";
+
+import { useEffect, useCallback } from "react";
+import { Loader2 } from "lucide-react";
+import { useScriptExtraction } from "./script-extraction/useScriptExtraction";
+
+interface ScriptExtractionModalProps {
+    isOpen: boolean;
+    onClose: () => void;
+    onApply?: (text: string) => void;
+}
+
+export default function ScriptExtractionModal({
+    isOpen,
+    onClose,
+    onApply,
+}: ScriptExtractionModalProps) {
+    const {
+        isLoading,
+        script,
+        rewrittenScript,
+        error,
+        doRewrite,
+        step,
+        dragActive,
+        selectedFile,
+        activeTab,
+        inputUrl,
+        setDoRewrite,
+        setActiveTab,
+        setInputUrl,
+        handleDrag,
+        handleDrop,
+        handleFileChange,
+        handleExtract,
+        copyToClipboard,
+        resetToConfig,
+        clearSelectedFile,
+        clearInputUrl,
+    } = useScriptExtraction({ isOpen });
+
+    // 快捷键：ESC 关闭，Enter 提交（仅在 config 步骤）
+    const canExtract = (activeTab === "file" && selectedFile) || (activeTab === "url" && inputUrl.trim());
+
+    const handleKeyDown = useCallback((e: KeyboardEvent) => {
+        if (e.key === "Escape") {
+            onClose();
+        } else if (e.key === "Enter" && !e.shiftKey && step === "config" && canExtract && !isLoading) {
+            e.preventDefault();
+            handleExtract();
+        }
+    }, [onClose, step, canExtract, isLoading, handleExtract]);
+
+    useEffect(() => {
+        if (!isOpen) return;
+        document.addEventListener("keydown", handleKeyDown);
+        return () => document.removeEventListener("keydown", handleKeyDown);
+    }, [isOpen, handleKeyDown]);
+
+    if (!isOpen) return null;
+
+    const handleApplyAndClose = (text: string) => {
+        onApply?.(text);
+        onClose();
+    };
+
+    const handleExtractNext = () => {
+        resetToConfig();
+        clearSelectedFile();
+        clearInputUrl();
+    };
+
+    return (
+        <div className="fixed inset-0 z-50 flex items-center justify-center bg-black/80 backdrop-blur-sm p-4 animate-in fade-in duration-200">
+            <div className="bg-[#1a1a1a] border border-white/10 rounded-2xl w-full max-w-2xl max-h-[90vh] overflow-hidden flex flex-col shadow-2xl">
+                {/* Header */}
+                <div className="flex items-center justify-between p-4 border-b border-white/10 bg-white/5">
+                    <h3 className="text-lg font-semibold text-white flex items-center gap-2">
+                        📜 文案提取助手
+                    </h3>
+                    <button
+                        onClick={onClose}
+                        className="text-gray-400 hover:text-white transition-colors text-2xl leading-none"
+                    >
+                        &times;
+                    </button>
+                </div>
+
+                {/* Content */}
+                <div className="flex-1 overflow-y-auto p-6">
+                    {step === "config" && (
+                        <div className="space-y-6">
+                            {/* Tabs */}
+                            <div className="flex p-1 bg-white/5 rounded-xl border border-white/10">
+                                <button
+                                    onClick={() => setActiveTab("url")}
+                                    className={`flex-1 py-2 rounded-lg text-sm font-medium transition-all ${activeTab === "url"
+                                        ? "bg-purple-600 text-white shadow-lg"
+                                        : "text-gray-400 hover:text-white hover:bg-white/5"
+                                        }`}
+                                >
+                                    🔗 粘贴链接
+                                </button>
+                                <button
+                                    onClick={() => setActiveTab("file")}
+                                    className={`flex-1 py-2 rounded-lg text-sm font-medium transition-all ${activeTab === "file"
+                                        ? "bg-purple-600 text-white shadow-lg"
+                                        : "text-gray-400 hover:text-white hover:bg-white/5"
+                                        }`}
+                                >
+                                    📂 上传文件
+                                </button>
+                            </div>
+
+                            {/* URL Input Area */}
+                            {activeTab === "url" && (
+                                <div className="space-y-2 py-4">
+                                    <div className="relative">
+                                        <input
+                                            type="text"
+                                            value={inputUrl}
+                                            onChange={(e) => setInputUrl(e.target.value)}
+                                            placeholder="请粘贴抖音、B站等主流平台视频链接..."
+                                            className="w-full bg-black/20 border border-white/10 rounded-xl px-4 py-4 text-white placeholder-gray-500 focus:outline-none focus:border-purple-500 transition-colors"
+                                        />
+                                        {inputUrl && (
+                                            <button
+                                                onClick={clearInputUrl}
+                                                className="absolute right-3 top-1/2 -translate-y-1/2 text-gray-500 hover:text-white"
+                                            >
+                                                ✕
+                                            </button>
+                                        )}
+                                    </div>
+                                    <p className="text-xs text-gray-500 pl-1">
+                                        支持抖音、B站、微博、小红书等主流平台视频链接
+                                    </p>
+                                </div>
+                            )}
+
+                            {/* File Upload Area */}
+                            {activeTab === "file" && (
+                                <div
+                                    className={`border-2 border-dashed rounded-xl p-8 text-center transition-colors ${dragActive
+                                        ? "border-purple-500 bg-purple-500/10"
+                                        : "border-white/10 hover:border-white/20"
+                                        }`}
+                                    onDragEnter={handleDrag}
+                                    onDragLeave={handleDrag}
+                                    onDragOver={handleDrag}
+                                    onDrop={handleDrop}
+                                >
+                                    {selectedFile ? (
+                                        <div className="space-y-2">
+                                            <p className="text-white">{selectedFile.name}</p>
+                                            <p className="text-sm text-gray-400">
+                                                {(selectedFile.size / 1024 / 1024).toFixed(2)} MB
+                                            </p>
+                                            <button
+                                                onClick={clearSelectedFile}
+                                                className="text-xs text-purple-400 hover:text-purple-300"
+                                            >
+                                                重新选择
+                                            </button>
+                                        </div>
+                                    ) : (
+                                        <div className="space-y-4">
+                                            <div className="text-4xl">📁</div>
+                                            <p className="text-gray-400">
+                                                拖放视频/音频文件到此处，或
+                                                <label className="text-purple-400 hover:text-purple-300 cursor-pointer">
+                                                    点击选择
+                                                    <input
+                                                        type="file"
+                                                        accept=".mp4,.mov,.avi,.mp3,.wav,.m4a"
+                                                        onChange={handleFileChange}
+                                                        className="hidden"
+                                                    />
+                                                </label>
+                                            </p>
+                                            <p className="text-xs text-gray-500">
+                                                支持 MP4, MOV, AVI, MP3, WAV, M4A
+                                            </p>
+                                        </div>
+                                    )}
+                                </div>
+                            )}
+
+                            {/* Options */}
+                            <div className="flex items-center gap-3 bg-white/5 rounded-xl p-4 border border-white/10">
+                                <label className="flex items-center gap-2 cursor-pointer">
+                                    <input
+                                        type="checkbox"
+                                        checked={doRewrite}
+                                        onChange={(e) => setDoRewrite(e.target.checked)}
+                                        className="w-4 h-4 rounded bg-white/10 border-white/20 text-purple-500 focus:ring-purple-500"
+                                    />
+                                    <span className="text-sm text-gray-300">
+                                        AI 智能改写（去口语化）
+                                    </span>
+                                </label>
+                            </div>
+
+                            {/* Error */}
+                            {error && (
+                                <div className="bg-red-500/10 border border-red-500/30 rounded-xl p-4">
+                                    <p className="text-red-400 text-sm">{error}</p>
+                                </div>
+                            )}
+
+                            {/* Action Button */}
+                            <div className="flex gap-3 pt-2">
+                                <button
+                                    onClick={onClose}
+                                    className="flex-1 py-3 px-4 bg-white/10 hover:bg-white/20 text-white rounded-xl transition-colors"
+                                >
+                                    取消
+                                </button>
+                                <button
+                                    onClick={handleExtract}
+                                    disabled={
+                                        (activeTab === "file" && !selectedFile) ||
+                                        (activeTab === "url" && !inputUrl.trim()) ||
+                                        isLoading
+                                    }
+                                    className="flex-1 py-3 px-4 bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-500 hover:to-pink-500 disabled:opacity-50 disabled:cursor-not-allowed text-white rounded-xl transition-all font-medium shadow-lg flex items-center justify-center gap-2"
+                                >
+                                    {isLoading ? <Loader2 className="w-5 h-5 animate-spin" /> : null}
+                                    开始提取
+                                </button>
+                            </div>
+                        </div>
+                    )}
+
+                    {step === "processing" && (
+                        <div className="flex flex-col items-center justify-center py-20">
+                            <div className="relative w-20 h-20 mb-6">
+                                <div className="absolute inset-0 border-4 border-purple-500/30 rounded-full"></div>
+                                <div className="absolute inset-0 border-4 border-t-purple-500 rounded-full animate-spin"></div>
+                            </div>
+                            <h4 className="text-xl font-medium text-white mb-2">
+                                正在处理中...
+                            </h4>
+                            <p className="text-sm text-gray-400 text-center max-w-sm px-4">
+                                {activeTab === "url" && "正在下载视频..."}
+                                <br />
+                                {doRewrite
+                                    ? "正在进行语音识别和 AI 智能改写..."
+                                    : "正在进行语音识别..."}
+                                <br />
+                                <span className="opacity-75">
+                                    大文件可能需要几分钟，请不要关闭窗口
+                                </span>
+                            </p>
+                        </div>
+                    )}
+
+                    {step === "result" && (
+                        <div className="space-y-6">
+                            {rewrittenScript && (
+                                <div className="space-y-2">
+                                    <div className="flex justify-between items-center">
+                                        <h4 className="font-semibold text-purple-300 flex items-center gap-2">
+                                            ✨ AI 洗稿结果{" "}
+                                            <span className="text-xs font-normal text-purple-400/70">
+                                                (推荐)
+                                            </span>
+                                        </h4>
+                                        {onApply && (
+                                            <button
+                                                onClick={() => handleApplyAndClose(rewrittenScript)}
+                                                className="text-xs bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-500 hover:to-pink-500 text-white px-3 py-1.5 rounded-lg transition-colors flex items-center gap-1 shadow-sm"
+                                            >
+                                                📥 填入
+                                            </button>
+                                        )}
+                                        <button
+                                            onClick={() => copyToClipboard(rewrittenScript)}
+                                            className="text-xs bg-purple-600 hover:bg-purple-500 text-white px-3 py-1.5 rounded-lg transition-colors flex items-center gap-1"
+                                        >
+                                            📋 复制内容
+                                        </button>
+                                    </div>
+                                    <div className="bg-purple-900/10 border border-purple-500/20 rounded-xl p-4 max-h-60 overflow-y-auto custom-scrollbar">
+                                        <p className="text-gray-200 text-sm leading-relaxed whitespace-pre-wrap">
+                                            {rewrittenScript}
+                                        </p>
+                                    </div>
+                                </div>
+                            )}
+
+                            <div className="space-y-2">
+                                <div className="flex justify-between items-center">
+                                    <h4 className="font-semibold text-gray-400 flex items-center gap-2">
+                                        🎙️ 原始识别结果
+                                    </h4>
+                                    {onApply && (
+                                        <button
+                                            onClick={() => handleApplyAndClose(script)}
+                                            className="text-xs bg-white/10 hover:bg-white/20 text-white px-3 py-1.5 rounded-lg transition-colors flex items-center gap-1"
+                                        >
+                                            📥 填入
+                                        </button>
+                                    )}
+                                    <button
+                                        onClick={() => copyToClipboard(script)}
+                                        className="text-xs bg-white/10 hover:bg-white/20 text-white px-3 py-1.5 rounded-lg transition-colors"
+                                    >
+                                        复制
+                                    </button>
+                                </div>
+                                <div className="bg-white/5 border border-white/10 rounded-xl p-4 max-h-40 overflow-y-auto custom-scrollbar">
+                                    <p className="text-gray-400 text-sm leading-relaxed whitespace-pre-wrap">
+                                        {script}
+                                    </p>
+                                </div>
+                            </div>
+
+                            <div className="flex justify-center pt-4">
+                                <button
+                                    onClick={handleExtractNext}
+                                    className="px-6 py-2 bg-white/10 hover:bg-white/20 text-white rounded-lg transition-colors"
+                                >
+                                    提取下一个
+                                </button>
+                            </div>
+                        </div>
+                    )}
+                </div>
+            </div>
+        </div>
+    );
+}
--- a/frontend/src/features/home/ui/TimelineEditor.tsx
+++ b/frontend/src/features/home/ui/TimelineEditor.tsx
@@ -0,0 +1,283 @@
+import { useEffect, useRef, useCallback, useState } from "react";
+import WaveSurfer from "wavesurfer.js";
+import type { TimelineSegment } from "@/features/home/model/useTimelineEditor";
+import type { Material } from "@/shared/types/material";
+
+interface TimelineEditorProps {
+  audioDuration: number;
+  audioUrl: string;
+  segments: TimelineSegment[];
+  materials: Material[];
+  onReorderSegment: (fromIdx: number, toIdx: number) => void;
+  onClickSegment: (segment: TimelineSegment) => void;
+}
+
+function formatTime(sec: number): string {
+  const m = Math.floor(sec / 60);
+  const s = sec % 60;
+  return `${String(m).padStart(2, "0")}:${s.toFixed(1).padStart(4, "0")}`;
+}
+
+export function TimelineEditor({
+  audioDuration,
+  audioUrl,
+  segments,
+  materials,
+  onReorderSegment,
+  onClickSegment,
+}: TimelineEditorProps) {
+  const waveRef = useRef<HTMLDivElement>(null);
+  const wsRef = useRef<WaveSurfer | null>(null);
+  const [waveReady, setWaveReady] = useState(false);
+  const [isPlaying, setIsPlaying] = useState(false);
+
+  // Refs for high-frequency DOM updates (avoid 60fps re-renders)
+  const playheadRef = useRef<HTMLDivElement>(null);
+  const timeRef = useRef<HTMLSpanElement>(null);
+  const audioDurationRef = useRef(audioDuration);
+  audioDurationRef.current = audioDuration;
+
+  // Drag-to-reorder state
+  const [dragFromIdx, setDragFromIdx] = useState<number | null>(null);
+  const [dragOverIdx, setDragOverIdx] = useState<number | null>(null);
+
+  // Create / recreate wavesurfer when audioUrl changes
+  useEffect(() => {
+    if (!waveRef.current || !audioUrl) return;
+
+    // Destroy previous instance
+    if (wsRef.current) {
+      wsRef.current.destroy();
+      wsRef.current = null;
+    }
+
+    const ws = WaveSurfer.create({
+      container: waveRef.current,
+      height: 56,
+      waveColor: "#6d28d9",
+      progressColor: "#a855f7",
+      barWidth: 2,
+      barGap: 1,
+      barRadius: 2,
+      cursorWidth: 1,
+      cursorColor: "#e879f9",
+      interact: true,
+      normalize: true,
+    });
+
+    // Click waveform → seek + auto-play
+    ws.on("interaction", () => ws.play());
+    ws.on("play", () => setIsPlaying(true));
+    ws.on("pause", () => setIsPlaying(false));
+    ws.on("finish", () => {
+      setIsPlaying(false);
+      if (playheadRef.current) playheadRef.current.style.display = "none";
+    });
+    // High-frequency: update playhead + time via refs (no React re-render)
+    ws.on("timeupdate", (time: number) => {
+      const dur = audioDurationRef.current;
+      if (playheadRef.current && dur > 0) {
+        playheadRef.current.style.left = `${(time / dur) * 100}%`;
+        playheadRef.current.style.display = "block";
+      }
+      if (timeRef.current) {
+        timeRef.current.textContent = formatTime(time);
+      }
+    });
+
+    ws.load(audioUrl);
+    wsRef.current = ws;
+
+    return () => {
+      ws.destroy();
+      wsRef.current = null;
+      setIsPlaying(false);
+      if (playheadRef.current) playheadRef.current.style.display = "none";
+      if (timeRef.current) timeRef.current.textContent = formatTime(0);
+    };
+  }, [audioUrl, waveReady]);
+
+  // Callback ref to detect when waveRef div mounts
+  const waveCallbackRef = useCallback((node: HTMLDivElement | null) => {
+    (waveRef as React.MutableRefObject<HTMLDivElement | null>).current = node;
+    setWaveReady(!!node);
+  }, []);
+
+  const handlePlayPause = useCallback(() => {
+    wsRef.current?.playPause();
+  }, []);
+
+  // Drag-to-reorder handlers
+  const handleDragStart = useCallback((idx: number, e: React.DragEvent) => {
+    setDragFromIdx(idx);
+    e.dataTransfer.effectAllowed = "move";
+    e.dataTransfer.setData("text/plain", String(idx));
+  }, []);
+
+  const handleDragOver = useCallback((idx: number, e: React.DragEvent) => {
+    e.preventDefault();
+    e.dataTransfer.dropEffect = "move";
+    setDragOverIdx(idx);
+  }, []);
+
+  const handleDragLeave = useCallback(() => {
+    setDragOverIdx(null);
+  }, []);
+
+  const handleDrop = useCallback((toIdx: number, e: React.DragEvent) => {
+    e.preventDefault();
+    const fromIdx = parseInt(e.dataTransfer.getData("text/plain"), 10);
+    if (!isNaN(fromIdx) && fromIdx !== toIdx) {
+      onReorderSegment(fromIdx, toIdx);
+    }
+    setDragFromIdx(null);
+    setDragOverIdx(null);
+  }, [onReorderSegment]);
+
+  const handleDragEnd = useCallback(() => {
+    setDragFromIdx(null);
+    setDragOverIdx(null);
+  }, []);
+
+  // Filter visible vs overflow segments
+  const visibleSegments = segments.filter((s) => s.start < audioDuration);
+  const overflowSegments = segments.filter((s) => s.start >= audioDuration);
+  const hasSegments = visibleSegments.length > 0;
+
+  return (
+    <div className="bg-white/5 rounded-2xl p-4 sm:p-6 border border-white/10 backdrop-blur-sm">
+      <div className="flex items-center justify-between mb-3">
+        <h2 className="text-base sm:text-lg font-semibold text-white flex items-center gap-2">
+          🎞️ 时间轴编辑
+        </h2>
+        {audioUrl && (
+          <div className="flex items-center gap-2 text-xs text-gray-400">
+            <button
+              onClick={handlePlayPause}
+              className="w-7 h-7 flex items-center justify-center rounded-full bg-white/10 hover:bg-white/20 text-white transition-colors"
+              title={isPlaying ? "暂停" : "播放"}
+            >
+              {isPlaying ? "⏸" : "▶"}
+            </button>
+            <span ref={timeRef} className="tabular-nums">00:00.0</span>
+            <span className="text-gray-600">/</span>
+            <span className="tabular-nums">{formatTime(audioDuration)}</span>
+          </div>
+        )}
+      </div>
+
+      {/* Waveform — always rendered so ref stays mounted */}
+      <div className="relative mb-1">
+        <div ref={waveCallbackRef} className="rounded-lg overflow-hidden bg-black/20 cursor-pointer" style={{ minHeight: 56 }} />
+      </div>
+
+      {/* Segment blocks or empty placeholder */}
+      {hasSegments ? (
+        <>
+          <div className="relative h-14 flex select-none">
+            {/* Playhead — syncs with audio playback */}
+            <div
+              ref={playheadRef}
+              className="absolute top-0 h-full w-0.5 bg-fuchsia-400 z-10 pointer-events-none"
+              style={{ display: "none", left: "0%" }}
+            />
+            {visibleSegments.map((seg, i) => {
+              const left = (seg.start / audioDuration) * 100;
+              const width = ((seg.end - seg.start) / audioDuration) * 100;
+              const segDur = seg.end - seg.start;
+              const isDragTarget = dragOverIdx === i && dragFromIdx !== i;
+
+              // Compute loop portion for the last visible segment
+              const isLastVisible = i === visibleSegments.length - 1;
+              let loopPercent = 0;
+              if (isLastVisible && audioDuration > 0) {
+                const mat = materials.find((m) => m.id === seg.materialId);
+                const matDur = mat?.duration_sec ?? 0;
+                const effDur = (seg.sourceEnd > seg.sourceStart)
+                  ? (seg.sourceEnd - seg.sourceStart)
+                  : matDur;
+                if (effDur > 0 && segDur > effDur + 0.1) {
+                  loopPercent = ((segDur - effDur) / segDur) * 100;
+                }
+              }
+
+              return (
+                <div key={seg.id} className="absolute top-0 h-full" style={{ left: `${left}%`, width: `${width}%` }}>
+                  <button
+                    draggable
+                    onDragStart={(e) => handleDragStart(i, e)}
+                    onDragOver={(e) => handleDragOver(i, e)}
+                    onDragLeave={handleDragLeave}
+                    onDrop={(e) => handleDrop(i, e)}
+                    onDragEnd={handleDragEnd}
+                    onClick={() => onClickSegment(seg)}
+                    className={`relative w-full h-full rounded-lg flex flex-col items-center justify-center overflow-hidden cursor-grab active:cursor-grabbing transition-all border ${
+                      isDragTarget
+                        ? "ring-2 ring-purple-400 border-purple-400 scale-[1.02]"
+                        : dragFromIdx === i
+                        ? "opacity-50 border-white/10"
+                        : "hover:opacity-90 border-white/10"
+                    }`}
+                    style={{ backgroundColor: seg.color + "33", borderColor: isDragTarget ? undefined : seg.color + "66" }}
+                    title={`拖拽可调换顺序 · 点击设置截取范围\n${seg.materialName}\n${segDur.toFixed(1)}s${loopPercent > 0 ? ` (含循环 ${(segDur * loopPercent / 100).toFixed(1)}s)` : ""}`}
+                  >
+                    <span className="text-[11px] text-white/90 truncate max-w-full px-1 leading-tight z-[1]">
+                      {seg.materialName}
+                    </span>
+                    <span className="text-[10px] text-white/60 leading-tight z-[1]">
+                      {segDur.toFixed(1)}s
+                    </span>
+                    {seg.sourceStart > 0 && (
+                      <span className="text-[9px] text-amber-400/80 leading-tight z-[1]">
+                        ✂ {seg.sourceStart.toFixed(1)}s
+                      </span>
+                    )}
+                    {/* Loop fill stripe overlay */}
+                    {loopPercent > 0 && (
+                      <div
+                        className="absolute top-0 right-0 h-full pointer-events-none flex items-center justify-center"
+                        style={{
+                          width: `${loopPercent}%`,
+                          background: `repeating-linear-gradient(-45deg, transparent, transparent 3px, rgba(255,255,255,0.07) 3px, rgba(255,255,255,0.07) 6px)`,
+                          borderLeft: "1px dashed rgba(255,255,255,0.25)",
+                        }}
+                      >
+                        <span className="text-[9px] text-white/30">循环</span>
+                      </div>
+                    )}
+                  </button>
+                </div>
+              );
+            })}
+          </div>
+
+          {/* Overflow segments — shown as gray chips */}
+          {overflowSegments.length > 0 && (
+            <div className="flex flex-wrap items-center gap-1.5 mt-1.5">
+              <span className="text-[10px] text-gray-500">未使用:</span>
+              {overflowSegments.map((seg) => (
+                <span
+                  key={seg.id}
+                  className="text-[10px] text-gray-500 bg-white/5 border border-white/10 rounded px-1.5 py-0.5"
+                >
+                  {seg.materialName}
+                </span>
+              ))}
+            </div>
+          )}
+
+          <p className="text-[10px] text-gray-500 mt-1.5">
+            点击波形定位播放 · 拖拽色块调换顺序 · 点击色块设置截取范围
+          </p>
+        </>
+      ) : (
+        <>
+          <div className="h-14 bg-white/5 rounded-lg" />
+          <p className="text-[10px] text-gray-500 mt-1.5">
+            选中配音和素材后可编辑时间轴
+          </p>
+        </>
+      )}
+    </div>
+  );
+}
--- a/Show More
+++ b/Show More
Author	SHA1	Message	Date
Kevin Wong	e33dfc3031	更新	2026-02-10 13:31:29 +08:00
Kevin Wong	3129d45b25	更新	2026-02-09 14:47:19 +08:00
Kevin Wong	e226224119	更新	2026-02-08 19:54:11 +08:00
Kevin Wong	ee342cc40f	更新	2026-02-08 16:23:39 +08:00
Kevin Wong	1a291a03b8	更新	2026-02-08 10:46:08 +08:00
Kevin Wong	1e52346eb4	更新	2026-02-07 14:29:57 +08:00
Kevin Wong	945262a7fc	更新	2026-02-06 16:02:58 +08:00
				`@@ -1 +0,0 @@`
				`from app.modules.admin.router import router`
				`@@ -1 +0,0 @@`
				`from app.modules.assets.router import router`
				`@@ -1 +0,0 @@`
				`from app.modules.login_helper.router import router`
				`@@ -1 +0,0 @@`
				`from app.modules.materials.router import router`
				`@@ -1 +0,0 @@`
				`from app.modules.publish.router import router`
				`@@ -1 +0,0 @@`
				`from app.modules.ref_audios.router import router`
				`@@ -1 +0,0 @@`
				`from app.modules.tools.router import router`
				`@@ -1 +0,0 @@`
				`from app.modules.videos.router import router`