Update

2026-02-04 18:04:17 +08:00 · 2026-02-04 17:19:24 +08:00 · 2026-02-04 16:56:16 +08:00 · 2026-02-04 16:54:59 +08:00 · 2026-02-04 15:59:45 +08:00 · 2026-02-04 11:56:37 +08:00
132 changed files with 27703 additions and 2160 deletions
--- a/Docs/BACKEND_README.md
+++ b/Docs/BACKEND_README.md
@@ -0,0 +1,172 @@
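+# ViGent2 Backend Development Guide
+
+This document gives backend developers an architecture overview, the API conventions, and a guide to the development workflow.
+
+---
+
+## 🏗️ Architecture Overview
+
+The backend is built on the **FastAPI** framework with Python 3.10+. It handles business logic, AI task scheduling, and interaction with the other microservice components.
+
+### Directory Structure
+
+```
+backend/
+├── app/
+│   ├── api/              # API route definitions (endpoints)
+│   ├── core/             # Core configuration (config.py, security.py)
+│   ├── models/           # Pydantic data models (schemas)
+│   ├── services/         # Business-logic service layer
+│   │   ├── auth_service.py       # User authentication service
+│   │   ├── glm_service.py        # GLM-4 large language model service
+│   │   ├── lipsync_service.py    # LatentSync lip sync
+│   │   ├── publish_service.py    # Social media publishing
+│   │   └── voice_clone_service.py# Qwen3-TTS voice cloning
+│   └── tests/            # Unit and integration tests
+├── scripts/              # Ops scripts (watchdog.py, init_db.py)
+├── assets/               # Asset library (fonts, bgm, styles)
+└── requirements.txt      # Dependency list
+```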
+
+---
+
+## 🔌 API Conventions
+
+The backend service listens on port `8006` by default.
+
+- **Docs URL**: `http://localhost:8006/docs` (Swagger UI)
+- **Authentication**: Bearer Token (JWT)
+
+### Core Modules
+
+1.  **Authentication (Auth)**
+    *   `POST /api/auth/login`: User login (phone number)
+    *   `POST /api/auth/register`: User registration
+    *   `GET /api/auth/me`: Get the current user's information
+
+2.  **Video Generation (Videos)**
+    *   `POST /api/videos/generate`: Submit a generation task
+    *   `GET /api/videos/tasks/{task_id}`: Query task status
+    *   `GET /api/videos/generated`: List previously generated videos
+    *   `DELETE /api/videos/generated/{video_id}`: Delete a generated video
+
+> **Correction (16:20)**: The task query and history list endpoints have been updated to `/api/videos/tasks/{task_id}` and `/api/videos/generated`.
+
+3.  **Material Management (Materials)**
+    *   `POST /api/materials/upload`: Upload a material (Direct Upload to Supabase)
+    *   `GET /api/materials`: List materials
+
+4.  **Social Publishing (Publish)**
+    *   `POST /api/publish`: Publish a video to Bilibili/Douyin/Xiaohongshu
+
+5.  **Asset Library (Assets)**
+    *   `GET /api/assets/subtitle-styles`: List subtitle styles
+    *   `GET /api/assets/title-styles`: List title styles
+    *   `GET /api/assets/bgm`: List background music
+
+---
+
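+## 🎛️ Extended Video Generation Parameters
+
+`POST /api/videos/generate` accepts the following optional fields:
+
+- `subtitle_style_id`: Subtitle style ID
+- `title_style_id`: Title style ID
+- `subtitle_font_size`: Subtitle font size (overrides the style default)
+- `title_font_size`: Title font size (overrides the style default)
+- `bgm_id`: Background music ID
+- `bgm_volume`: Background music volume (0-1, default 0.2)
+
+## 📦 Asset Library and Static Assets
+
+- Local asset directory: `backend/assets/{fonts,bgm,styles}`
+- Static path: `/assets` (used by the frontend for style previews and BGM playback)
+
+## 🎵 Background Music Mixing Strategy
+
+- Mixing happens **after lip-sync alignment**, so it does not affect the subtitle/lip timeline.
+- Uses FFmpeg `amix` with normalization disabled to keep the voice-over volume stable.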
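As a rough illustration of the mixing strategy above, the sketch below builds an FFmpeg argument list that lowers the BGM to `bgm_volume` and mixes it under the voice track using `amix` with `normalize=0`. The function name `build_bgm_mix_args`, the stream layout, and the `duration=first` / `-c:v copy` choices are assumptions for illustration, not the project's actual implementation; the `normalize` option also needs a reasonably recent FFmpeg build.

```python
from typing import List


def build_bgm_mix_args(video_path: str, bgm_path: str, output_path: str,
                       bgm_volume: float = 0.2) -> List[str]:
    """Build an ffmpeg argv that mixes BGM under the existing voice track.

    Hypothetical helper: assumes input 0 is the lip-synced video with the
    voice-over audio and input 1 is the background music file.
    """
    if not 0.0 <= bgm_volume <= 1.0:
        raise ValueError("bgm_volume must be between 0 and 1")

    # Scale the BGM first, then mix; normalize=0 keeps amix from rescaling
    # the voice-over when the second input is added.
    filter_graph = (
        f"[1:a]volume={bgm_volume}[bgm];"
        "[0:a][bgm]amix=inputs=2:duration=first:normalize=0[aout]"
    )
    return [
        "ffmpeg", "-y",
        "-i", video_path,
        "-i", bgm_path,
        "-filter_complex", filter_graph,
        "-map", "0:v",
        "-map", "[aout]",
        "-c:v", "copy",  # video is untouched, only audio is re-encoded
        output_path,
    ]
```

The returned list can be passed to `subprocess.run(args, check=True)`; keeping it as a list rather than a shell string avoids quoting problems with file names.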
+
+## 🛠️ Development Environment Setup
+
+### 1. Virtual Environment
+
+```bash
+cd backend
+python -m venv venv
+source venv/bin/activate  # Linux/macOS
+# .\venv\Scripts\activate # Windows
+```
+
+### 2. Install Dependencies
+
+```bash
+pip install -r requirements.txt
+```
+
+### 3. Environment Variables
+
+Copy `.env.example` to `.env` and fill in the required keys:
+
+```ini
+# Supabase
+SUPABASE_URL=http://localhost:8008
+SUPABASE_KEY=your_service_role_key
+
+# GLM API (used for AI title generation)
+GLM_API_KEY=your_glm_api_key
+
+# LatentSync configuration
+LATENTSYNC_GPU_ID=1
+```
+
+### 4. Start the Service
+
+**Development mode (hot reload)**:
+```bash
+uvicorn app.main:app --host 0.0.0.0 --port 8006 --reload
+```
+
+---
+
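+## 🧩 Service Integration Guide
+
+### Integrating a New Model
+
+To integrate a new AI model (for example, a new TTS engine):
+
+1.  Create a new Service class under `app/services/` (e.g. `NewTTSService`).
+2.  Implement a `generate` method; it can invoke the model through a subprocess or over HTTP.
+3.  **Important**: If the model uses the GPU, guard it with an `asyncio.Lock` for concurrency control to prevent OOM.
+4.  Add the corresponding route in `app/api/`.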
+
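+### Adding Scheduled Tasks
+
+**APScheduler** or **Crontab** is currently recommended for managing scheduled tasks.
+Scheduled social media publishing currently relies on delayed execution in `playwright`; a migration to a Celery queue is planned.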
+
+---
+
+## 🛡️ Error Handling
+
+The whole project uses `Loguru` for logging.
+
+```python
+from fastapi import HTTPException
+from loguru import logger
+
+try:
+    ...  # business logic
+except Exception as e:
+    logger.error(f"Operation failed: {e}")
+    raise HTTPException(status_code=500, detail="Internal server error")
+```
+
+---
+
+## 🧪 Testing
+
+Run the test suite:
+
+```bash
+pytest
+```
--- a/Docs/DEPLOY_MANUAL.md
+++ b/Docs/DEPLOY_MANUAL.md
@@ -98,6 +98,15 @@ playwright install chromium

 ---

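+### Optional: AI Title/Tag Generation
+
+> ✅ To enable "AI title/tag generation", make sure the backend can reach the external API.
+
+- `https://open.bigmodel.cn` must be reachable
+- The API key is configured in `backend/app/services/glm_service.py` (replacing it with your own key is recommended)
+
+---
+
 ## Step 5: Deploy the User Authentication System (Supabase + Auth)

 > 🔐 **Includes**: login/registration, Supabase database configuration, JWT authentication, admin console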
@@ -107,7 +116,30 @@ playwright install chromium

 ---

-## Step 6: Configure Environment Variables
+## Step 6: Configure Supabase RLS Policies (Important)
+ 
+ > ⚠️ **Note**: To support direct uploads from the frontend, you must configure row-level security (RLS) policies for the storage buckets.
+ 
+ 1. Make sure the Supabase containers are running (`docker ps`).
+ 2. Run `supabase_rls.sql` from the project root (if present), or the SQL below, against the database.
+ 3. **Run the command**:
+    ```bash
+    # Enter the backend directory
+    cd /home/rongye/ProgramFiles/ViGent2/backend
+    
+    # Run the SQL (creates the materials/outputs buckets; lets the anon role upload to materials and read both)
+    docker exec -i supabase-db psql -U postgres <<EOF
+    INSERT INTO storage.buckets (id, name, public) VALUES ('materials', 'materials', true) ON CONFLICT (id) DO NOTHING;
+    INSERT INTO storage.buckets (id, name, public) VALUES ('outputs', 'outputs', true) ON CONFLICT (id) DO NOTHING;
+    CREATE POLICY "Allow public uploads" ON storage.objects FOR INSERT TO anon WITH CHECK (bucket_id = 'materials');
+    CREATE POLICY "Allow public read" ON storage.objects FOR SELECT TO anon USING (bucket_id = 'materials' OR bucket_id = 'outputs');
+    EOF
+    ```
+ 
+ ---
+ 
+ ## Step 7: Configure Environment Variables
+

 ```bash
 cd /home/rongye/ProgramFiles/ViGent2/backend
@@ -121,6 +153,8 @@ cp .env.example .env

 | Setting | Default | Description |
 |--------|--------|------|
+| `SUPABASE_URL` | `http://localhost:8008` | Internal Supabase API address |
+| `SUPABASE_PUBLIC_URL` | `https://api.hbyrkj.top` | Public Supabase API address (used by the frontend) |
 | `LATENTSYNC_GPU_ID` | 1 | GPU selection (0 or 1) |
 | `LATENTSYNC_USE_SERVER` | false | Set to true to enable the resident server for faster inference |
 | `LATENTSYNC_INFERENCE_STEPS` | 20 | Inference steps (20-50) |
@@ -129,7 +163,7 @@ cp .env.example .env

 ---

-## Step 7: Install Frontend Dependencies
+## Step 8: Install Frontend Dependencies

 ```bash
 cd /home/rongye/ProgramFiles/ViGent2/frontend
@@ -143,7 +177,7 @@ npm run build

 ---

-## Step 8: Test Run
+## Step 9: Test Run

 > 💡 Start the services manually first; configure pm2 as a resident service only after everything works.

@@ -178,7 +212,7 @@ python -m scripts.server

 ---

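-## Step 9: Manage Resident Services with pm2
+## Step 10: Manage Resident Services with pm2

 > pm2 is recommended for managing all services; it supports automatic restarts and log management.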

@@ -233,7 +267,42 @@ chmod +x run_latentsync.sh
 pm2 start ./run_latentsync.sh --name vigent2-latentsync
 ```

-### 4. Save the Current Process List (Start on Boot)
+### 4. Start the Qwen3-TTS Voice Cloning Service (Optional)
+
+> Start this service if you need the voice cloning feature.
+
+1. Install the HTTP service dependencies:
+```bash
+conda activate qwen-tts
+pip install fastapi uvicorn python-multipart
+```
+
+2. The launch script is in the project root: `run_qwen_tts.sh`
+
+3. Start it with pm2:
+```bash
+cd /home/rongye/ProgramFiles/ViGent2
+pm2 start ./run_qwen_tts.sh --name vigent2-qwen-tts
+pm2 save
+```
+
+4. Verify the service:
+```bash
+# Check health status
+curl http://localhost:8009/health
+```
+
+### 5. Start the Service Watchdog
+
+> 🛡️ **Recommended**: Monitors the health of the Qwen-TTS and LatentSync services and restarts them automatically if they hang.
+
+```bash
+cd /home/rongye/ProgramFiles/ViGent2
+pm2 start ./run_watchdog.sh --name vigent2-watchdog
+pm2 save
+```
+
+### 6. Save the Current Process List (Start on Boot)

 ```bash
 pm2 save
@@ -246,6 +315,7 @@ pm2 startup
 pm2 status                    # Show the status of all services
 pm2 logs                      # Show all logs
 pm2 logs vigent2-backend      # Show backend logs
+pm2 logs vigent2-qwen-tts     # Show Qwen3-TTS logs
 pm2 restart all               # Restart all services
 pm2 stop vigent2-latentsync   # Stop the LatentSync service
 pm2 delete all                # Delete all services
@@ -254,7 +324,7 @@ pm2 delete all                # Delete all services

 ---

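-## Step 10: Configure Nginx HTTPS (Optional - Public Access)
+## Step 11: Configure Nginx HTTPS (Optional - Public Access)

 If you need HTTPS access through a public domain (e.g. `https://vigent.hbyrkj.top`), refer to the Nginx configuration below.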

@@ -294,8 +364,81 @@ server {

 ---

+
+---
+
+---
+
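+## Step 13: Deploy Optional Features (Subtitles and Copywriting Assistant)
+
+This section describes how to deploy per-character highlighted subtitles, the opening title, and the script extraction assistant.
+
+### 13.1 Deploy the Subtitle System
+
+Includes the `faster-whisper` (subtitle generation) and `Remotion` (video rendering) components.
+
+For detailed steps, see the **[Subtitle Deployment Guide](SUBTITLE_DEPLOY.md)**.
+
+Quick steps:
+1. Install the Python dependency: `faster-whisper`
+2. Install the Node.js dependencies: `npm install` (in the `remotion/` directory)
+3. Verify: `npx remotion --version`
+
+### 13.2 Deploy the Copywriting Assistant
+
+Extracts scripts from Bilibili/Douyin/TikTok video links and rewrites them with AI.
+
+1. **Install the core dependencies**:
+   ```bash
+   cd /home/rongye/ProgramFiles/ViGent2/backend
+   source venv/bin/activate
+   pip install yt-dlp zai-sdk
+   ```
+
+2. **Configure AI rewriting (GLM)**:
+   Make sure `GLM_API_KEY` is set in `.env`:
+   ```ini
+   GLM_API_KEY=your_zhipu_api_key
+   ```
+
+3. **Verify**:
+   Open `http://localhost:8006/docs` and test the `/api/tools/extract-script` endpoint.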
+
+---
+
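+## Step 14: Configure the Alibaba Cloud Nginx Gateway (Critical)
+
+> ⚠️ **CRITICAL**: If a domain such as `api.hbyrkj.top` is the entry point, you must lift the upload size limit in the Nginx configuration on Alibaba Cloud (or whichever host is the public entry point).
+> **This is the core cause of the 500/413 errors.**
+
+**Key settings**:
+```nginx
+server {
+    listen 443 ssl;
+    server_name api.hbyrkj.top;
+    
+    # ... other SSL configuration ...
+
+    # Allow large uploads (0 means unlimited; alternatively set 100M, 500M, etc.)
+    client_max_body_size 0; 
+
+    location / {
+        proxy_pass http://127.0.0.1:YOUR_FRP_PORT;
+        
+        # Extend the timeouts
+        proxy_read_timeout 600s;
+        proxy_send_timeout 600s;
+    }
+}
+```
+
+**Consequence**: Without this setting, uploads are cut off at around 1MB or 10MB with a 413 Payload Too Large or a 500/502 error.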
+
+---
+
 ## Troubleshooting

+
 ### GPU Unavailable

 ```bash
@@ -311,6 +454,7 @@ python3 -c "import torch; print(torch.cuda.is_available())"
 sudo lsof -i :8006
 sudo lsof -i :3002
 sudo lsof -i :8007
+sudo lsof -i :8009  # Qwen3-TTS
 ```

 ### View Logs
@@ -320,6 +464,7 @@ sudo lsof -i :8007
 pm2 logs vigent2-backend
 pm2 logs vigent2-frontend
 pm2 logs vigent2-latentsync
+pm2 logs vigent2-qwen-tts
 ```

 ### SSH Lag / Slow System Response
@@ -346,6 +491,7 @@ pm2 logs vigent2-latentsync
 | `fastapi` | Web API framework |
 | `uvicorn` | ASGI server |
 | `edge-tts` | Microsoft TTS voice-over |
+| `httpx` | HTTP client for the GLM API |
 | `playwright` | Automated social media publishing |
 | `biliup` | Bilibili video upload |
 | `loguru` | Logging |
--- a/Docs/DevLogs/Day11.md
+++ b/Docs/DevLogs/Day11.md
@@ -0,0 +1,278 @@
+
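+## 🔧 Upload Architecture Refactor (Direct Upload)
+
+### 🚨 Problem Description (10:30)
+**Symptom**: Uploading files larger than 7MB made the backend return 500 Internal Server Error; the underlying error was `ClientDisconnect`.
+**ROOT CAUSE**:
+- **Aliyun Nginx gateway limit**: The Nginx configuration for the `api.hbyrkj.top` domain was missing `client_max_body_size 0;`.
+- **Default limit**: Nginx limits the request body to 1MB by default (or another small value), so the gateway forcibly cut the connection during large uploads.
+- **Misdiagnosis**: The initial investigation focused on FRP and backend proxy timeouts, but the real cause was the hard limit at the gateway layer.
+
+### ✅ Solution: Direct Frontend Upload to Supabase + Gateway Configuration (14:00)
+
+**Core changes**:
+1. **Gateway configuration**: Add `client_max_body_size 0;` to the `api.hbyrkj.top` block in the Aliyun Nginx configuration (removes the size limit).
+2. **Architecture optimization**: Remove the backend file-forwarding logic; the frontend now uploads directly to Supabase Storage (fewer hops in the chain).
+
+#### 1. Frontend Changes (`frontend/src/app/page.tsx`)
+- Add the `@supabase/supabase-js` client.
+- Upload directly with `supabase.storage.from('materials').upload()`.
+- Remove the old `XMLHttpRequest` proxy upload logic.
+- Add a file renaming scheme: `{timestamp}_{sanitized_filename}`.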
+
+```typescript
+// V2: Direct Upload (Bypass Backend)
+const { data, error } = await supabase.storage
+    .from('materials')
+    .upload(path, file, {
+        cacheControl: '3600',
+        upsert: false
+    });
+```
+
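+#### 2. Backend Adaptation (`backend/app/api/materials.py`)
+- **Upload endpoint**: (deprecated / kept for very small files) most traffic goes through direct upload.
+- **List endpoint**: Now returns **signed URLs** instead of local paths.
+- **Compatibility**: The frontend receives the `path` field as a complete URL and no longer needs to build it.
+
+#### 3. Access Control (RLS)
+- Supabase forbids anonymous writes by default.
+- Run SQL policies that grant the `anon` role `INSERT` and `SELECT` on the `materials` bucket.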
+
+```sql
+-- Allow anonymous uploads
+CREATE POLICY "Allow public uploads" 
+ON storage.objects FOR INSERT 
+TO anon WITH CHECK (bucket_id = 'materials');
+```
+
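+### Results
+- ✅ **Timeouts resolved**: Uploads no longer pass through Nginx/FRP and go straight to the Supabase CDN.
+- ✅ **Size limit removed**: Uploads are no longer bound by the backend service's `client_max_body_size`.
+- ✅ **Better user experience**: Uploads are faster and the progress bar is more accurate.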
+
+
+
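+## 🔧 Supabase Deployment and RLS Configuration
+
+### Related Files
+- `supabase_rls.sql`: SQL script defining the storage bucket permissions.
+- `docker-compose.yml`: Confirm that the Storage service is configured correctly.
+
+### Steps
+1. Upload `supabase_rls.sql` to the server.
+2. Run the SQL through Docker:
+   ```bash
+   cat supabase_rls.sql | docker exec -i supabase-db psql -U postgres
+   ```
+3. Verify that frontend uploads succeed.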
+
+---
+
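+## 🔐 User Isolation (15:00)
+
+### Problem Description
+After logging in, different accounts could see materials uploaded and videos generated by other users; there was no data isolation.
+
+### Solution: Isolation by Storage Path Prefix
+
+#### 1. Materials Module (`backend/app/api/materials.py`)
+
+```python
+# Prefix the path with the user ID on upload
+storage_path = f"{user_id}/{timestamp}_{safe_name}"
+
+# List only the current user's directory
+files_obj = await storage_service.list_files(
+    bucket=storage_service.BUCKET_MATERIALS,
+    path=user_id  # only list files under the user's directory
+)
+
+# Verify ownership before deleting
+if not material_id.startswith(f"{user_id}/"):
+    raise HTTPException(403, "You do not have permission to delete this material")
+```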
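To make the prefix scheme concrete, here is a small self-contained sketch of how a `{user_id}/{timestamp}_{safe_name}` path and the ownership check might be built. The helper names (`sanitize_filename`, `build_storage_path`, `owns_object`) and the ASCII-only sanitizing rule are assumptions for illustration; the project's real sanitizer is not shown in this log.

```python
import re
import time
from typing import Optional


def sanitize_filename(name: str) -> str:
    """Drop any directory part and replace characters outside [A-Za-z0-9._-]."""
    base = name.replace("\\", "/").split("/")[-1]
    cleaned = re.sub(r"[^A-Za-z0-9._-]", "_", base)
    return cleaned or "file"


def build_storage_path(user_id: str, filename: str,
                       timestamp: Optional[int] = None) -> str:
    """Return '{user_id}/{timestamp}_{safe_name}' for a new upload."""
    ts = int(time.time()) if timestamp is None else timestamp
    return f"{user_id}/{ts}_{sanitize_filename(filename)}"


def owns_object(user_id: str, object_path: str) -> bool:
    """True only if the object lives under this user's own prefix."""
    # The trailing slash matters: 'u1/' must not match 'u10/...'.
    return object_path.startswith(f"{user_id}/")
```

For example, `build_storage_path("u1", "../my video.mp4", 1737000001)` yields `u1/1737000001_my_video.mp4`, and `owns_object("u1", "u10/x.mp4")` is `False` because the prefix comparison includes the slash.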
+
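+#### 2. Videos Module (`backend/app/api/videos.py`)
+
+```python
+# Store generated videos under the user ID directory
+storage_path = f"{user_id}/{task_id}_output.mp4"
+
+# Listing and deletion are isolated by user directory in the same way
+```
+
+#### 3. Publish Module (`backend/app/services/publish_service.py`)
+- Cookie storage supports user isolation: `cookies/{user_id}/{platform}.json`
+
+### Storage Layout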
+```
+Supabase Storage/
+├── materials/
+│   ├── {user_id_1}/
+│   │   ├── 1737000001_video1.mp4
+│   │   └── 1737000002_video2.mp4
+│   └── {user_id_2}/
+│       └── 1737000003_video3.mp4
+└── outputs/
+    ├── {user_id_1}/
+    │   └── {task_id}_output.mp4
+    └── {user_id_2}/
+        └── ...
+```
+
+### Results
+- ✅ Data of different users is fully isolated
+- ✅ Cookies and login state are stored per user
+- ✅ Delete operations verify ownership
+
+---
+
+## 🌐 Storage URL Fix (16:00)
+
+### Problem Description
+Generated video URLs were `http://localhost:8008/...`, which the frontend could not reach.
+
+### Solution
+
+#### 1. Backend Configuration (`backend/.env`)
+```ini
+SUPABASE_URL=http://localhost:8008        # internal access
+SUPABASE_PUBLIC_URL=https://api.hbyrkj.top  # public access
+```
+
+#### 2. URL Conversion (`backend/app/services/storage.py`)
+```python
+def _convert_to_public_url(self, url: str) -> str:
+    """Convert an internal URL into a publicly reachable URL."""
+    if settings.SUPABASE_PUBLIC_URL and settings.SUPABASE_URL:
+        internal_url = settings.SUPABASE_URL.rstrip('/')
+        public_url = settings.SUPABASE_PUBLIC_URL.rstrip('/')
+        return url.replace(internal_url, public_url)
+    return url
+```
+
+### Results
+- ✅ URLs received by the frontend are reachable
+- ✅ Video preview and download work
+
+---
+
+## ⚡ Publish Service Optimization - Reading Local Files Directly (16:30)
+
+### Problem Description
+Publishing a video first required downloading the Supabase Storage file over HTTP into a temporary directory, which was slow and wasteful.
+
+### Finding
+Supabase Storage files are actually stored on the local disk:
+```
+/home/rongye/ProgramFiles/Supabase/volumes/storage/stub/stub/{bucket}/{path}/{internal_uuid}
+```
+
+### Solution
+
+#### 1. Add a Local Path Lookup Method (`storage.py`)
+```python
+from pathlib import Path
+from typing import Optional
+
+SUPABASE_STORAGE_LOCAL_PATH = Path("/home/rongye/ProgramFiles/Supabase/volumes/storage/stub/stub")
+
+def get_local_file_path(self, bucket: str, path: str) -> Optional[str]:
+    """Return the local disk path of a Storage file, or None if it is not found."""
+    dir_path = SUPABASE_STORAGE_LOCAL_PATH / bucket / path
+    if not dir_path.exists():
+        return None
+    files = list(dir_path.iterdir())
+    return str(files[0]) if files else None
+```
+
+#### 2. Publish Service Prefers Local Files (`publish_service.py`)
+```python
+import os
+import re
+
+# Parse the URL to get the bucket and path
+local_video_path = None  # stays None when the URL is not a signed Storage URL
+match = re.search(r'/storage/v1/object/sign/([^/]+)/(.+?)\?', video_path)
+if match:
+    bucket, storage_path = match.group(1), match.group(2)
+    local_video_path = storage_service.get_local_file_path(bucket, storage_path)
+
+if local_video_path and os.path.exists(local_video_path):
+    logger.info(f"[publish] Using local file directly: {local_video_path}")
+else:
+    ...  # Fallback: download over HTTP
+```
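The URL parsing step can be tried in isolation. The sketch below wraps the same regular expression in a hypothetical helper, `parse_signed_url`, and runs it against a made-up signed URL; the token and object names are illustrative only. It does not URL-decode the path, so names containing percent-encoded characters would need `urllib.parse.unquote` before being used on disk.

```python
import re
from typing import Optional, Tuple

# Same pattern as in the publish service snippet: bucket, then object path,
# terminated by the query string that carries the signature token.
SIGNED_URL_PATTERN = re.compile(r"/storage/v1/object/sign/([^/]+)/(.+?)\?")


def parse_signed_url(url: str) -> Optional[Tuple[str, str]]:
    """Return (bucket, storage_path) for a signed Storage URL, else None."""
    match = SIGNED_URL_PATTERN.search(url)
    if match is None:
        return None
    return match.group(1), match.group(2)


example = ("https://api.hbyrkj.top/storage/v1/object/sign/"
           "outputs/user-1/task-9_output.mp4?token=abc")
```

Here `parse_signed_url(example)` gives `("outputs", "user-1/task-9_output.mp4")`, while a URL without the `/object/sign/` segment or without a query string returns `None`, which is the case where the HTTP fallback applies.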
+
+### Results
+- ✅ Publishing is noticeably faster (the download step is skipped)
+- ✅ Fewer temporary files
+- ✅ HTTP download is kept as a fallback
+
+---
+
+## 🔧 Supabase Studio Configuration (17:00)
+
+### Changes
+Update `/home/rongye/ProgramFiles/Supabase/.env`:
+```ini
+# Before
+SUPABASE_PUBLIC_URL=http://localhost:8000
+
+# After
+SUPABASE_PUBLIC_URL=https://api.hbyrkj.top
+```
+
+### Reason
+When Studio is accessed publicly through `supabase.hbyrkj.top`, it needs the correct public API address.
+
+### Action
+```bash
+docker compose restart studio
+```
+
+### Open Issues
+- 🔄 Studio Settings page fails to load (401 Unauthorized); possibly a conflict with the Nginx Basic Auth configuration
+
+---
+
+## 📁 Files Changed Today
+
+| File | Change | Description |
+|------|----------|------|
+| `backend/app/api/materials.py` | Modified | Added user isolation |
+| `backend/app/api/videos.py` | Modified | Added user isolation |
+| `backend/app/services/storage.py` | Modified | URL conversion + local path lookup |
+| `backend/app/services/publish_service.py` | Modified | Local file direct-read optimization |
+| `backend/.env` | Modified | Added SUPABASE_PUBLIC_URL |
+| `Supabase/.env` | Modified | SUPABASE_PUBLIC_URL |
+| `frontend/src/app/page.tsx` | Modified | Switched to uploading via the backend API |
+
+---
+
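+## 📅 Plan for Tomorrow (Day 12)
+
+### 🎯 Goal: Deploy the Qwen3-TTS 0.6B Voice Cloning System
+
+**Background**:
+- EdgeTTS (Microsoft's cloud TTS) is currently used; its voices are fixed and cannot be customized
+- Qwen3-TTS supports **zero-shot voice cloning** and can clone any voice from a short audio sample
+
+**Core tasks**:
+1. **Model deployment**
+   - Create a dedicated Conda environment (`qwen-tts`)
+   - Download the Qwen3-TTS 0.6B model weights
+   - Configure the GPU inference environment
+
+2. **Backend integration**
+   - Add a `qwen_tts_service.py` service
+   - Support voice cloning: upload reference audio → generate cloned speech
+   - Stay compatible with the existing `tts_service.py` interface
+
+3. **Frontend changes**
+   - Add a "Voice cloning" option
+   - Support uploading reference audio (3-10 seconds)
+   - Voice preview
+
+**Expected outcome**:
+- ✅ Users can upload their own voice samples
+- ✅ Generated talking-head videos use the cloned voice
+- ✅ EdgeTTS is kept as a fallback
+
+**References**:
+- Model: [Qwen/Qwen3-TTS-0.6B](https://huggingface.co/Qwen/Qwen3-TTS-0.6B)
+- VRAM requirement: ~4GB (0.6B parameters)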
--- a/Docs/DevLogs/Day12.md
+++ b/Docs/DevLogs/Day12.md
@@ -0,0 +1,347 @@
+# Day 12 - iOS Compatibility and Mobile UI Optimization
+
+**Date**: 2026-01-28
+
+---
+
+## 🔐 Axios Global Interceptor Optimization
+
+### Background
+Handle authentication failures for API requests in one place so that individual pages do not each handle 401/403 errors.
+
+### Implementation (`frontend/src/shared/api/axios.ts`)
+
+```typescript
+import axios from 'axios';
+
+// Resolve the API base dynamically: localhost on the server, current origin on the client
+const API_BASE = typeof window === 'undefined'
+    ? 'http://localhost:8006'
+    : '';
+
+// Prevent repeated redirects
+let isRedirecting = false;
+
+const api = axios.create({
+    baseURL: API_BASE,
+    withCredentials: true, // send the HttpOnly cookie automatically
+    headers: { 'Content-Type': 'application/json' },
+});
+
+// Response interceptor: handle 401/403 globally
+api.interceptors.response.use(
+    (response) => response,
+    async (error) => {
+        const status = error.response?.status;
+
+        if ((status === 401 || status === 403) && !isRedirecting) {
+            isRedirecting = true;
+
+            // Call the logout API to clear the HttpOnly cookie
+            try {
+                await fetch('/api/auth/logout', { method: 'POST' });
+            } catch (e) { /* ignore */ }
+
+            // Redirect to the login page
+            if (typeof window !== 'undefined') {
+                window.location.replace('/login');
+            }
+        }
+        return Promise.reject(error);
+    }
+);
+
+export default api;
+```
+
+### Key Features
+- ✅ **Cookies sent automatically**: `withCredentials: true` ensures the HttpOnly JWT cookie is sent
+- ✅ **Automatic redirect on 401/403**: On authentication failure, clears the session and redirects to the login page
+- ✅ **No duplicate redirects**: The `isRedirecting` flag prevents several requests from triggering the redirect at once
+- ✅ **SSR compatible**: Uses `localhost` during server-side rendering and relative paths on the client
+
+---
+
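+## 🔧 iOS Safari Safe Area White Bars Fix
+
+### Problem Description
+On iPhone Safari, white areas appeared at the top and bottom of the page, while Android was fine. iOS Safari has a safe area, and the page background did not extend into it.
+
+### Root Cause
+1. Missing `viewport-fit=cover` setting
+2. `min-h-screen` (100vh) does not include the safe area on iOS Safari
+3. The background gradient was on the page div rather than on body, so the safe area showed a solid color
+
+### Solution
+
+#### 1. Add the viewport configuration (`layout.tsx`)
+```typescript
+export const viewport: Viewport = {
+  width: 'device-width',
+  initialScale: 1,
+  viewportFit: 'cover',    // let content extend into the safe area
+  themeColor: '#0f172a',   // top status bar color
+};
+```
+
+#### 2. Move the gradient background to body (`layout.tsx`)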
+```tsx
+<html lang="en" style={{ backgroundColor: '#0f172a' }}>
+  <body
+    style={{
+      margin: 0,
+      minHeight: '100dvh',
+      background: 'linear-gradient(to bottom, #0f172a 0%, #0f172a 5%, #581c87 50%, #0f172a 95%, #0f172a 100%)',
+    }}
+  >
+    {children}
+  </body>
+</html>
+```
+
+#### 3. CSS safe area support (`globals.css`)
+```css
+html {
+  background-color: #0f172a !important;
+  min-height: 100%;
+}
+
+body {
+  margin: 0 !important;
+  min-height: 100dvh;
+  padding-top: env(safe-area-inset-top);
+  padding-bottom: env(safe-area-inset-bottom);
+}
+```
+
+#### 4. Remove per-page gradient backgrounds
+Remove the `bg-gradient-to-br` class from each page's root div and rely on the body gradient:
+- `page.tsx`
+- `login/page.tsx`
+- `publish/page.tsx`
+- `admin/page.tsx`
+- `register/page.tsx`
+
+### Results
+- ✅ Top status bar color matches the page (themeColor)
+- ✅ Bottom safe area color matches the gradient edge
+- ✅ No visible layering; the background is uniform
+
+---
+
+## 📱 Mobile Header Responsive Optimization
+
+### Problem Description
+On mobile, the top navigation buttons (Video Generation, Publish Management, Log Out) were too crowded and their text wrapped.
+
+### Solution
+
+#### Home Page Header (`page.tsx`)
+```tsx
+<header className="border-b border-white/10 bg-black/20 backdrop-blur-sm">
+  <div className="max-w-6xl mx-auto px-4 sm:px-6 py-3 sm:py-4 flex items-center justify-between">
+    <Link href="/" className="text-xl sm:text-2xl font-bold ...">
+      <span className="text-3xl sm:text-4xl">🎬</span>
+      ViGent
+    </Link>
+    <div className="flex items-center gap-1 sm:gap-4">
+      <span className="px-2 sm:px-4 py-1 sm:py-2 text-sm sm:text-base ...">
+        Video Generation
+      </span>
+      {/* other buttons are handled the same way */}
+    </div>
+  </div>
+</header>
+```
+
+#### Publish Management Page Header (`publish/page.tsx`)
+The same responsive class names are applied.
+
+### Key Changes
+| Property | Mobile | Desktop |
+|------|--------|--------|
+| Container padding | `px-4 py-3` | `px-6 py-4` |
+| Button gap | `gap-1` | `gap-4` |
+| Button padding | `px-2 py-1` | `px-4 py-2` |
+| Font size | `text-sm` | `text-base` |
+| Logo size | `text-xl` + `text-3xl` | `text-2xl` + `text-4xl` |
+
+### Results
+- ✅ Buttons are compact on mobile and no longer wrap
+- ✅ Desktop keeps its original roomy layout
+
+---
+
+## 🚀 Publish Page UI Redesign
+
+### Problem Description
+The original design put the "publish time" option inside the form, so users could pick "scheduled publish" by mistake and forget to set a time.
+
+### Solution
+Replace the single "one-click publish" button with two separate buttons:
+- **Publish now** (green, 3/4 width): the primary action
+- **Schedule** (1/4 width): click to expand the time picker
+
+#### New Layout (`publish/page.tsx`)
+```tsx
+{/* Publish button area */}
+<div className="space-y-3">
+  <div className="flex gap-3">
+    {/* Publish now - 3/4 width */}
+    <button
+      onClick={() => { setScheduleMode("now"); handlePublish(); }}
+      className="flex-[3] py-4 rounded-xl font-bold text-lg bg-gradient-to-r from-green-600 to-teal-600 ..."
+    >
+      🚀 Publish now
+    </button>
+
+    {/* Schedule - 1/4 width */}
+    <button
+      onClick={() => setScheduleMode(scheduleMode === "scheduled" ? "now" : "scheduled")}
+      className="flex-1 py-4 rounded-xl font-bold text-base ..."
+    >
+      ⏰ Schedule
+    </button>
+  </div>
+
+  {/* Schedule time picker (shown when expanded) */}
+  {scheduleMode === "scheduled" && (
+    <div className="flex gap-3 items-center">
+      <input type="datetime-local" ... />
+      <button>Confirm schedule</button>
+    </div>
+  )}
+</div>
+```
+
+### Results
+- ✅ The primary action (publish now) is more prominent
+- ✅ Scheduled publishing needs a second confirmation, which prevents accidental taps
+- ✅ The publish time option is removed from the form, making the UI cleaner
+
+---
+
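+## 🛤️ Follow-up Optimizations
+
+### Backend Scheduled Publishing (Not Yet Implemented)
+**Current state**: Scheduled publishing relies on each platform's own scheduling (the publish time is set on the platform)
+
+**Direction**: Move to backend scheduled tasks
+- Use APScheduler for task scheduling
+- Persist scheduled tasks in the database
+- Have the backend trigger the publish API automatically when the time arrives
+- Support viewing/cancelling scheduled tasks
+
+**Benefits**:
+- Unified logic that does not depend on each platform's scheduling UI
+- More flexible; scheduled tasks can be managed
+- Platform page updates do not break the feature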
+
+---
+
+## 🤖 Qwen3-TTS 0.6B Voice Cloning Deployment
+
+### Background
+To let users clone their own voice, deploy the Qwen3-TTS 0.6B-Base model, which supports fast cloning from a 3-second reference audio clip.
+
+### GPU Allocation
+| GPU | Service | Model |
+|-----|------|------|
+| GPU0 | Qwen3-TTS | 0.6B-Base (voice cloning) |
+| GPU1 | LatentSync | 1.6 (lip sync) |
+
+### Deployment Steps
+
+#### 1. Clone the repository
+```bash
+cd /home/rongye/ProgramFiles/ViGent2/models
+git clone https://github.com/QwenLM/Qwen3-TTS.git
+```
+
+#### 2. Create the conda environment
+```bash
+conda create -n qwen-tts python=3.10 -y
+conda activate qwen-tts
+```
+
+#### 3. Install dependencies
+```bash
+cd Qwen3-TTS
+pip install -e .
+conda install -y -c conda-forge sox  # audio processing dependency
+```
+
+#### 4. Download the model weights (via ModelScope, faster inside mainland China)
+```bash
+pip install modelscope
+# Tokenizer (651MB)
+modelscope download --model Qwen/Qwen3-TTS-Tokenizer-12Hz --local_dir ./checkpoints/Tokenizer
+# 0.6B-Base model (2.4GB)
+modelscope download --model Qwen/Qwen3-TTS-12Hz-0.6B-Base --local_dir ./checkpoints/0.6B-Base
+```
+
+#### 5. Test inference
+```python
+# test_inference.py
+import torch
+import soundfile as sf
+from qwen_tts import Qwen3TTSModel
+
+model = Qwen3TTSModel.from_pretrained(
+    "./checkpoints/0.6B-Base",
+    device_map="cuda:0",
+    dtype=torch.bfloat16,
+)
+
+wavs, sr = model.generate_voice_clone(
+    text="测试文本",
+    language="Chinese",
+    ref_audio="./examples/myvoice.wav",
+    ref_text="参考音频的文字内容",
+)
+sf.write("output.wav", wavs[0], sr)
+```
+
+### Test Results
+- ✅ Model loaded successfully (GPU0)
+- ✅ Voice cloning inference succeeded
+- ✅ Output audio is 24000Hz with good quality
+
+### Directory Structure
+```
+models/Qwen3-TTS/
+├── checkpoints/
+│   ├── Tokenizer/     # 651MB
+│   └── 0.6B-Base/     # 2.4GB
+├── qwen_tts/          # Source code
+├── examples/
+│   └── myvoice.wav    # Reference audio
+└── test_inference.py  # Test script
+```
+
+---
+
+## 📁 Files Changed Today
+
+| File | Change | Description |
+|------|----------|------|
+| `frontend/src/shared/api/axios.ts` | Modified | Axios global interceptor (automatic redirect on 401/403) |
+| `frontend/src/app/layout.tsx` | Modified | viewport configuration + body gradient background |
+| `frontend/src/app/globals.css` | Modified | Safe area CSS support |
+| `frontend/src/app/page.tsx` | Modified | Removed per-page gradient + responsive header |
+| `frontend/src/app/login/page.tsx` | Modified | Removed per-page gradient |
+| `frontend/src/app/publish/page.tsx` | Modified | Responsive header + publish button redesign |
+| `frontend/src/app/admin/page.tsx` | Modified | Removed per-page gradient |
+| `frontend/src/app/register/page.tsx` | Modified | Removed per-page gradient |
+| `README.md` | Modified | Added the "iOS/Android mobile adaptation" feature note |
+| `Docs/FRONTEND_DEV.md` | Modified | iOS Safari safe area compatibility rules + mobile responsive rules |
+| `models/Qwen3-TTS/` | Added | Qwen3-TTS voice cloning model deployment |
+| `Docs/QWEN3_TTS_DEPLOY.md` | Added | Qwen3-TTS deployment guide |
+
+---
+
+## 🔗 Related Documents
+
+- [task_complete.md](../task_complete.md) - Task overview
+- [Day11.md](./Day11.md) - Upload architecture refactor
+- [QWEN3_TTS_DEPLOY.md](../QWEN3_TTS_DEPLOY.md) - Qwen3-TTS deployment guide
--- a/Docs/DevLogs/Day13.md
+++ b/Docs/DevLogs/Day13.md
@@ -0,0 +1,431 @@
+# Day 13 - Voice Cloning Integration + Subtitles
+
+**Date**: 2026-01-29
+
+---
+
+## 🎙️ Qwen3-TTS Service Integration
+
+### Background
+With the Qwen3-TTS model deployed on Day 12, today's focus is integrating it into the ViGent2 system to provide the complete voice cloning feature.
+
+### Architecture
+
+```
+┌─────────────────────────────────────────────────────────────┐
+│                     Frontend (Next.js)                      │
+│   Ref-audio upload → TTS mode → video generation request    │
+└─────────────────────────────────────────────────────────────┘
+                            │
+                            ▼
+┌─────────────────────────────────────────────────────────────┐
+│                   Backend (FastAPI :8006)                   │
+│    ref-audios API → voice_clone_service → video_service     │
+└─────────────────────────────────────────────────────────────┘
+                            │
+                            ▼
+┌─────────────────────────────────────────────────────────────┐
+│              Qwen3-TTS service (FastAPI :8009)              │
+│            HTTP /generate → returns cloned audio            │
+└─────────────────────────────────────────────────────────────┘
+```
+
+### Qwen3-TTS HTTP Service (`qwen_tts_server.py`)
+
+A standalone FastAPI service running on port 8009:
+
+```python
+from fastapi import FastAPI, UploadFile, Form, HTTPException
+from fastapi.responses import Response
+import torch
+import soundfile as sf
+from qwen_tts import Qwen3TTSModel
+import io, os
+
+app = FastAPI(title="Qwen3-TTS Voice Clone Service")
+
+# GPU configuration
+GPU_ID = os.getenv("QWEN_TTS_GPU_ID", "0")
+model = None
+
+@app.on_event("startup")
+async def load_model():
+    global model
+    model = Qwen3TTSModel.from_pretrained(
+        "./checkpoints/0.6B-Base",
+        device_map=f"cuda:{GPU_ID}",
+        dtype=torch.bfloat16,
+    )
+
+@app.get("/health")
+async def health():
+    return {"service": "Qwen3-TTS", "ready": model is not None, "gpu_id": GPU_ID}
+
+@app.post("/generate")
+async def generate(
+    ref_audio: UploadFile,
+    text: str = Form(...),
+    ref_text: str = Form(""),
+    language: str = Form("Chinese"),
+):
+    # Save the reference audio to a temporary file
+    ref_path = f"/tmp/ref_{ref_audio.filename}"
+    with open(ref_path, "wb") as f:
+        f.write(await ref_audio.read())
+    
+    # Generate the cloned audio
+    wavs, sr = model.generate_voice_clone(
+        text=text,
+        language=language,
+        ref_audio=ref_path,
+        ref_text=ref_text or "一段参考音频。",
+    )
+    
+    # Return the audio as WAV
+    buffer = io.BytesIO()
+    sf.write(buffer, wavs[0], sr, format="WAV")
+    buffer.seek(0)
+    return Response(content=buffer.read(), media_type="audio/wav")
+```
+
+### Backend Voice Cloning Service (`voice_clone_service.py`)
+
+Calls the Qwen3-TTS service over HTTP:
+
+```python
+import aiohttp
+from loguru import logger
+
+QWEN_TTS_URL = "http://localhost:8009"
+
+async def generate_cloned_audio(
+    ref_audio_path: str,
+    text: str,
+    output_path: str,
+    ref_text: str = "",
+) -> str:
+    """Call the Qwen3-TTS service to generate cloned audio."""
+    
+    async with aiohttp.ClientSession() as session:
+        with open(ref_audio_path, "rb") as f:
+            data = aiohttp.FormData()
+            data.add_field("ref_audio", f, filename="ref.wav")
+            data.add_field("text", text)
+            data.add_field("ref_text", ref_text)
+            
+            async with session.post(f"{QWEN_TTS_URL}/generate", data=data) as resp:
+                if resp.status != 200:
+                    raise Exception(f"Qwen3-TTS error: {resp.status}")
+                    
+                audio_data = await resp.read()
+                with open(output_path, "wb") as out:
+                    out.write(audio_data)
+    
+    return output_path
+```
+
+---
+
+## 📂 Reference Audio Management API
+
+### New API Endpoints (`ref_audios.py`)
+
+| Endpoint | Method | Purpose |
+|------|------|------|
+| `/api/ref-audios` | GET | List reference audio files |
+| `/api/ref-audios` | POST | Upload a reference audio file |
+| `/api/ref-audios/{id}` | DELETE | Delete a reference audio file |
+
+### Supabase Bucket Configuration
+
+Create a dedicated storage bucket for reference audio:
+
+```sql
+-- Create the ref-audios bucket
+INSERT INTO storage.buckets (id, name, public)
+VALUES ('ref-audios', 'ref-audios', true)
+ON CONFLICT (id) DO NOTHING;
+
+-- RLS policies. Policy names must be unique per table, so these carry a bucket
+-- suffix to avoid clashing with the existing materials policies on storage.objects.
+CREATE POLICY "Allow public uploads ref-audios" ON storage.objects
+FOR INSERT TO anon WITH CHECK (bucket_id = 'ref-audios');
+
+CREATE POLICY "Allow public read ref-audios" ON storage.objects
+FOR SELECT TO anon USING (bucket_id = 'ref-audios');
+
+CREATE POLICY "Allow public delete ref-audios" ON storage.objects
+FOR DELETE TO anon USING (bucket_id = 'ref-audios');
+```
+
+---
+
+## 🎨 Frontend Voice Cloning UI
+
+### TTS Mode Selection
+
+A voice cloning option is added to the video generation page:
+
+```tsx
+{/* TTS mode selection */}
+<div className="flex gap-2 mb-4">
+  <button
+    onClick={() => setTtsMode("edge")}
+    className={`px-4 py-2 rounded-lg ${ttsMode === "edge" ? "bg-purple-600" : "bg-white/10"}`}
+  >
+    🔊 EdgeTTS
+  </button>
+  <button
+    onClick={() => setTtsMode("clone")}
+    className={`px-4 py-2 rounded-lg ${ttsMode === "clone" ? "bg-purple-600" : "bg-white/10"}`}
+  >
+    🎙️ Voice cloning
+  </button>
+</div>
+```
+
+### Reference Audio Management
+
+Adds reference audio upload and list display:
+
+| Feature | Implementation |
+|------|------|
+| Audio upload | Drag-and-drop WAV/MP3 upload, sent directly to Supabase |
+| List display | Shows file name, duration, and upload time |
+| Quick select | Click an item to use it as the reference audio |
+| Delete | Remove reference audio that is no longer needed |
+
+---
+
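+## ✅ End-to-End Test Verification
+
+### Test Flow
+1. **Upload reference audio**: 3-second reference audio → Supabase ref-audios bucket
+2. **Select voice cloning mode**: Switch the TTS mode to "Voice cloning"
+3. **Enter the script**: Test voice-over script
+4. **Generate the video**: 
+   - TTS stage calls Qwen3-TTS (17.7s)
+   - LipSync stage calls LatentSync (122.8s)
+5. **Playback check**: The voice in the video matches the reference voice
+
+### Test Results
+- ✅ Reference audio uploaded successfully
+- ✅ Qwen3-TTS generated the cloned audio (15s inference for 4.6s of audio)
+- ✅ LatentSync lip sync worked
+- ✅ Total generation time 143.1s
+- ✅ Video plays correctly in the frontend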
+
+---
+
+## 🔧 PM2 Service Configuration
+
+### New Qwen3-TTS Service
+
+**Install prerequisites**:
+```bash
+conda activate qwen-tts
+pip install fastapi uvicorn python-multipart
+```
+
+Launch script `run_qwen_tts.sh` (in the project **root directory**):
+```bash
+#!/bin/bash
+cd /home/rongye/ProgramFiles/ViGent2/models/Qwen3-TTS
+/home/rongye/ProgramFiles/miniconda3/envs/qwen-tts/bin/python qwen_tts_server.py
+```
+
+PM2 management commands:
+```bash
+# Start from the project root
+cd /home/rongye/ProgramFiles/ViGent2
+pm2 start ./run_qwen_tts.sh --name vigent2-qwen-tts
+pm2 save
+
+# Check status
+pm2 status
+
+# View logs
+pm2 logs vigent2-qwen-tts --lines 50
+```
+
+### Full Service List
+
+| Service | Port | Purpose |
+|--------|------|------|
+| vigent2-backend | 8006 | FastAPI backend |
+| vigent2-frontend | 3002 | Next.js frontend |
+| vigent2-latentsync | 8007 | LatentSync lip sync |
+| vigent2-qwen-tts | 8009 | Qwen3-TTS voice cloning |
+
+---
+
+## 📁 Files Changed Today
+
+| File | Change | Description |
+|------|----------|------|
+| `models/Qwen3-TTS/qwen_tts_server.py` | Added | Qwen3-TTS HTTP inference service |
+| `run_qwen_tts.sh` | Added | PM2 launch script (project root) |
+| `backend/app/services/voice_clone_service.py` | Added | Voice cloning service (HTTP client) |
+| `backend/app/api/ref_audios.py` | Added | Reference audio management API |
+| `backend/app/main.py` | Modified | Registered the ref-audios router |
+| `frontend/src/app/page.tsx` | Modified | TTS mode selection + reference audio UI |
+
+---
+
+## 🔗 Related Documents
+
+- [task_complete.md](../task_complete.md) - Task overview
+- [Day12.md](./Day12.md) - iOS compatibility and Qwen3-TTS deployment
+- [QWEN3_TTS_DEPLOY.md](../QWEN3_TTS_DEPLOY.md) - Qwen3-TTS deployment guide
+- [SUBTITLE_DEPLOY.md](../SUBTITLE_DEPLOY.md) - Subtitle deployment guide
+- [DEPLOY_MANUAL.md](../DEPLOY_MANUAL.md) - Full deployment manual
+
+---
+
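+## 🎬 Per-Character Highlighted Subtitles + Opening Title
+
+### Background
+
+To improve video quality, add per-character highlighted subtitles (karaoke style) and an opening title.
+
+### Technical Approach
+
+| Component | Technology | Description |
+|------|------|------|
+| Subtitle alignment | **faster-whisper** | Generates character-level timestamps |
+| Video rendering | **Remotion** | React-based video composition framework |
+
+### Architecture
+
+```
+Original pipeline:
+  text → EdgeTTS → audio → LatentSync → FFmpeg compositing → final video
+
+New pipeline:
+  text → EdgeTTS → audio ─┬→ LatentSync → lip-synced video ──┐
+                          └→ faster-whisper → captions JSON ─┴→ Remotion compositing → final video
+```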
+
+### 后端新增服务
+
+#### 1. 字幕服务 (`whisper_service.py`)
+
+基于 faster-whisper 生成字级别时间戳：
+
+```python
+from faster_whisper import WhisperModel
+
+class WhisperService:
+    def __init__(self, model_size="large-v3", device="cuda"):
+        self.model = WhisperModel(model_size, device=device)
+
+    async def align(self, audio_path: str, text: str, output_path: str):
+        segments, info = self.model.transcribe(audio_path, word_timestamps=True)
+        # 将词拆分成单字，时间戳线性插值
+        result = {"segments": [...]}
+        # 保存到 JSON
+```
+
+**字幕拆字算法**：faster-whisper 对中文返回词级别，系统自动拆分成单字并线性插值：
+
+```python
+# 输入: {"word": "大家好", "start": 0.0, "end": 0.9}
+# 输出:
+[
+  {"word": "大", "start": 0.0, "end": 0.3},
+  {"word": "家", "start": 0.3, "end": 0.6},
+  {"word": "好", "start": 0.6, "end": 0.9}
+]
+```
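+
+A minimal sketch of the even split described above, assuming each word arrives as a dict with `word`, `start` and `end` keys as in the example. The function name `split_word` and the rounding to milliseconds are illustrative choices, not taken from `whisper_service.py`:
+
+```python
+def split_word(word: dict) -> list[dict]:
+    """Split one word-level entry into per-character entries.
+
+    The word's time span is divided evenly among its characters
+    (linear interpolation between `start` and `end`).
+    """
+    chars = [c for c in word["word"] if not c.isspace()]
+    if not chars:
+        return []
+    step = (word["end"] - word["start"]) / len(chars)
+    return [
+        {
+            "word": c,
+            # Rounding to milliseconds is an assumption made for readable output.
+            "start": round(word["start"] + i * step, 3),
+            "end": round(word["start"] + (i + 1) * step, 3),
+        }
+        for i, c in enumerate(chars)
+    ]
+```
+
+An even split ignores the real duration of each character; it is a cheap approximation that needs nothing beyond the word boundaries faster-whisper already returns.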
+
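+#### 2. Remotion rendering service (`remotion_service.py`)
+
+Invokes Remotion to render the subtitles and the title:
+
+```python
+import asyncio
+
+class RemotionService:
+    async def render(self, video_path, output_path, captions_path, title):
+        # Arguments are passed as a list, so no shell quoting is involved.
+        # The remaining flags are elided here, as in the original log.
+        cmd = ["npx", "ts-node", "render.ts", "--video", video_path, "--output", output_path]
+        proc = await asyncio.create_subprocess_exec(*cmd)
+        await proc.wait()  # run the render
+```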
+
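+### Remotion project layout
+
+```
+remotion/
+├── package.json              # Node.js dependencies
+├── render.ts                 # Server-side render script
+└── src/
+    ├── Video.tsx             # Main video component
+    ├── components/
+    │   ├── Title.tsx         # Opening title (fade in/out)
+    │   ├── Subtitles.tsx     # Per-character highlighted subtitles
+    │   └── VideoLayer.tsx    # Video layer
+    └── utils/
+        └── captions.ts       # Subtitle data types
+```
+
+### Frontend UI
+
+A new section for title and subtitle settings:
+
+| Feature | Description |
+|------|------|
+| Opening title input | Optional; shown for 3 seconds at the start of the video |
+| Subtitle toggle | On by default; can be turned off |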
+
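+### Problems encountered and fixes
+
+#### Problem 1: `fs` module error
+
+**Symptom**: the Remotion bundle failed with `fs.js doesn't exist`
+
+**Cause**: `captions.ts` contained a `loadCaptions` function that used the Node.js `fs` module, which is not available in the bundled browser code
+
+**Fix**: removed the unused `loadCaptions` function
+
+#### Problem 2: video file could not be read
+
+**Symptom**: the local video could not be loaded through the `file://` protocol
+
+**Fix**:
+1. `render.ts` sets `publicDir` to the directory that contains the video
+2. `VideoLayer.tsx` loads the video with `staticFile()`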
+
+```typescript
+// render.ts
+const publicDir = path.dirname(path.resolve(options.videoPath));
+const bundleLocation = await bundle({
+  entryPoint: path.resolve(__dirname, './src/index.ts'),
+  publicDir,  // key setting
+});
+
+// VideoLayer.tsx
+const videoUrl = staticFile(videoSrc);
+```
+
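+### Test results
+
+- ✅ faster-whisper subtitle alignment succeeds (~1 s)
+- ✅ Remotion rendering succeeds (~10 s)
+- ✅ Per-character subtitle highlighting works
+- ✅ Opening title fades in and out correctly
+- ✅ Fallback works (falls back to FFmpeg when Remotion fails)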
+
+---
+
+## 📁 Files changed today (complete)
+
+| File | Change | Description |
+|------|----------|------|
+| `models/Qwen3-TTS/qwen_tts_server.py` | Added | Qwen3-TTS HTTP inference service |
+| `run_qwen_tts.sh` | Added | PM2 start script (repository root) |
+| `backend/app/services/voice_clone_service.py` | Added | Voice cloning service (calls the TTS server over HTTP) |
+| `backend/app/services/whisper_service.py` | Added | Subtitle alignment service (faster-whisper) |
+| `backend/app/services/remotion_service.py` | Added | Remotion rendering service |
+| `backend/app/api/ref_audios.py` | Added | Reference audio management API |
+| `backend/app/api/videos.py` | Modified | Integrates subtitles and the opening title |
+| `backend/app/main.py` | Modified | Registers the ref-audios router |
+| `backend/requirements.txt` | Modified | Adds the faster-whisper dependency |
+| `remotion/` | Added | Remotion video rendering project |
+| `frontend/src/app/page.tsx` | Modified | TTS mode selector + title/subtitle UI |
+| `Docs/SUBTITLE_DEPLOY.md` | Added | Subtitle feature deployment guide |
--- a/Docs/DevLogs/Day14.md
+++ b/Docs/DevLogs/Day14.md
@@ -0,0 +1,402 @@
+# Day 14 - Model upgrade + title/tag generation + frontend fixes
+
+**Date**: 2026-01-30
+
+---
+
+## 🚀 Qwen3-TTS model upgrade (0.6B → 1.7B)
+
+### Background
+
+To improve voice cloning quality, the Qwen3-TTS model was upgraded from 0.6B-Base to 1.7B-Base.
+
+### Changes
+
+| Item | Before | After |
+|------|--------|--------|
+| Model | 0.6B-Base | **1.7B-Base** |
+| Size | 2.4GB | 6.8GB |
+| Quality | Baseline | Higher |
+
+### Code change
+
+**File**: `models/Qwen3-TTS/qwen_tts_server.py`
+
+```python
+# Before
+MODEL_PATH = Path(__file__).parent / "checkpoints" / "0.6B-Base"
+
+# After
+MODEL_PATH = Path(__file__).parent / "checkpoints" / "1.7B-Base"
+```
+
+### Model download
+
+```bash
+cd /home/rongye/ProgramFiles/ViGent2/models/Qwen3-TTS
+
+# Download the 1.7B-Base model (6.8GB)
+modelscope download --model Qwen/Qwen3-TTS-12Hz-1.7B-Base --local_dir ./checkpoints/1.7B-Base
+```
+
+### Results
+
+- ✅ Model loads correctly (GPU0, bfloat16)
+- ✅ Voice cloning quality improved
+- ✅ Inference speed is acceptable
+
+---
+
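+## 🎨 Title and subtitle display improvements
+
+### Subtitle component (`Subtitles.tsx`)
+
+**File**: `remotion/src/components/Subtitles.tsx`
+
+Changes:
+- Adjusted the highlight color settings
+- Improved the text outline (multi-layer shadow)
+- Adjusted letter spacing and line height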
+```typescript
+export const Subtitles: React.FC<SubtitlesProps> = ({
+  captions,
+  highlightColor = '#FFFF00',  // highlight color
+  normalColor = '#FFFFFF',      // color of non-highlighted text
+  fontSize = 52,
+}) => {
+  // Style tweaks
+  const style = {
+    textShadow: `
+      2px 2px 4px rgba(0,0,0,0.8),
+      -2px -2px 4px rgba(0,0,0,0.8),
+      ...
+    `,
+    letterSpacing: '2px',
+    lineHeight: 1.4,
+    maxWidth: '90%',
+  };
+};
+```
+
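+### Title component (`Title.tsx`)
+
+**File**: `remotion/src/components/Title.tsx`
+
+Changes:
+- Fade-in and fade-out animation
+- Slide-down entrance animation
+- Configurable display duration
+
+```typescript
+interface TitleProps {
+  title: string;
+  duration?: number;        // how long the title is shown (seconds, default 3)
+  fadeOutStart?: number;    // when the fade-out starts (seconds, default 2)
+}
+
+// Animation timing
+// Fade in: 0-0.5 s
+// Fade out: 2-3 s
+// Slide down: 0-0.5 s, -20px → 0px
+```
+
+### Results
+
+- ✅ Subtitles are easier to read
+- ✅ Title animation is smoother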
+
+---
+
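+## 🤖 Automatic title and tag generation
+
+### Description
+
+Uses AI (Zhipu GLM-4-Flash) to generate a video title and tags from the voice-over script.
+
+### Backend
+
+#### 1. GLM service (`glm_service.py`)
+
+**File**: `backend/app/services/glm_service.py`
+
+```python
+class GLMService:
+    """Zhipu GLM AI service"""
+
+    async def generate_meta(self, text: str) -> dict:
+        """Generate a title and tags from the script"""
+
+        # The prompt stays in Chinese: it asks for a catchy title of at most
+        # 10 characters and exactly 3 tags, returned as a JSON object.
+        prompt = """根据以下口播文案，生成一个吸引人的短视频标题和3个相关标签。
+
+要求：
+1. 标题要简洁有力，能吸引观众点击，不超过10个字
+2. 标签要与内容相关，便于搜索和推荐，只要3个
+
+返回格式：{"title": "标题", "tags": ["标签1", "标签2", "标签3"]}
+"""
+        # Call the GLM-4-Flash API
+        response = await self._call_api(prompt + text)
+        return self._parse_json(response)
+```
+
+**Tolerant JSON parsing**:
+- Parses the response directly as JSON
+- Falls back to extracting an embedded JSON object
+- Falls back to extracting a ```json fenced code block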
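+
+The three fallbacks above could be sketched roughly as follows. This is an illustration only: the function name `parse_json_lenient` and the order in which the fallbacks run are assumptions, not the actual `_parse_json` implementation.
+
+```python
+import json
+import re
+from typing import Optional
+
+FENCE = "`" * 3  # built at runtime so the example contains no literal fence
+FENCED_BLOCK = re.compile(FENCE + r"(?:json)?\s*(.*?)" + FENCE, re.DOTALL)
+
+
+def parse_json_lenient(raw: str) -> Optional[dict]:
+    """Try progressively looser ways of reading a JSON object from model output."""
+    candidates = [raw]  # 1. the whole response
+    fenced = FENCED_BLOCK.search(raw)
+    if fenced:
+        candidates.append(fenced.group(1))  # 2. contents of a fenced block
+    start, end = raw.find("{"), raw.rfind("}")
+    if start != -1 and end > start:
+        candidates.append(raw[start:end + 1])  # 3. outermost {...} span
+    for candidate in candidates:
+        try:
+            parsed = json.loads(candidate)
+        except json.JSONDecodeError:
+            continue
+        if isinstance(parsed, dict):
+            return parsed
+    return None  # the caller decides how to handle an unparseable reply
+```
+
+Returning `None` instead of raising keeps the decision about an error response with the API layer.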
+
+#### 2. API endpoint (`ai.py`)
+
+**File**: `backend/app/api/ai.py`
+
+```python
+from pydantic import BaseModel
+
+class GenerateMetaRequest(BaseModel):
+    text: str  # voice-over script
+
+class GenerateMetaResponse(BaseModel):
+    title: str        # generated title
+    tags: list[str]   # generated tags
+
+@router.post("/generate-meta", response_model=GenerateMetaResponse)
+async def generate_meta(request: GenerateMetaRequest):
+    """Generate a title and tags with AI"""
+    result = await glm_service.generate_meta(request.text)
+    return result
+```
+
+### Frontend
+
+**File**: `frontend/src/app/page.tsx`
+
+#### UI button
+
+```tsx
+<button
+  onClick={handleGenerateMeta}
+  disabled={isGeneratingMeta || !text.trim()}
+  className="px-2 py-1 text-xs rounded transition-all whitespace-nowrap"
+>
+  {isGeneratingMeta ? "⏳ 生成中..." : "🤖 AI生成标题标签"}
+</button>
+```
+
+#### Handler
+
+```typescript
+const handleGenerateMeta = async () => {
+  if (!text.trim()) {
+    alert("请先输入口播文案");
+    return;
+  }
+
+  setIsGeneratingMeta(true);
+  try {
+    const { data } = await api.post('/api/ai/generate-meta', { text: text.trim() });
+
+    // Update the title on the home page
+    setVideoTitle(data.title || "");
+
+    // Sync to the publish page through localStorage
+    localStorage.setItem(`vigent_${storageKey}_publish_title`, data.title || "");
+    localStorage.setItem(`vigent_${storageKey}_publish_tags`, JSON.stringify(data.tags || []));
+  } catch (err: any) {
+    alert(`AI 生成失败: ${err.message}`);
+  } finally {
+    setIsGeneratingMeta(false);
+  }
+};
+```
+
+### Publish page integration
+
+**File**: `frontend/src/app/publish/page.tsx`
+
+Restores the AI-generated title and tags from localStorage:
+
+```typescript
+// Restore the title and tags
+const savedTitle = localStorage.getItem(`vigent_${storageKey}_publish_title`);
+const savedTags = localStorage.getItem(`vigent_${storageKey}_publish_tags`);
+
+if (savedTags) {
+  try {
+    const parsed = JSON.parse(savedTags);
+    if (Array.isArray(parsed)) {
+      setTags(parsed.join(', '));  // array → comma-separated string
+    } else {
+      setTags(savedTags);
+    }
+  } catch {
+    setTags(savedTags);
+  }
+}
+```
+
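+### Results
+
+- ✅ AI title and tag generation works
+- ✅ Data syncs to the publish page automatically
+- ✅ Saved tags are accepted both as a JSON array and as a plain string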
+
+---
+
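+## 🐛 Fix: frontend text was not persisted
+
+### Problem
+
+**Symptom**: after a page refresh, the script, title and other user input were lost
+
+**Cause**:
+1. When the auth state fails to restore, `userId` is `null`
+2. The old code checked `!userId` and then overwrote the localStorage data with default values
+3. As a result, the user's saved data was wiped
+
+### Solution
+
+**File**: `frontend/src/app/page.tsx`
+
+#### 1. Add a "restore finished" flag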
+
+```typescript
+const [isRestored, setIsRestored] = useState(false);
+```
+
+#### 2. Restore data only after authentication has finished
+
+```typescript
+useEffect(() => {
+  if (isAuthLoading) return;  // wait until authentication has finished
+
+  // Use userId, or 'guest', as the key
+  const key = userId || 'guest';
+
+  // Restore data from localStorage
+  const savedText = localStorage.getItem(`vigent_${key}_text`);
+  if (savedText) setText(savedText);
+
+  // ... restore the other fields
+
+  setIsRestored(true);  // mark the restore as finished
+}, [userId, isAuthLoading]);
+```
+
+#### 3. Save only after the restore has finished
+
+```typescript
+useEffect(() => {
+  if (isRestored) {
+    localStorage.setItem(`vigent_${storageKey}_text`, text);
+  }
+}, [text, storageKey, isRestored]);
+```
+
+### Per-user isolation
+
+```typescript
+const storageKey = userId || 'guest';
+```
+
+| User state | storageKey | Notes |
+|----------|------------|------|
+| Logged in | `user_xxx` | Data is isolated per user |
+| Not logged in / auth failed | `guest` | One shared key |
+
+### Restore flow
+
+```
+1. Page loads
+   ↓
+2. Check isAuthLoading
+   ├─ true: wait for authentication to finish
+   └─ false: continue
+   ↓
+3. Determine storageKey (userId || 'guest')
+   ↓
+4. Read data from localStorage
+   ├─ saved data found: restore it into state
+   └─ no saved data: use the defaults
+   ↓
+5. Set isRestored = true
+   ↓
+6. Save to localStorage on later state changes
+```
+
+### Persisted items
+
+| Key | Description |
+|-----|------|
+| `vigent_${key}_text` | Voice-over script |
+| `vigent_${key}_title` | Video title |
+| `vigent_${key}_subtitles` | Subtitle toggle |
+| `vigent_${key}_ttsMode` | TTS mode |
+| `vigent_${key}_voice` | Selected voice |
+| `vigent_${key}_material` | Selected material |
+| `vigent_${key}_publish_title` | Publish title |
+| `vigent_${key}_publish_tags` | Publish tags |
+
+### Results
+
+- ✅ Data is restored correctly after a page refresh
+- ✅ A failed authentication no longer overwrites saved data
+- ✅ Data stays isolated between users
+
+---
+
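+## 🐛 Fix: login page refresh loop
+
+### Problem
+
+**Symptom**: when not logged in, the login page kept refreshing and the form could not be used.
+
+**Cause**:
+1. `AuthProvider` calls `/api/auth/me` on initialization
+2. The request returns 401 when the user is not logged in
+3. The global `axios` interceptor redirects to `/login` on 401/403
+4. The login page is itself inside the Provider, so the redirect loops
+
+### Solution
+
+**File**: `frontend/src/shared/api/axios.ts`
+
+The interceptor skips the redirect on public routes and only redirects to login from protected pages:
+
+```typescript
+const PUBLIC_PATHS = new Set(['/login', '/register']);
+const isPublicPath = typeof window !== 'undefined' && PUBLIC_PATHS.has(window.location.pathname);
+
+if ((status === 401 || status === 403) && !isRedirecting && !isPublicPath) {
+  // ... keep the existing redirect logic
+}
+```
+
+### Results
+
+- ✅ The login page no longer refreshes and the form accepts input
+- ✅ Protected pages still redirect to the login page on 401/403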
+
+---
+
+## 📁 Files changed today
+
+| File | Change | Description |
+|------|----------|------|
+| `models/Qwen3-TTS/qwen_tts_server.py` | Modified | Model path upgraded to 1.7B-Base |
+| `Docs/QWEN3_TTS_DEPLOY.md` | Modified | Deployment guide updated for the 1.7B model |
+| `remotion/src/components/Subtitles.tsx` | Modified | Improved subtitle display |
+| `remotion/src/components/Title.tsx` | Modified | Improved title animation |
+| `backend/app/services/glm_service.py` | Added | GLM AI service |
+| `backend/app/api/ai.py` | Added | AI title/tag generation API |
+| `backend/app/main.py` | Modified | Registers the ai router |
+| `frontend/src/app/page.tsx` | Modified | AI generate button + localStorage fix |
+| `frontend/src/app/publish/page.tsx` | Modified | Restores AI-generated tags |
+| `frontend/src/shared/api/axios.ts` | Modified | Public routes skip the 401/403 login redirect |
+
+---
+
+## 🔗 Related documents
+
+- [task_complete.md](../task_complete.md) - Task overview
+- [Day13.md](./Day13.md) - Voice cloning integration + subtitles
+- [QWEN3_TTS_DEPLOY.md](../QWEN3_TTS_DEPLOY.md) - Qwen3-TTS 1.7B deployment guide
--- a/Docs/DevLogs/Day15.md
+++ b/Docs/DevLogs/Day15.md
@@ -0,0 +1,410 @@
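+# Day 15 - Migration to phone-number login + account settings
+
+**Date**: 2026-02-02
+
+---
+
+## 🔐 Authentication migration: email → phone number
+
+### Background
+
+To meet business requirements, user authentication moves from email login to phone-number login (11-digit mainland China mobile numbers).
+
+### Scope
+
+| Component | Change |
+|------|----------|
+| Database schema | The `email` column is replaced by `phone` |
+| Backend API | The register / login / current-user endpoints use `phone` |
+| Frontend pages | The login and registration pages use a phone-number input |
+| Admin configuration | `ADMIN_EMAIL` becomes `ADMIN_PHONE` |
+
+---
+
+## 📦 Backend changes
+
+### 1. Database schema (`schema.sql`)
+
+**File**: `backend/database/schema.sql`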
+
+```sql
+CREATE TABLE users (
+    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
+    phone TEXT UNIQUE NOT NULL,  -- replaces the former email column
+    password_hash TEXT NOT NULL,
+    username TEXT,
+    role TEXT DEFAULT 'pending' CHECK (role IN ('pending', 'user', 'admin')),
+    is_active BOOLEAN DEFAULT FALSE,
+    expires_at TIMESTAMP WITH TIME ZONE,
+    created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),
+    updated_at TIMESTAMP WITH TIME ZONE DEFAULT NOW()
+);
+
+CREATE INDEX idx_users_phone ON users(phone);
+```
+
+### 2. Authentication API (`auth.py`)
+
+**File**: `backend/app/api/auth.py`
+
+#### Updated request model
+
+```python
+import re
+from typing import Optional
+
+from pydantic import BaseModel, field_validator
+
+class RegisterRequest(BaseModel):
+    phone: str
+    password: str
+    username: Optional[str] = None
+
+    @field_validator('phone')
+    @classmethod
+    def validate_phone(cls, v):
+        # fullmatch with [0-9] rejects a trailing newline and non-ASCII digits,
+        # both of which re.match(r'^\d{11}$', v) would accept
+        if not re.fullmatch(r'[0-9]{11}', v):
+            raise ValueError('手机号必须是11位数字')
+        return v
+```
+
+#### New change-password endpoint
+
+```python
+class ChangePasswordRequest(BaseModel):
+    old_password: str
+    new_password: str
+
+    @field_validator('new_password')
+    @classmethod
+    def validate_new_password(cls, v):
+        if len(v) < 6:
+            raise ValueError('新密码长度至少6位')
+        return v
+
+@router.post("/change-password")
+async def change_password(request: ChangePasswordRequest, req: Request, response: Response):
+    """Change the password after verifying the current one"""
+    # 1. Verify the current password
+    # 2. Update the password hash
+    # 3. Issue a new session token
+    # 4. Return the new JWT cookie
+    ...
+```
+
+### 3. Configuration
+
+**File**: `backend/app/core/config.py`
+
+```python
+# Admin settings
+ADMIN_PHONE: str = ""  # formerly ADMIN_EMAIL
+ADMIN_PASSWORD: str = ""
+```
+
+**File**: `backend/.env`
+
+```bash
+# Real values are deliberately not recorded in this log
+ADMIN_PHONE=<11-digit admin phone number>
+ADMIN_PASSWORD=<admin password>
+```
+
+### 4. Admin initialization (`main.py`)
+
+**File**: `backend/app/main.py`
+
+```python
+@app.on_event("startup")
+async def init_admin():
+    admin_phone = settings.ADMIN_PHONE  # formerly ADMIN_EMAIL
+    # ... create the admin user using the phone field
+```
+
+### 5. Admin API (`admin.py`)
+
+**File**: `backend/app/api/admin.py`
+
+```python
+class UserListItem(BaseModel):
+    id: str
+    phone: str  # formerly email
+    username: Optional[str]
+    role: str
+    is_active: bool
+    expires_at: Optional[str]
+    created_at: str
+```
+
+---
+
+## 🖥️ Frontend changes
+
+### 1. Login page (`login/page.tsx`)
+
+**File**: `frontend/src/app/login/page.tsx`
+
+```tsx
+const [phone, setPhone] = useState('');
+
+// Validate the phone number format
+if (!/^\d{11}$/.test(phone)) {
+    setError('请输入正确的11位手机号');
+    return;
+}
+
+<input
+    type="tel"
+    value={phone}
+    onChange={(e) => setPhone(e.target.value.replace(/\D/g, '').slice(0, 11))}
+    maxLength={11}
+    placeholder="请输入11位手机号"
+/>
+```
+
+### 2. Registration page (`register/page.tsx`)
+
+Uses the same phone-number input and adds the 11-digit check.
+
+### 3. Auth helpers (`auth.ts`)
+
+**File**: `frontend/src/shared/lib/auth.ts`
+
+```typescript
+export interface User {
+    id: string;
+    phone: string;  // formerly email
+    username: string | null;
+    role: string;
+    is_active: boolean;
+}
+
+export async function login(phone: string, password: string): Promise<AuthResponse> { ... }
+export async function register(phone: string, password: string, username?: string): Promise<AuthResponse> { ... }
+export async function changePassword(oldPassword: string, newPassword: string): Promise<AuthResponse> { ... }
+```
+
+### 4. Account settings dropdown on the home page (`page.tsx`)
+
+**File**: `frontend/src/app/page.tsx`
+
+The former "退出" (log out) button becomes an account settings dropdown:
+
+```tsx
+function AccountSettingsDropdown() {
+  const [isOpen, setIsOpen] = useState(false);
+  const [showPasswordModal, setShowPasswordModal] = useState(false);
+  // ...
+
+  return (
+    <div className="relative">
+      <button onClick={() => setIsOpen(!isOpen)}>
+        ⚙️ 账户
+      </button>
+
+      {/* Dropdown menu */}
+      {isOpen && (
+        <div className="absolute right-0 mt-2 w-40 bg-gray-800 ...">
+          <button onClick={() => setShowPasswordModal(true)}>
+            🔐 修改密码
+          </button>
+          <button onClick={handleLogout} className="text-red-300">
+            🚪 退出登录
+          </button>
+        </div>
+      )}
+
+      {/* Change-password modal */}
+      {showPasswordModal && (
+        <div className="fixed inset-0 z-50 ...">
+          <form onSubmit={handleChangePassword}>
+            <input placeholder="当前密码" />
+            <input placeholder="新密码" />
+            <input placeholder="确认新密码" />
+          </form>
+        </div>
+      )}
+    </div>
+  );
+}
+```
+
+### 5. Admin page (`admin/page.tsx`)
+
+**File**: `frontend/src/app/admin/page.tsx`
+
+```tsx
+interface UserListItem {
+    id: string;
+    phone: string;  // formerly email
+    // ...
+}
+
+// Show the phone number instead of the email
+<div className="text-gray-400 text-sm">{user.phone}</div>
+```
+
+---
+
+## 🗄️ Database migration
+
+### Migration script
+
+**File**: `backend/database/migrate_to_phone.sql`
+
+```sql
+-- Drop the old tables (CASCADE handles foreign-key dependencies).
+-- Note: this deletes all existing users, sessions and social accounts.
+DROP TABLE IF EXISTS user_sessions CASCADE;
+DROP TABLE IF EXISTS social_accounts CASCADE;
+DROP TABLE IF EXISTS users CASCADE;
+
+-- Recreate the users table with the phone column
+CREATE TABLE users (
+    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
+    phone TEXT UNIQUE NOT NULL,
+    -- ...
+);
+
+-- Recreate the dependent tables and indexes
+CREATE TABLE user_sessions (...);
+CREATE TABLE social_accounts (...);
+CREATE INDEX idx_users_phone ON users(phone);
+```
+
+### How to run it
+
+```bash
+# Option 1: Docker
+docker exec -i supabase-db psql -U postgres < backend/database/migrate_to_phone.sql
+
+# Option 2: Supabase Studio SQL Editor
+# Open https://supabase.hbyrkj.top -> SQL Editor -> paste and run
+```
+
+---
+
+## ✅ Deployment steps
+
+```bash
+# 1. Run the database migration
+docker exec -i supabase-db psql -U postgres < backend/database/migrate_to_phone.sql
+
+# 2. Rebuild the frontend
+cd frontend && npm run build
+
+# 3. Restart the services
+pm2 restart vigent2-backend vigent2-frontend
+```
+
+---
+
+## 📁 Files changed today
+
+| File | Change | Description |
+|------|----------|------|
+| `backend/database/schema.sql` | Modified | email → phone |
+| `backend/database/migrate_to_phone.sql` | Added | Database migration script |
+| `backend/app/api/auth.py` | Modified | Phone validation + change-password API |
+| `backend/app/api/admin.py` | Modified | UserListItem.email → phone |
+| `backend/app/core/config.py` | Modified | ADMIN_EMAIL → ADMIN_PHONE |
+| `backend/app/main.py` | Modified | Admin initialization uses phone |
+| `backend/.env` | Modified | Sets `ADMIN_PHONE` |
+| `frontend/src/app/login/page.tsx` | Modified | Phone login + 11-digit check |
+| `frontend/src/app/register/page.tsx` | Modified | Phone registration + 11-digit check |
+| `frontend/src/shared/lib/auth.ts` | Modified | phone parameter + changePassword function |
+| `frontend/src/app/page.tsx` | Modified | AccountSettingsDropdown component |
+| `frontend/src/app/admin/page.tsx` | Modified | User list shows the phone number |
+| `frontend/src/contexts/AuthContext.tsx` | Modified | Stores the full user object, including expires_at |
+
+---
+
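+## 🆕 Follow-up work (Day 15, afternoon)
+
+### Account expiry display
+
+The account dropdown shows the user's expiry date:
+
+| Case | Displayed as |
+|----------|------|
+| `expires_at` is set | `2026-03-15` |
+| NULL | `永久有效` (never expires) |
+
+**Related changes**:
+- `backend/app/api/auth.py`: `UserResponse` gains an `expires_at` field
+- `frontend/src/contexts/AuthContext.tsx`: stores the full user object
+- `frontend/src/app/page.tsx`: formats and displays the expiry date
+
+### Click outside to close the dropdown
+
+Uses `useRef` + `useEffect` to listen for global click events; a click outside the menu closes it.
+
+### Forced re-login after a password change
+
+After the password is changed successfully:
+1. Show "密码修改成功，正在跳转登录页..." (password changed, redirecting to the login page)
+2. Call the logout API after 1.5 seconds
+3. Redirect to the login page
+
+---
+
+## 🔗 Related documents
+
+- [task_complete.md](../task_complete.md) - Task overview
+- [Day14.md](./Day14.md) - Model upgrade + AI titles and tags
+- [AUTH_DEPLOY.md](../AUTH_DEPLOY.md) - Authentication deployment guide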
+
+---
+
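+## 🤖 Model and feature enhancements (Day 15, evening)
+
+### 1. GLM-4.7-Flash upgrade
+
+**File**: `backend/app/services/glm_service.py`
+
+The script-rewriting model is upgraded from `glm-4-flash` to `glm-4.7-flash`:
+
+```python
+response = client.chat.completions.create(
+    model="glm-4.7-flash",  # Upgrade from glm-4-flash
+    messages=[...],
+    # ...
+)
+```
+
+**Improvements**:
+- Faster responses
+- Rewritten scripts are more fluent and more coherent
+
+### 2. Standalone script extraction assistant
+
+A standalone tool that extracts the spoken text from a video/audio file or from a URL.
+
+#### Backend (`backend/app/api/tools.py`)
+
+- **Multiple sources**: file upload (MP4/MP3/WAV) or download from a URL
+- **Download strategy**:
+  - `yt-dlp`: general-purpose downloader (Douyin/TikTok/Bilibili)
+  - `Playwright`: fallback path (Bilibili Dashboard API, Douyin Cookie Bypass)
+- **URL cleanup**: a regular expression extracts the HTTP link from pasted share text
+- **Pipeline**: download -> FFmpeg converts to WAV (16 kHz) -> Whisper transcription -> GLM-4.7 rewriting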
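+
+As an illustration of the URL cleanup step, the sketch below pulls the first link out of pasted share text. The pattern, the function name `extract_url` and the sample share strings are assumptions for this example; the expression actually used in `tools.py` is not shown in this log.
+
+```python
+import re
+from typing import Optional
+
+# Only URL-safe ASCII characters are matched, so Chinese text or full-width
+# punctuation directly after the link ends the match.
+URL_RE = re.compile(r"https?://[A-Za-z0-9\-._~:/?#@!$&'*+,;=%]+")
+
+
+def extract_url(share_text: str) -> Optional[str]:
+    """Return the first HTTP(S) link found in share text, or None."""
+    match = URL_RE.search(share_text)
+    if match is None:
+        return None
+    # Drop ASCII punctuation that commonly trails a pasted link.
+    return match.group(0).rstrip(".,;!'")
+```
+
+Restricting the character class to ASCII matters for share text, where the link is often followed by Chinese text with no separating space.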
+
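+#### Frontend (`frontend/src/components/ScriptExtractionModal.tsx`)
+
+- **Standalone modal**: opened from the top navigation bar
+- **Features**:
+  - Paste a link or drag and drop a file
+  - Live progress display (download -> transcription -> rewriting)
+  - **One-click fill**: inserts the extracted result into the main script input
+  - **Auto-detection**: tells the platform name apart from the link automatically
+- **Interaction details**:
+  - Clicking the backdrop does not close the modal by accident
+  - Copy works in plain-HTTP environments (textarea fallback)
+
+### 3. Preview for uploaded videos
+
+Uploaded videos in the material list (`frontend/src/app/page.tsx`) can now be previewed:
+- Clicking the thumbnail opens a video player modal
+- The modal offers shortcuts to download and to publish
+
+---
+
+## 📝 Task list update
+
+- [x] Authentication migration (phone number)
+- [x] Account management (password change / expiry date)
+- [x] GLM-4.7 model upgrade
+- [x] Standalone script extraction assistant (Bilibili/Douyin support)
+- [x] Video preview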
--- a/Docs/DevLogs/Day16.md
+++ b/Docs/DevLogs/Day16.md
@@ -0,0 +1,139 @@
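+## 🔧 Qwen-TTS Flash Attention optimization (10:00)
+
+### Background
+By default the Qwen3-TTS 1.7B model loads slowly and uses a lot of GPU memory during inference. Installing Flash Attention 2 noticeably improves both load time and inference efficiency.
+
+### Implementation
+Install `flash-attn` in the `qwen-tts` Conda environment:
+
+```bash
+conda activate qwen-tts
+pip install -U flash-attn --no-build-isolation
+```
+
+### Verified results
+- **Load time**: reduced from ~60 s to **8.9 s** ⚡
+- **GPU memory**: noticeably lower, removing the OOM risk
+- **Code changes**: none; this is an environment-only optimization (detected automatically)
+
+## 🛡️ Service watchdog (10:30)
+
+### Problem
+The long-running services (`vigent2-qwen-tts` and `vigent2-latentsync`) can hang because of GPU memory fragmentation or long uptime (port open but unresponsive).
+
+### Solution
+A Python watchdog script polls each service's `/health` endpoint every 30 seconds and restarts the service automatically after 3 consecutive failures.
+
+1. **Watchdog script**: `backend/scripts/watchdog.py`
+2. **Start script**: `run_watchdog.sh` (runs under PM2)
+
+### Core logic
+```python
+# Restart after 3 consecutive failed heartbeats
+if service["failures"] >= service["threshold"]:
+    subprocess.run(["pm2", "restart", service["name"]])
+```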
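+
+For context, a polling loop around that check might look like the sketch below. It is not the contents of `backend/scripts/watchdog.py`: the `SERVICES` table, the health URL and the helper names are assumptions, and only the 30-second interval, the threshold of 3 and the `pm2 restart` call come from this log.
+
+```python
+import http.client
+import subprocess
+import time
+import urllib.request
+
+# Hypothetical service table; the URL layout is an assumption.
+SERVICES = [
+    {"name": "vigent2-qwen-tts", "url": "http://localhost:8009/health",
+     "failures": 0, "threshold": 3},
+]
+
+
+def is_healthy(url: str, timeout: float = 5.0) -> bool:
+    """Return True if the health endpoint answers with HTTP 200 in time."""
+    try:
+        with urllib.request.urlopen(url, timeout=timeout) as resp:
+            return resp.status == 200
+    except (OSError, http.client.HTTPException):
+        # Covers refused connections, timeouts and HTTP error statuses.
+        return False
+
+
+def record_heartbeat(service: dict, healthy: bool) -> bool:
+    """Update the failure counter; return True when a restart is due."""
+    if healthy:
+        service["failures"] = 0
+        return False
+    service["failures"] += 1
+    if service["failures"] >= service["threshold"]:
+        service["failures"] = 0  # start counting again after the restart
+        return True
+    return False
+
+
+def watch(interval: float = 30.0) -> None:
+    """Poll every service forever, restarting it through PM2 when needed."""
+    while True:
+        for service in SERVICES:
+            if record_heartbeat(service, is_healthy(service["url"])):
+                subprocess.run(["pm2", "restart", service["name"]], check=False)
+        time.sleep(interval)
+```
+
+Resetting the counter on every successful heartbeat is what makes the threshold mean consecutive failures rather than failures in total.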
+
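+### Deployment status
+- `vigent2-watchdog` is running and registered in the PM2 process list
+- Monitored services: `vigent2-qwen-tts` (8009), `vigent2-latentsync` (8007)
+
+---
+
+## ⚡ LatentSync performance check
+
+A code audit confirmed that LatentSync 1.6 already includes these optimizations:
+- ✅ **Flash Attention**: uses `torch.nn.functional.scaled_dot_product_attention` natively
+- ✅ **DeepCache**: enabled (`cache_interval=3`), giving ~2.5x speedup
+- ✅ **GPU concurrency**: the dual-GPU pipeline (GPU0 TTS | GPU1 LipSync) is confirmed to work
+
+---
+
+## 🎨 Interaction and view improvements (14:20)
+
+### Home page
+- After a video finishes generating, the preview selects the newest output first
+- Selections persist: material / background music / history item
+- Lists scroll the selected item into view internally, so the page does not jump
+- Refresh returns to the top (home page)
+- Preview panel for title/subtitle styles
+- Previewing a background track selects it and turns BGM on; the volume slider affects the preview in real time
+
+### Publish page
+- Refresh returns to the top (publish page)
+
+---
+
+## 🎵 Background music pipeline fix (15:00)
+
+### Fixes
+- FFmpeg mixing now runs with `shell=False`, so the shell no longer misparses `filter_complex`
+- `amix` normalization is disabled, so the voice-over volume is no longer pushed down
+
+### Key change
+`backend/app/services/video_service.py`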
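+
+The two fixes can be illustrated with a small sketch. The filter graph, the input order (video, voice-over, BGM), the default BGM volume and the function names are assumptions for the example; the real command in `video_service.py` is not reproduced in this log.
+
+```python
+import subprocess
+
+
+def build_mix_command(video: str, voice: str, bgm: str, output: str,
+                      bgm_volume: float = 0.2) -> list[str]:
+    """Build an FFmpeg argument list that mixes voice-over and BGM."""
+    filter_graph = (
+        f"[2:a]volume={bgm_volume}[bgm];"
+        # normalize=0 stops amix from scaling down its inputs
+        "[1:a][bgm]amix=inputs=2:duration=first:normalize=0[aout]"
+    )
+    return [
+        "ffmpeg", "-y",
+        "-i", video, "-i", voice, "-i", bgm,
+        "-filter_complex", filter_graph,
+        "-map", "0:v", "-map", "[aout]",
+        "-c:v", "copy",
+        output,
+    ]
+
+
+def mix(video: str, voice: str, bgm: str, output: str) -> None:
+    # A list of arguments runs without a shell (shell=False is the default),
+    # so the brackets and semicolons in the filter graph reach FFmpeg untouched.
+    subprocess.run(build_mix_command(video, voice, bgm, output), check=True)
+```
+
+With a single command string and `shell=True`, the `;` and `[...]` in the filter graph would be interpreted by the shell first, which is the misparse the fix avoids.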
+
+---
+
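+## 🗣️ Subtitle segmentation fix (15:20)
+
+### Change
+- The subtitle splitting logic keeps English words intact, so mixed Chinese/English text is no longer cut mid-word
+
+### Files involved
+- `backend/app/services/whisper_service.py`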
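+
+One way to keep English words whole while still splitting Chinese per character is a single regular expression, sketched below. The pattern and the name `split_tokens` are assumptions for illustration; the rule actually used in `whisper_service.py` is not shown in this log.
+
+```python
+import re
+
+# A run of ASCII letters/digits (optionally joined by apostrophes or hyphens)
+# is one token; any other non-space character is a token on its own.
+TOKEN_RE = re.compile(r"[A-Za-z0-9]+(?:['-][A-Za-z0-9]+)*|\S")
+
+
+def split_tokens(text: str) -> list[str]:
+    """Split text into display tokens: whole ASCII words, single CJK characters."""
+    return TOKEN_RE.findall(text)
+```
+
+Timestamps can then be distributed over these tokens instead of over raw characters.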
+
+---
+
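+## 🧱 Asset library and style support (15:40)
+
+### Changes
+- Font and BGM assets are served from the local assets directory
+- New style configuration files (subtitle / title)
+- New asset API and static mount at `/assets`
+- Remotion accepts style parameters and loads fonts
+
+### Files involved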
+- `backend/assets/fonts/`
+- `backend/assets/bgm/`
+- `backend/assets/styles/subtitle.json`
+- `backend/assets/styles/title.json`
+- `backend/app/services/assets_service.py`
+- `backend/app/api/assets.py`
+- `backend/app/main.py`
+- `backend/app/api/videos.py`
+- `backend/app/services/remotion_service.py`
+- `remotion/src/components/Subtitles.tsx`
+- `remotion/src/components/Title.tsx`
+- `remotion/src/Video.tsx`
+- `remotion/render.ts`
+- `frontend/src/app/page.tsx`
+- `frontend/next.config.ts`
+
+---
+
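+## 🛠️ Operations changes (16:10)
+
+### Changes
+- The watchdog no longer monitors LatentSync, so long inference runs are not killed by mistake
+- LatentSync's PM2 process gets a memory restart threshold (runtime configuration)
+
+---
+
+## 🎯 Unified button icons in the frontend (16:40)
+
+### Changes
+- Button icons on the home and publish pages are replaced with Lucide SVG icons
+- Interactive buttons keep a consistent size and alignment
+
+### Files involved
+- `frontend/src/features/home/ui/`
+- `frontend/src/app/publish/page.tsx`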
+
+---
+
+## 📝 Documentation updates
+
+- [x] `Docs/QWEN3_TTS_DEPLOY.md`: added the Flash Attention installation guide
+- [x] `Docs/DEPLOY_MANUAL.md`: added watchdog deployment instructions
+- [x] `Docs/task_complete.md`: progress updated to 100% (Day 16)
--- a/Docs/DevLogs/Day17.md
+++ b/Docs/DevLogs/Day17.md
@@ -0,0 +1,176 @@
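+# Day 17 - Frontend refactoring and UX improvements
+
+## 🧩 Frontend UI split (09:10)
+
+### Changes
+- The home `page.tsx` is split into separate UI components; state and logic stay in the page
+- New directory for home components: `frontend/src/features/home/ui/`
+
+### Components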
+- `HomeHeader`
+- `MaterialSelector`
+- `ScriptEditor`
+- `TitleSubtitlePanel`
+- `VoiceSelector`
+- `RefAudioPanel`
+- `BgmPanel`
+- `GenerateActionBar`
+- `PreviewPanel`
+- `HistoryList`
+
+---
+
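+## 🧰 Shared frontend utilities extracted (09:30)
+
+### Changes
+- Extracted shared helpers for the API base, asset URLs and date formatting
+- The home and publish pages both call them, which removes duplicated logic
+
+### Files involved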
+- `frontend/src/shared/lib/media.ts`
+- `frontend/src/app/page.tsx`
+- `frontend/src/app/publish/page.tsx`
+
+---
+
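+## 📝 Frontend guidelines updated (09:40)
+
+### Changes
+- Updated `FRONTEND_DEV.md` to match the current directory structure
+- Added usage rules and examples for `media.ts`
+- Added component-splitting rules and a page checklist
+
+### Files involved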
+- `Docs/FRONTEND_DEV.md`
+
+---
+
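+## 🎨 Interaction and view improvements (10:00)
+
+### Title/subtitle preview
+- The title/subtitle preview scales with the material's resolution, so font sizes are closer to the final video
+- The selected title/subtitle styles persist and no longer reset to the defaults on refresh
+- Default styles updated: title 90px in ZCOOL KuaiLe (站酷快乐体); subtitles 60px classic yellow text in DingTalkJinBuTi
+
+### Publish page
+- Work selection is now a card list with search and a preview modal
+
+---
+
+## ⚡ Minor performance optimizations (10:30)
+
+### Changes
+- List rendering uses `content-visibility` (materials / history / reference audio / publishable works); the BGM list keeps its scroll-to-selection behavior
+- First-screen data requests run in parallel (`Promise.allSettled`)
+- localStorage writes are debounced (script / title / BGM volume / publish form)
+
+---
+
+## 🖼️ Preview modal improvements (11:10)
+
+### Changes
+- The preview modal is now a single reusable component that supports a title and a hint
+- The publish-page preview and the material preview share the modal styling
+- Unified modal header (icon + title + close button)
+
+### Files involved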
+- `frontend/src/components/VideoPreviewModal.tsx`
+- `frontend/src/app/page.tsx`
+- `frontend/src/app/publish/page.tsx`
+
+---
+
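+## 🧭 Terminology unified (11:20)
+
+### Changes
+- “视频预览” → “作品预览” (video preview → work preview)
+- “历史视频” → “历史作品” (video history → work history)
+- “选择要发布的视频” → “选择要发布的作品” (select a video to publish → select a work to publish)
+- “选择素材视频” → “视频素材” (select material video → video material)
+- “选择配音方式” → “配音方式” (select voice-over mode → voice-over mode)
+
+---
+
+## 🧱 Phase 2: hook extraction (11:45)
+
+### Changes
+- `useTitleSubtitleStyles`: fetches title/subtitle styles and picks the defaults
+- `useMaterials`: material list / upload / delete logic
+- `useRefAudios`: reference audio list / upload / delete logic
+- `useBgm`: background music list and loading state
+- `useMediaPlayers`: central handling of audio previews (reference audio / background music)
+- `useGeneratedVideos`: history list fetching + selection logic
+
+### Files involved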
+- `frontend/src/features/home/model/useTitleSubtitleStyles.ts`
+- `frontend/src/features/home/model/useMaterials.ts`
+- `frontend/src/features/home/model/useRefAudios.ts`
+- `frontend/src/features/home/model/useBgm.ts`
+- `frontend/src/features/home/model/useMediaPlayers.ts`
+- `frontend/src/features/home/model/useGeneratedVideos.ts`
+- `frontend/src/app/page.tsx`
+
+---
+
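+## 🧩 Home page persistence fix (12:20)
+
+### Changes
+- Wired in `useHomePersistence` and completed the `isRestored` restore/save logic
+- Fixed the restore path for home page selections after a refresh; `npm run build` passes
+
+### Files involved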
+- `frontend/src/app/page.tsx`
+- `frontend/src/features/home/model/useHomePersistence.ts`
+
+---
+
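+## 🧩 Publish preview and playback fixes (14:10)
+
+### Changes
+- The publish-page preview handles both signed URLs and relative paths
+- Reference audio previews always go through `resolveMediaUrl`
+- Material/BGM selection falls back to a valid item when the list changes
+- Recording preview URLs are revoked, the page scroll state is restored after the preview modal closes, and the global task notice is mounted
+
+### Files involved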
+- `frontend/src/app/publish/page.tsx`
+- `frontend/src/features/home/model/useMediaPlayers.ts`
+- `frontend/src/features/home/model/useBgm.ts`
+- `frontend/src/features/home/model/useMaterials.ts`
+- `frontend/src/features/home/ui/RefAudioPanel.tsx`
+- `frontend/src/components/VideoPreviewModal.tsx`
+- `frontend/src/app/layout.tsx`
+
+---
+
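+## 🧩 Title sync and length limit (15:30)
+
+### Changes
+- Editing the opening title also writes it to the publish title
+- Title input works with Chinese input methods and is limited to 15 characters (the publish title follows the same rule)
+
+### Files involved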
+- `frontend/src/features/home/model/useHomeController.ts`
+- `frontend/src/features/home/ui/TitleSubtitlePanel.tsx`
+- `frontend/src/features/publish/model/usePublishController.ts`
+
+---
+
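+## 🧱 Lightweight FSD migration (16:20)
+
+### Changes
+- Slimmer pages: `app` keeps only entry components, and business logic moves into controller hooks
+- Introduced the `features/*` layer: UI and model are separated, and Home/Publish are grouped by feature
+- Shared capabilities move down into `shared/*` (lib/hooks/api)
+
+### Files involved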
+- `frontend/src/features/home/ui/HomePage.tsx`
+- `frontend/src/features/home/model/useHomeController.ts`
+- `frontend/src/features/publish/ui/PublishPage.tsx`
+- `frontend/src/features/publish/model/usePublishController.ts`
+- `frontend/src/shared/lib/media.ts`
+- `frontend/src/shared/lib/title.ts`
+- `frontend/src/shared/api/axios.ts`
+- `frontend/src/shared/hooks/useTitleInput.ts`
+- `frontend/src/app/page.tsx`
+- `frontend/src/app/publish/page.tsx`
--- a/Docs/DevLogs/Day9.md
+++ b/Docs/DevLogs/Day9.md
@@ -228,11 +228,11 @@ else:

 | 文件 | 说明 | 状态 |
 |------|------|------|
-| `src/lib/auth.ts` | 认证工具函数 | ✅ |
+| `src/shared/lib/auth.ts` | 认证工具函数 | ✅ |
 | `src/app/login/page.tsx` | 登录页 | ✅ |
 | `src/app/register/page.tsx` | 注册页 | ✅ |
 | `src/app/admin/page.tsx` | 管理后台 | ✅ |
-| `src/middleware.ts` | 路由保护 | ✅ |
+| `src/proxy.ts` | 路由保护 | ✅ |

 ### 账号隔离集成

--- a/Docs/Doc_Rules.md
+++ b/Docs/Doc_Rules.md
@@ -24,10 +24,12 @@
 | :---: | :--- | :--- |
 | 🔥 **High** | `Docs/DevLogs/DayN.md` | **(最新日志)** 详细记录变更、修复、代码片段 |
 | 🔥 **High** | `Docs/task_complete.md` | **(任务总览)** 更新 `[x]`、进度条、时间线 |
-| ⚡ **Med** | `README.md` | **(项目主页)** 功能特性、技术栈、最新截图 |
-| ⚡ **Med** | `Docs/DEPLOY_MANUAL.md` | **(部署手册)** 环境变量、依赖包、启动命令变更 |
-| 🧊 **Low** | `Docs/implementation_plan.md` | **(实施计划)** 核对计划与实际实现的差异 |
-| 🧊 **Low** | `frontend/README.md` | **(前端文档)** 新页面路由、组件用法、UI变更 |
+| ⚡ **Med** | `README.md` | **(项目主页)** 功能特性、技术栈、最新截图 |
+| ⚡ **Med** | `Docs/DEPLOY_MANUAL.md` | **(部署手册)** 环境变量、依赖包、启动命令变更 |
+| ⚡ **Med** | `Docs/FRONTEND_DEV.md` | **(前端规范)** API封装、日期格式化、新页面规范 |
+| ⚡ **Med** | `Docs/FRONTEND_README.md` | **(前端文档)** 功能说明、页面变更 |
+| 🧊 **Low** | `Docs/implementation_plan.md` | **(实施计划)** 核对计划与实际实现的差异 |
+| 🧊 **Low** | `Docs/architecture_plan.md` | **(前端架构)** 拆分计划与阶段目标 |

 ---

@@ -139,20 +141,20 @@

 > **核心原则**：使用正确的工具，避免字符编码问题

-### ✅ 推荐工具：replace_file_content
+### ✅ 推荐工具：apply_patch

-**使用场景**：
+**使用场景**：
 - 追加新章节到文件末尾
 - 修改/替换现有章节内容
 - 更新状态标记（🔄 → ✅）
 - 修正错误内容

-**优势**：
+**优势**：
 - ✅ 自动处理字符编码（Windows CRLF）
 - ✅ 精确替换，不会误删其他内容
 - ✅ 有错误提示，方便调试

-**注意事项**：
+**注意事项**：
 ```markdown
 1. **必须精确匹配**：TargetContent 必须与文件完全一致
 2. **处理换行符**：文件使用 \r\n，不要漏掉 \r
@@ -176,36 +178,45 @@

 ### 📝 最佳实践示例

-**追加新章节**：
-```python
-replace_file_content(
-    TargetFile="path/to/DayN.md",
-    TargetContent="## 🔗 相关文档\n\n...\n\n",  # 文件末尾的内容
-    ReplacementContent="## 🔗 相关文档\n\n...\n\n---\n\n## 🆕 新章节\n内容...",
-    StartLine=280,
-    EndLine=284
-)
-```
+**追加新章节**：
+```diff
+*** Begin Patch
+*** Update File: Docs/DevLogs/DayN.md
+@@
+ ## 🔗 相关文档
+ 
+ ...
++---
++
++## 🆕 新章节
++内容...
+*** End Patch
+```

-**修改现有内容**：
-```python
-replace_file_content(
-    TargetContent="**状态**：🔄 待修复",
-    ReplacementContent="**状态**：✅ 已修复",
-    StartLine=310,
-    EndLine=310
-)
-```
+**修改现有内容**：
+```diff
+*** Begin Patch
+*** Update File: Docs/DevLogs/DayN.md
+@@
+-**状态**：🔄 待修复
++**状态**：✅ 已修复
+*** End Patch
+```


 ---

-## 📁 文件结构
+## 📁 文件结构

 ```
-ViGent/Docs/
-├── task_complete.md              # 任务总览（仅按需更新）
-├── Doc_Rules.md                  # 本文件
+ViGent2/Docs/
+├── task_complete.md              # 任务总览（仅按需更新）
+├── Doc_Rules.md                  # 本文件
+├── FRONTEND_DEV.md               # 前端开发规范
+├── FRONTEND_README.md            # 前端功能文档
+├── architecture_plan.md          # 前端拆分计划
+├── DEPLOY_MANUAL.md              # 部署手册
+├── SUPABASE_DEPLOY.md            # Supabase 部署文档
 └── DevLogs/
    ├── Day1.md                   # 开发日志
    └── ...
@@ -213,7 +224,7 @@ ViGent/Docs/

 ---

-## 📅 DayN.md 更新规则（日常更新）
+## 📅 DayN.md 更新规则（日常更新）

 ### 新建判断 (对话开始前)
 1. **回顾进度**：查看 `task_complete.md` 了解当前状态
@@ -221,9 +232,9 @@ ViGent/Docs/
   - **今天 (与当前日期相同)** → 🚨 **绝对禁止创建新文件**，必须**追加**到现有 `DayN.md` 末尾！即使是完全不同的功能模块。
   - **之前 (昨天或更早)** → 创建 `Day{N+1}.md`

-### 追加格式
-```markdown
---
+### 追加格式
+```markdown
+---

 ## 🔧 [章节标题]

@@ -239,14 +250,18 @@ ViGent/Docs/
 - ✅ 修复了 xxx
 ```

-### 快速修复格式
-```markdown
-## 🐛 [Bug 简述] (HH:MM)
+### 快速修复格式
+```markdown
+## 🐛 [Bug 简述] (HH:MM)

 **问题**：一句话描述
 **修复**：修改了 `文件名` 中的 xxx
-**状态**：✅ 已修复 / 🔄 待验证
-```
+**状态**：✅ 已修复 / 🔄 待验证
+```
+
+### ⚠️ 注意
+- **DayN.md 文件开头禁止使用 `---`**，避免被解析为 Front Matter。
+- 分隔线只用于章节之间，不作为文件第一行。

 ---

@@ -301,4 +316,4 @@ ViGent/Docs/

 ---

-**最后更新**：2026-01-23
+**最后更新**：2026-02-04
--- a/Docs/FRONTEND_DEV.md
+++ b/Docs/FRONTEND_DEV.md
@@ -0,0 +1,298 @@
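+# Frontend development guidelines
+
+## Directory structure
+
+```
+frontend/src/
+├── app/                # Next.js App Router pages
+│   ├── page.tsx        # Home page (video generation)
+│   ├── publish/        # Publish page
+│   ├── admin/          # Admin page
+│   ├── login/          # Login page
+│   └── register/       # Registration page
+├── features/           # Feature modules (home, publish), each split into ui/ and model/
+├── components/         # Reusable components
+│   └── ...
+├── shared/             # Shared code
+│   ├── api/axios.ts    # Axios instance (with the 401/403 interceptor)
+│   ├── lib/auth.ts     # Auth helpers
+│   └── lib/media.ts    # API base / URL / date helpers
+└── proxy.ts            # Route proxy (formerly middleware)
+```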
+
+---
+
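+## iOS Safari safe-area compatibility
+
+### Problem
+On iPhone, Safari reserves safe areas at the top (notch / Dynamic Island) and at the bottom (Home indicator). By default the page background does not extend into them, which leaves white bars.
+
+### Solution (three layers working together)
+
+#### 1. Viewport configuration (`layout.tsx`)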
+```typescript
+import type { Viewport } from "next";
+
+export const viewport: Viewport = {
+  width: 'device-width',
+  initialScale: 1,
+  viewportFit: 'cover',    // let content extend into the safe areas
+  themeColor: '#0f172a',   // status bar color (matches the background)
+};
+```
+
+#### 2. Put the global background on body (`layout.tsx`)
+```tsx
+<html lang="en" style={{ backgroundColor: '#0f172a' }}>
+  <body
+    style={{
+      margin: 0,
+      minHeight: '100dvh',  // use dvh rather than vh
+      background: 'linear-gradient(to bottom, #0f172a 0%, #0f172a 5%, #581c87 50%, #0f172a 95%, #0f172a 100%)',
+    }}
+  >
+    {children}
+  </body>
+</html>
+```
+
+#### 3. CSS safe-area support (`globals.css`)
+```css
+html {
+  background-color: #0f172a !important;
+  min-height: 100%;
+}
+
+body {
+  margin: 0 !important;
+  min-height: 100dvh;
+  padding-top: env(safe-area-inset-top);
+  padding-bottom: env(safe-area-inset-bottom);
+}
+```
+
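+### Key points
+- **Put the gradient on body, not on the page div**: the safe areas lie outside the div
+- **Use `100dvh` rather than `100vh`**: dvh is the dynamic viewport height and adapts to mobile browsers
+- **Keep themeColor equal to the background's edge color**: this avoids a visible seam at the status bar
+- **Remove the page div's own background**: leave it transparent so the body gradient shows through
+
+---
+
+## Mobile responsive guidelines
+
+### Header button layout
+```tsx
+// Compact on mobile, roomier on desktop
+<div className="flex items-center gap-1 sm:gap-4">
+  <button className="px-2 sm:px-4 py-1 sm:py-2 text-sm sm:text-base ...">
+    按钮
+  </button>
+</div>
+```
+
+### Common responsive breakpoints
+| Breakpoint | Width | Use |
+|------|------|------|
+| Default | < 640px | Mobile |
+| `sm:` | ≥ 640px | Tablet / desktop |
+| `lg:` | ≥ 1024px | Large desktop |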
+
+---
+
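+## API request guidelines
+
+### Always use `api` (the axios instance)
+
+Every API request that needs authentication **must** use the axios instance exported from `@/shared/api/axios`. The instance is already configured to:
+- Send credentials (cookies) with every request, the axios equivalent of `credentials: include`
+- Clear the cookie and redirect to the login page on 401/403
+
+**Usage:**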
+
+```typescript
+import api from '@/shared/api/axios';
+
+// GET request
+const { data: materials } = await api.get('/api/materials');
+
+// POST request
+const { data: task } = await api.post('/api/videos/generate', {
+    text: '...',
+    voice: '...',
+});
+
+// DELETE request
+await api.delete(`/api/materials/${id}`);
+
+// File upload with progress reporting
+await api.post('/api/materials', formData, {
+    headers: { 'Content-Type': 'multipart/form-data' },
+    onUploadProgress: (e) => {
+        if (e.total) {
+            const progress = Math.round((e.loaded / e.total) * 100);
+            setProgress(progress);
+        }
+    },
+});
+```
+
+### Using with SWR
+
+```typescript
+import useSWR from 'swr';
+import api from '@/shared/api/axios';
+
+// SWR fetcher backed by axios
+const fetcher = (url: string) => api.get(url).then(res => res.data);
+
+const { data } = useSWR('/api/xxx', fetcher, { refreshInterval: 2000 });
+```
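+
+A fixed `refreshInterval` keeps polling even after a task has finished. SWR also accepts a function for `refreshInterval` that receives the latest data, so polling can stop on a terminal state. The sketch below is only an illustration: the `TaskStatus` shape and its status strings are assumptions, not the backend's confirmed schema.
+
+```typescript
+// Hypothetical task shape; the real fields returned by
+// /api/videos/tasks/{task_id} may differ.
+interface TaskStatus {
+  status: "pending" | "processing" | "completed" | "failed";
+  progress: number;
+}
+
+// Returns the polling delay in ms; 0 tells SWR to stop refreshing.
+function pollingInterval(latest: TaskStatus | undefined): number {
+  if (!latest) return 2000; // nothing loaded yet, keep polling
+  const finished = latest.status === "completed" || latest.status === "failed";
+  return finished ? 0 : 2000;
+}
+
+// Usage with SWR (not executed here):
+// useSWR(`/api/videos/tasks/${taskId}`, fetcher, { refreshInterval: pollingInterval });
+```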
+
+---
+
+## Shared Utilities (media.ts)
+
+### Unified API Base / URL Resolution
+Use `@/shared/lib/media` to resolve the server-side/client-side API base and asset URLs in one place instead of hard-coding them:
+
+```typescript
+import { getApiBaseUrl, resolveMediaUrl, resolveAssetUrl, formatDate } from '@/shared/lib/media';
+
+const apiBase = getApiBaseUrl(); // SSR: http://localhost:8006 / Client: ''
+const playableUrl = resolveMediaUrl(video.path); // Handles both signed URLs and relative paths
+const fontUrl = resolveAssetUrl(`fonts/${fontFile}`);
+const timeText = formatDate(video.created_at);
+```
+
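+### Asset Path Rules
+- Video/audio: prefer `resolveMediaUrl()`
+- Fonts/BGM: use `resolveAssetUrl()` (automatically encodes Chinese path segments)
+- Before previewing, if a signed URL may already be present, check with `isAbsoluteUrl()` first so the base is not prepended a second time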
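+
+The rules above can be sketched as three small pure functions. This is a hedged illustration only: the real helpers in `@/shared/lib/media` may differ, and the `apiBase` parameter and the `/assets` prefix are assumptions made for the example.
+
+```typescript
+// Sketch only; not the actual implementation in @/shared/lib/media.
+const ASSET_PREFIX = "/assets"; // hypothetical mount point for fonts/BGM
+
+function isAbsoluteUrl(value: string): boolean {
+  // Matches "http://", "https://" and protocol-relative "//host/..." URLs.
+  return /^(https?:)?\/\//i.test(value);
+}
+
+function resolveMediaUrl(pathOrUrl: string, apiBase: string = ""): string {
+  // Signed URLs are already complete, so they pass through untouched.
+  if (isAbsoluteUrl(pathOrUrl)) return pathOrUrl;
+  const path = pathOrUrl.charAt(0) === "/" ? pathOrUrl : "/" + pathOrUrl;
+  return apiBase + path;
+}
+
+function resolveAssetUrl(relativePath: string, apiBase: string = ""): string {
+  // Encode each segment separately so "/" separators survive
+  // while Chinese file names are percent-escaped.
+  const encoded = relativePath
+    .split("/")
+    .map((segment) => encodeURIComponent(segment))
+    .join("/");
+  return apiBase + ASSET_PREFIX + "/" + encoded;
+}
+```
+
+Encoding per segment rather than the whole path keeps the directory structure readable in the final URL.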
+
+---
+
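+## Date Formatting Guidelines
+
+### Do not use `toLocaleString()`
+
+`toLocaleString()` can return different strings on the server and on the client (locale data and time zone may differ), which causes hydration errors.
+
+**Incorrect:**
+```typescript
+// ❌ Causes hydration errors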
+new Date(timestamp * 1000).toLocaleString('zh-CN')
+```
+
+**Correct:**
+```typescript
+// ✅ Use the fixed-format helper
+import { formatDate } from '@/shared/lib/media';
+
+const timeText = formatDate(video.created_at);
+```
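+
+One way to get identical output on the server and the client is to read UTC fields after applying a fixed offset, so the host time zone never matters. The sketch below assumes a UTC+8 display zone and a `YYYY-MM-DD HH:mm` layout; neither is confirmed by the real `formatDate` in `media.ts`, and `formatDateFixed` is a hypothetical name.
+
+```typescript
+const FIXED_OFFSET_MINUTES = 8 * 60; // assumption: always display in UTC+8
+
+function pad2(n: number): string {
+  return n < 10 ? "0" + n : String(n);
+}
+
+// Accepts Unix seconds or a date string; returns "" for unparseable input.
+function formatDateFixed(input: number | string): string {
+  const ms = typeof input === "number" ? input * 1000 : Date.parse(input);
+  if (isNaN(ms)) return "";
+  // Shift into the fixed offset, then read UTC fields only.
+  const d = new Date(ms + FIXED_OFFSET_MINUTES * 60000);
+  return (
+    d.getUTCFullYear() + "-" + pad2(d.getUTCMonth() + 1) + "-" + pad2(d.getUTCDate()) +
+    " " + pad2(d.getUTCHours()) + ":" + pad2(d.getUTCMinutes())
+  );
+}
+```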
+
+---
+
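+## Component Splitting Guidelines
+
+Once a page component grows past roughly 300-500 lines, split it by feature into `features/*/ui`:
+
+- `page.tsx` only handles composition and layout
+- Business logic lives in a controller hook under `features/*/model`
+- UI components take only props and callbacks and should avoid calling the API directly
+- Components split out of the home page all go in `features/home/ui/`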
+
+---
+
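+## Lightweight FSD Structure
+
+- `app/`: page entry points, kept thin
+- `features/*/model`: business logic and state (hooks)
+- `features/*/ui`: feature UI components
+- `shared/`: shared utilities, shared hooks, and the API instance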
+
+---
+
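+## Persisting User Preferences
+
+User preferences on the home page, such as styles and font sizes, must be persisted and restored after a refresh:
+
+- **Must be persisted**:
+  - Title style ID / subtitle style ID
+  - Title font size / subtitle font size
+  - Background music selection / volume / on-off state
+  - Material selection / past-work selection
+
+### Implementation Rules
+- Use `storageKey = userId || 'guest'` to isolate data per user.
+- **Restore before save**: no writes are allowed until restoration has finished (guarded by `isRestored`).
+- Do not let default values overwrite the user's choice (read the saved value first).
+- Prefer `useHomePersistence` as the single place for restore/save logic, and avoid scattered localStorage reads and writes inside pages.
+- Any new persisted field must be added to both the restore and the save logic, and this section must be updated.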
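+
+The restore-before-save rule can be sketched without React as a small class around an injectable store. This is an assumption-laden illustration, not the actual `useHomePersistence` hook: the `HomePrefs` fields, the `HomePrefsStore` class, and the `_home_prefs` key suffix are hypothetical.
+
+```typescript
+// Minimal storage contract; window.localStorage satisfies it in the browser.
+interface KeyValueStore {
+  getItem(key: string): string | null;
+  setItem(key: string, value: string): void;
+}
+
+// Hypothetical subset of the persisted preferences.
+interface HomePrefs {
+  titleStyleId: string;
+  subtitleFontSize: number;
+}
+
+const DEFAULT_PREFS: HomePrefs = { titleStyleId: "default", subtitleFontSize: 60 };
+
+class HomePrefsStore {
+  private restored = false;
+  private readonly key: string;
+
+  constructor(private readonly store: KeyValueStore, userId: string | null) {
+    // Per-user isolation, following storageKey = userId || 'guest'.
+    this.key = "vigent_" + (userId || "guest") + "_home_prefs"; // hypothetical key name
+  }
+
+  restore(): HomePrefs {
+    let saved: Partial<HomePrefs> = {};
+    try {
+      saved = JSON.parse(this.store.getItem(this.key) || "{}");
+    } catch {
+      saved = {}; // corrupted JSON falls back to defaults
+    }
+    this.restored = true;
+    // Saved values win over defaults.
+    return { ...DEFAULT_PREFS, ...saved };
+  }
+
+  save(prefs: HomePrefs): boolean {
+    // Restore-before-save: refuse to write until restore() has run.
+    if (!this.restored) return false;
+    this.store.setItem(this.key, JSON.stringify(prefs));
+    return true;
+  }
+}
+```
+
+Returning `false` from `save` instead of throwing keeps an early effect from crashing the page while still preventing defaults from overwriting saved values.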
+
+---
+
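+## Title Input Rules
+
+- The intro title and the publish title share a 15-character limit.
+- While a Chinese IME composition is in progress the text is not truncated; length is validated only after composition ends.
+- Editing the intro title on the home page also writes the value to `vigent_${storageKey}_publish_title`.
+- Avoid `maxLength`, which forcibly truncates text that is still being composed.
+- Prefer `@/shared/hooks/useTitleInput` to handle this input logic in one place.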
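+
+The composition rule above comes down to one decision: leave the text alone while the IME is composing, and enforce the limit afterwards. A minimal sketch follows; `nextTitleValue` is a hypothetical helper, not the body of `useTitleInput`.
+
+```typescript
+const TITLE_MAX_LENGTH = 15;
+
+// Returns the value to keep for the title input.
+// isComposing should be true between compositionstart and compositionend.
+function nextTitleValue(raw: string, isComposing: boolean, max: number = TITLE_MAX_LENGTH): string {
+  // During IME composition the raw text holds temporary pinyin; leave it alone.
+  if (isComposing) return raw;
+  // Count by code point so surrogate pairs (e.g. emoji) are not cut in half.
+  const chars = Array.from(raw);
+  return chars.length > max ? chars.slice(0, max).join("") : raw;
+}
+```
+
+In a React input, the flag would typically be toggled from the `onCompositionStart` and `onCompositionEnd` handlers, with `nextTitleValue` called again when composition ends.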
+
+---
+
+## New Page Checklist
+
+1. [ ] Import `import api from '@/shared/api/axios'`
+2. [ ] Use `api.get/post/delete()` for every API request instead of native `fetch`
+3. [ ] Format dates with `formatDate` from `@/shared/lib/media`
+4. [ ] Resolve asset URLs with `resolveMediaUrl`/`resolveAssetUrl`
+5. [ ] Add the `'use client'` directive (when client-side interactivity is needed)
+
+---
+
+## Voice Clone Feature
+
+### API Endpoints
+
+| Endpoint | Method | Purpose |
+|------|------|------|
+| `/api/ref-audios` | POST | Upload a reference audio clip (multipart/form-data: file + ref_text) |
+| `/api/ref-audios` | GET | List the user's reference audio clips |
+| `/api/ref-audios/{id}` | DELETE | Delete a reference audio clip (the id must be passed through encodeURIComponent) |
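+
+Encoding matters because a reference audio id of the form `user_id/timestamp_name.wav` contains a slash. A small sketch of building the DELETE path; `refAudioDeletePath` is a hypothetical helper name:
+
+```typescript
+// Builds the DELETE path for a reference audio id such as "user_id/timestamp_name.wav".
+function refAudioDeletePath(id: string): string {
+  // Without encoding, the "/" inside the id would be read as an extra path segment.
+  return "/api/ref-audios/" + encodeURIComponent(id);
+}
+
+// Usage with the shared axios instance (not executed here):
+// await api.delete(refAudioDeletePath(audio.id));
+```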
+
+### Video Generation API Extensions
+
+```typescript
+// EdgeTTS mode (default)
+await api.post('/api/videos/generate', {
+    material_path: '...',
+    text: '口播文案', // the script to be spoken
+    tts_mode: 'edgetts',
+    voice: 'zh-CN-YunxiNeural',
+});
+
+// Voice clone mode
+await api.post('/api/videos/generate', {
+    material_path: '...',
+    text: '口播文案',
+    tts_mode: 'voiceclone',
+    ref_audio_id: 'user_id/timestamp_name.wav',
+    ref_text: '参考音频对应文字', // transcript of the reference audio
+});
+```
+
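+### In-Browser Recording
+
+Audio is recorded with the `MediaRecorder` API in `audio/webm` format; after upload the backend converts it to WAV (16 kHz mono).
+
+```typescript
+// Recording requires the user to grant microphone access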
+const stream = await navigator.mediaDevices.getUserMedia({ audio: true });
+const mediaRecorder = new MediaRecorder(stream, { mimeType: 'audio/webm' });
+```
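+
+Not every browser can record `audio/webm`; some Safari versions on iOS, for example, record `audio/mp4` instead. `MediaRecorder.isTypeSupported()` lets the page pick a supported container before constructing the recorder. The sketch below is a hedged illustration: the candidate list is an assumption, and it presumes the backend conversion step also accepts non-WebM uploads, which the source does not confirm.
+
+```typescript
+// Candidate containers in order of preference; the list itself is an assumption.
+const RECORDER_MIME_CANDIDATES = ["audio/webm;codecs=opus", "audio/webm", "audio/mp4"];
+
+// isSupported is injected so the function can run outside a browser;
+// in the page, pass (t) => MediaRecorder.isTypeSupported(t).
+function pickRecorderMimeType(isSupported: (mimeType: string) => boolean): string | undefined {
+  for (const candidate of RECORDER_MIME_CANDIDATES) {
+    if (isSupported(candidate)) return candidate;
+  }
+  return undefined; // let the browser fall back to its own default
+}
+
+// Usage in the browser (not executed here):
+// const mimeType = pickRecorderMimeType((t) => MediaRecorder.isTypeSupported(t));
+// const recorder = new MediaRecorder(stream, mimeType ? { mimeType } : undefined);
+```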
+
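+### UI Structure
+
+The voice source is chosen through tabs:
+- **EdgeTTS voices**: a 2x3 grid of preset voices
+- **Voice clone**: reference audio list + in-browser recording + reference text input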
--- a/Docs/FRONTEND_README.md
+++ b/Docs/FRONTEND_README.md
@@ -0,0 +1,117 @@
+# ViGent2 Frontend
+
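+The ViGent2 frontend, built with Next.js 16 + TailwindCSS.
+
+## ✨ Core Features
+
+### 1. Video Generation (`/`)
+- **Material management**: drag-and-drop upload of presenter videos with live preview.
+- **Script voice-over**: EdgeTTS integration with multiple voices (Yunxi / Xiaoxiao).
+- **AI title/tags**: one-click generation of video titles and tags (Day 14).
+- **Title/subtitle styles**: style selection + preview + font-size adjustment (Day 16).
+- **Background music**: audition + volume control + persisted selection (Day 16).
+- **Interaction polish**: persisted selections, scroll-to-item inside lists, scroll to top on refresh (Day 16).
+- **Preview consistency**: title/subtitle previews scale with the material's resolution, so they look closer to the final video (Day 17).
+- **Progress tracking**: live display of video generation progress (10% -> 100%).
+- **Work preview**: play and download directly once generation finishes (work preview + past works).
+- **Local persistence**: script, title, and preferences are persisted by `useHomePersistence` and restored after a refresh (Day 14/17).
+
+### 2. Fully Automated Publishing (`/publish`) [added Day 7]
+- **Multi-platform management**: manage Bilibili, Douyin, and Xiaohongshu account status in one place.
+- **QR-code login**:
+  - Shows the QR code generated by Playwright on the backend.
+  - Detects scan status in real time (Wait/Success).
+  - Saves cookies automatically and keeps status in sync.
+- **Publish settings**: set the video title, tags, and description.
+- **Work selection**: card list + search + preview dialog.
+- **Preview compatibility**: both signed URLs and relative paths can be previewed directly.
+- **Scheduling**: supports "publish now" or "scheduled publish".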
+
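+### 3. Voice Clone [added Day 13]
+- **TTS mode selection**: switch between EdgeTTS (preset voices) and voice clone (custom voice).
+- **Reference audio management**: upload/list/delete reference audio (3-20 s WAV).
+- **One-click cloning**: once a reference audio clip is selected, the Qwen3-TTS service is called automatically.
+
+### 4. Subtitles and Titles [added Day 13]
+- **Intro title**: optional, limited to 15 characters, shown for 3 seconds at the start of the video with fade-in/fade-out.
+- **Title sync**: editing the intro title on the home page also updates the publish title.
+- **Per-character highlighted subtitles**: karaoke effect, on by default, can be turned off.
+- **Automatic alignment**: character-level timestamps generated with faster-whisper.
+- **Style presets**: title/subtitle style selection + preview + font-size adjustment (Day 16).
+- **Default styles**: title 90px 站酷快乐体 (ZCOOL KuaiLe); subtitles 60px classic yellow + DingTalkJinBuTi (Day 17).
+- **Style persistence**: title/subtitle styles and font sizes survive a refresh (Day 17).
+
+### 5. Background Music [added Day 16]
+- **Audition preview**: clicking audition also selects the track; the volume slider takes effect immediately.
+- **Mix control**: affects only the BGM; the voice-over keeps its original volume.
+
+### 6. Account Settings [added Day 15]
+- **Phone-number login**: login with a validated 11-digit mainland China mobile number.
+- **Account dropdown menu**: shows the expiry date + change password + secure sign-out.
+- **Change password**: a dialog asks for the current and new passwords; changing the password forces a fresh login.
+
+### 7. Script Extraction Assistant (`ScriptExtractionModal`) [added Day 15]
+- **Multiple sources**: supports drag-and-drop file upload and pasted URLs (Bilibili/Douyin/TikTok).
+- **AI rewriting**: integrates GLM-4.7-Flash to rewrite the text into a spoken script automatically.
+- **One-click fill**: the extracted result is filled straight into the video generation input box.
+- **Smart interaction**: live progress display and safeguards against accidental clicks.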
+
+## 🛠️ Tech Stack
+
+- **Framework**: Next.js 16 (App Router)
+- **Styling**: TailwindCSS
+- **Icons**: Lucide React
+- **Components**: custom modern components (glassmorphism style)
+- **API**: Axios instance `@/shared/api/axios` (talks to the FastAPI backend on :8006)
+
+## 🚀 Development Guide
+
+### Install Dependencies
+
+```bash
+npm install
+```
+
+### Start the Dev Server
+
+Runs on port **3002** by default (configured in `package.json`):
+
+```bash
+npm run dev
+# Open: http://localhost:3002
+```
+
+### Directory Structure
+
+```
+src/
+├── app/                   # Page entry points (thin)
+│   ├── page.tsx           # Video generation home page
+│   ├── publish/           # Publish management page
+│   │   └── page.tsx
+│   └── layout.tsx         # Global layout (navigation bar)
+├── features/
+│   ├── home/
+│   │   ├── model/          # Home business logic (hooks)
+│   │   └── ui/             # Home UI components
+│   └── publish/
+│       ├── model/          # Publish business logic (hooks)
+│       └── ui/             # Publish UI components
+├── shared/
+│   ├── api/                # API instance
+│   ├── hooks/              # Shared hooks
+│   └── lib/                # Utility functions
+└── components/             # UI reused across pages
+```
+
+## 🔌 Backend Integration
+
+- **Base URL**: `http://localhost:8006` (SSR) / relative paths (client)
+- **URL helpers**: `@/shared/lib/media` provides `resolveMediaUrl` / `resolveAssetUrl`
+- **Proxying**: Next.js rewrites (if needed) or direct CORS.
+
+## 🎨 Design Guidelines
+
+- **Primary palette**: deep purple / black (dark mode)
+- **Interaction**: subtle hover animations
+- **Responsive**: tuned for large desktop screens
--- a/models/LatentSync/DEPLOY.md
+++ b/models/LatentSync/DEPLOY.md
--- a/Docs/Logs.md
+++ b/Docs/Logs.md
@@ -1,29 +0,0 @@
-rongye@r730-ubuntu:~/ProgramFiles/Supabase$ docker compose up -d
-[+] up 136/136
- ✔ Image timberio/vector:0.28.1-alpine      Pulled                                                                63.3ss
- ✔ Image supabase/storage-api:v1.33.0       Pulled                                                                78.6ss
- ✔ Image darthsim/imgproxy:v3.30.1          Pulled                                                                151.9s
- ✔ Image supabase/postgres-meta:v0.95.1     Pulled                                                                87.5ss
- ✔ Image supabase/logflare:1.27.0           Pulled                                                                229.2s
- ✔ Image supabase/postgres:15.8.1.085       Pulled                                                                268.3s
- ✔ Image supabase/supavisor:2.7.4           Pulled                                                                101.6s
- ✔ Image supabase/realtime:v2.68.0          Pulled                                                                56.5ss
- ✔ Image postgrest/postgrest:v14.1          Pulled                                                                201.8s
- ✔ Image supabase/edge-runtime:v1.69.28     Pulled                                                                254.0s
- ✔ Network supabase_default                 Created                                                               0.1s
- ✔ Volume supabase_db-config                Created                                                               0.1s
- ✔ Container supabase-vector                Healthy                                                               16.9s
- ✔ Container supabase-imgproxy              Created                                                               7.4s
- ✔ Container supabase-db                    Healthy                                                               20.6s
- ✔ Container supabase-analytics             Created                                                               0.4s
- ✔ Container supabase-edge-functions        Created                                                               1.8s
- ✔ Container supabase-auth                  Created                                                               1.7s
- ✔ Container supabase-studio                Created                                                               2.0s
- ✔ Container realtime-dev.supabase-realtime Created                                                               1.7s
- ✔ Container supabase-pooler                Created                                                               1.8s
- ✔ Container supabase-kong                  Created                                                               1.7s
- ✔ Container supabase-meta                  Created                                                               2.0s
- ✔ Container supabase-rest                  Created                                                               0.9s
- ✔ Container supabase-storage               Created                                                               1.4s
-Error response from daemon: failed to set up container networking: driver failed programming external connectivity on endpoint supabase-analytics (2fd60a510a1f16bf29f8f5140f14ef457a284c5b65a2567b7be250a4f9708f34): failed to bind host port 0.0.0.0:4000/tcp: address already in use
-[ble: exit 1]
--- a/Docs/QWEN3_TTS_DEPLOY.md
+++ b/Docs/QWEN3_TTS_DEPLOY.md
@@ -0,0 +1,384 @@
+# Qwen3-TTS 1.7B Deployment Guide
+
+> This document describes how to deploy the Qwen3-TTS 1.7B-Base voice-cloning model on an Ubuntu server.
+
+## System Requirements
+
+| Requirement | Spec |
+|------|------|
+| GPU | NVIDIA RTX 3090 24GB (or better) |
+| VRAM | ≥ 8GB (inference), ≥ 12GB (with flash-attn) |
+| CUDA | 12.1+ |
+| Python | 3.10.x |
+| OS | Ubuntu 20.04+ |
+
+---
+
+## GPU Allocation
+
+| GPU | Service | Model |
+|-----|------|------|
+| GPU0 | **Qwen3-TTS** | 1.7B-Base (voice cloning, higher quality) |
+| GPU1 | LatentSync | 1.6 (lip sync) |
+---
+
+## Step 1: Clone the Repository
+
+```bash
+cd /home/rongye/ProgramFiles/ViGent2/models
+git clone https://github.com/QwenLM/Qwen3-TTS.git
+cd Qwen3-TTS
+```
+
+---
+
+## Step 2: Create a Conda Environment
+
+```bash
+# Create a new conda environment
+conda create -n qwen-tts python=3.10 -y
+conda activate qwen-tts
+```
+
+---
+
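+## Step 3: Install Python Dependencies
+
+```bash
+cd /home/rongye/ProgramFiles/ViGent2/models/Qwen3-TTS
+
+# Install the qwen-tts package (editable mode)
+pip install -e .
+
+# Install the sox audio processing tool (required)
+conda install -y -c conda-forge sox
+```
+
+### Optional: Install FlashAttention (strongly recommended)
+
+FlashAttention can noticeably speed up inference (load time reduced by 85%) and reduce VRAM usage:
+
+```bash
+pip install -U flash-attn --no-build-isolation
+```
+
+If the machine runs short of RAM during the build, limit the number of parallel compile jobs: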
+
+```bash
+MAX_JOBS=4 pip install -U flash-attn --no-build-isolation
+```
+
+---
+
+## Step 4: Download Model Weights
+
+### Option A: ModelScope (recommended, faster within mainland China)
+
+```bash
+pip install modelscope
+
+# Download the Tokenizer (651MB)
+modelscope download --model Qwen/Qwen3-TTS-Tokenizer-12Hz --local_dir ./checkpoints/Tokenizer
+
+# Download the 1.7B-Base model (6.8GB)
+modelscope download --model Qwen/Qwen3-TTS-12Hz-1.7B-Base --local_dir ./checkpoints/1.7B-Base
+```
+
+### Option B: HuggingFace
+
+```bash
+pip install -U "huggingface_hub[cli]"
+
+huggingface-cli download Qwen/Qwen3-TTS-Tokenizer-12Hz --local-dir ./checkpoints/Tokenizer
+huggingface-cli download Qwen/Qwen3-TTS-12Hz-1.7B-Base --local-dir ./checkpoints/1.7B-Base
+```
+
+After the download finishes, the directory layout should look like this:
+
+```
+checkpoints/
+├── Tokenizer/       # ~651MB
+│   ├── config.json
+│   ├── model.safetensors
+│   └── ...
+└── 1.7B-Base/       # ~6.8GB
+    ├── config.json
+    ├── model.safetensors
+    └── ...
+```
+
+---
+
+## Step 5: Verify the Installation
+
+### 5.1 Check the Environment
+
+```bash
+conda activate qwen-tts
+
+# Check PyTorch and CUDA
+python -c "import torch; print(f'PyTorch: {torch.__version__}'); print(f'CUDA: {torch.cuda.is_available()}')"
+
+# Check sox
+sox --version
+```
+
+### 5.2 Run an Inference Test
+
+Create a test script `test_inference.py`:
+
+```python
+"""Qwen3-TTS voice clone test"""
+import torch
+import soundfile as sf
+from qwen_tts import Qwen3TTSModel
+
+print("Loading Qwen3-TTS model on GPU:0...")
+model = Qwen3TTSModel.from_pretrained(
+    "./checkpoints/1.7B-Base",
+    device_map="cuda:0",
+    dtype=torch.bfloat16,
+)
+print("Model loaded!")
+
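+# Test voice cloning (a reference audio clip must be prepared first)
+ref_audio = "./examples/myvoice.wav"  # 3-20 second reference audio
+ref_text = "参考音频的文字内容"  # replace with the exact transcript of the reference audio
+
+test_text = "这是一段测试文本，用于验证声音克隆功能是否正常工作。"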
+
+print("Generating cloned voice...")
+wavs, sr = model.generate_voice_clone(
+    text=test_text,
+    language="Chinese",
+    ref_audio=ref_audio,
+    ref_text=ref_text,
+)
+
+sf.write("test_output.wav", wavs[0], sr)
+print(f"✅ Saved: test_output.wav | {sr}Hz | {len(wavs[0])/sr:.2f}s")
+```
+
+Run the test:
+
+```bash
+cd /home/rongye/ProgramFiles/ViGent2/models/Qwen3-TTS
+python test_inference.py
+```
+
+---
+
+## Step 6: Install HTTP Service Dependencies
+
+```bash
+conda activate qwen-tts
+pip install fastapi uvicorn python-multipart
+```
+
+---
+
+## Step 7: Start the Service (managed by PM2)
+
+### Manual Test
+
+```bash
+conda activate qwen-tts
+cd /home/rongye/ProgramFiles/ViGent2/models/Qwen3-TTS
+python qwen_tts_server.py
+```
+
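+Visit http://localhost:8009/health to verify the service status.
+
+### Persistent Service with PM2
+
+> ⚠️ **Note**: the launch script `run_qwen_tts.sh` lives in the project **root directory**, not in models/Qwen3-TTS.
+
+1. Start with the launch script: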
+```bash
+cd /home/rongye/ProgramFiles/ViGent2
+pm2 start ./run_qwen_tts.sh --name vigent2-qwen-tts
+pm2 save
+```
+
+2. View logs:
+```bash
+pm2 logs vigent2-qwen-tts
+```
+
+3. Restart the service:
+```bash
+pm2 restart vigent2-qwen-tts
+```
+
+---
+
+## Directory Structure
+
+After deployment, the directory layout should look like this:
+
+```
+/home/rongye/ProgramFiles/ViGent2/
+├── run_qwen_tts.sh              # PM2 launch script (root directory)
+└── models/Qwen3-TTS/
+    ├── checkpoints/
+    │   ├── Tokenizer/           # Speech codec
+    │   └── 1.7B-Base/           # Voice-cloning model (higher quality)
+    ├── qwen_tts/                # Source code
+    │   ├── inference/
+    │   ├── models/
+    │   └── ...
+    ├── examples/
+    │   └── myvoice.wav          # Reference audio
+    ├── qwen_tts_server.py       # HTTP inference service (port 8009)
+    ├── pyproject.toml
+    ├── requirements.txt
+    └── test_inference.py        # Test script
+```
+
+---
+
+## API Reference
+
+### Health Check
+
+```
+GET http://localhost:8009/health
+```
+
+Response:
+```json
+{
+  "service": "Qwen3-TTS Voice Clone",
+  "model": "1.7B-Base",
+  "ready": true,
+  "gpu_id": 0
+}
+```
+
+### Voice Clone Generation
+
+```
+POST http://localhost:8009/generate
+Content-Type: multipart/form-data
+
+Fields:
+  - ref_audio: reference audio file (WAV)
+  - text: text to synthesize
+  - ref_text: transcript of the reference audio
+  - language: language (default: Chinese)
+
+Response: audio/wav file
+```
+
+---
+
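+## Model Notes
+
+### Available Models
+
+| Model | Capability | Size |
+|------|------|------|
+| 0.6B-Base | Fast voice cloning from 3 seconds of audio | 2.4GB |
+| 0.6B-CustomVoice | 9 preset voices | 2.4GB |
+| **1.7B-Base** | **Voice cloning (higher quality)** ✅ currently in use | 6.8GB |
+| 1.7B-VoiceDesign | Generates a voice from a natural-language description | 6.8GB |
+
+### Supported Languages
+
+Chinese, English, Japanese, Korean, German, French, Russian, Portuguese, Spanish, Italian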
+
+---
+
+## Troubleshooting
+
+### sox not found
+
+```
+SoX could not be found!
+```
+
+**Fix**: install sox through conda:
+
+```bash
+conda install -y -c conda-forge sox
+```
+
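+### CUDA out of memory
+
+Qwen3-TTS 1.7B typically needs 8-10GB of VRAM. If you hit OOM:
+
+1. Make sure nothing else is running on GPU0
+2. Try running without flash-attn and compare VRAM usage (the system requirements table lists a higher figure with flash-attn)
+3. Use a shorter reference audio clip (3-5 seconds)
+4. If VRAM is still insufficient, fall back to the 0.6B-Base model
+
+### Model fails to load
+
+Make sure the following files exist:
+- `checkpoints/1.7B-Base/config.json`
+- `checkpoints/1.7B-Base/model.safetensors`
+
+### Audio output quality problems
+
+1. Reference audio quality: use a clear, noise-free 3-10 second clip
+2. ref_text accuracy: the transcript of the reference audio must be accurate
+3. Language setting: make sure the `language` parameter matches the language of the text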
+
+---
+
+## ViGent2 Backend Integration
+
+### Voice Clone Service (`voice_clone_service.py`)
+
+The backend calls the Qwen3-TTS service over HTTP:
+
+```python
+import aiohttp
+
+QWEN_TTS_URL = "http://localhost:8009"
+
+async def generate_cloned_audio(ref_audio_path: str, text: str, ref_text: str, output_path: str):
+    async with aiohttp.ClientSession() as session:
+        with open(ref_audio_path, "rb") as f:
+            data = aiohttp.FormData()
+            data.add_field("ref_audio", f, filename="ref.wav")
+            data.add_field("text", text)
+            # ref_text is one of the /generate form fields listed in the API reference
+            data.add_field("ref_text", ref_text)
+
+            async with session.post(f"{QWEN_TTS_URL}/generate", data=data) as resp:
+                resp.raise_for_status()  # fail loudly instead of saving an error body as audio
+                audio_data = await resp.read()
+                with open(output_path, "wb") as out:
+                    out.write(audio_data)
+    return output_path
+```
+
+### Supabase Bucket for Reference Audio
+
+```sql
+-- Create the ref-audios bucket
+INSERT INTO storage.buckets (id, name, public)
+VALUES ('ref-audios', 'ref-audios', true)
+ON CONFLICT (id) DO NOTHING;
+
+-- RLS policy
+CREATE POLICY "Allow public uploads" ON storage.objects
+FOR INSERT TO anon WITH CHECK (bucket_id = 'ref-audios');
+```
+
+---
+
+## Changelog
+
+| Date | Version | Notes |
+|------|------|------|
+| 2026-01-30 | 1.1.0 | Made 1.7B-Base the explicit default model, replacing the old 0.6B paths |
+
+---
+
+## References
+
+- [Qwen3-TTS GitHub](https://github.com/QwenLM/Qwen3-TTS)
+- [ModelScope models](https://modelscope.cn/collections/Qwen/Qwen3-TTS)
+- [HuggingFace models](https://huggingface.co/collections/Qwen/qwen3-tts)
+- [Technical report](https://arxiv.org/abs/2601.15621)
+- [Official blog](https://qwen.ai/blog?id=qwen3tts-0115)
--- a/Docs/SUBTITLE_DEPLOY.md
+++ b/Docs/SUBTITLE_DEPLOY.md
@@ -0,0 +1,282 @@
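+# ViGent2 Subtitle and Title Deployment Guide
+
+This document explains how to deploy ViGent2's per-character highlighted subtitles and intro title features.
+
+## Feature Overview
+
+| Feature | Description |
+|------|------|
+| **Per-character highlighted subtitles** | faster-whisper generates character-level timestamps and Remotion renders a karaoke effect |
+| **Intro title** | A title shown at the start of the video with fade-in/fade-out animation, disappearing after a few seconds |
+
+## Architecture
+
+```
+Previous pipeline:
+  text → EdgeTTS → audio → LatentSync → FFmpeg compositing → final video
+
+New pipeline:
+  text → EdgeTTS → audio ─┬→ LatentSync → lip-synced video ──┐
+                          └→ faster-whisper → subtitle JSON ─┴→ Remotion compositing → final video
+```
+
+## System Requirements
+
+| Component | Requirement |
+|------|------|
+| Node.js | 18+ |
+| Python | 3.10+ |
+| GPU VRAM | faster-whisper needs roughly 3-4GB of VRAM |
+| FFmpeg | Installed |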
+
+---
+
+## Deployment Steps
+
+### Step 1: Install faster-whisper (Python)
+
+```bash
+cd /home/rongye/ProgramFiles/ViGent2/backend
+source venv/bin/activate
+
+# Install faster-whisper (quote the spec so the shell does not treat ">=" as a redirect)
+pip install "faster-whisper>=1.0.0" -i https://pypi.tuna.tsinghua.edu.cn/simple
+```
+
+> **Note**: on first run, faster-whisper automatically downloads the `large-v3` Whisper model (~3GB)
+
+### Step 2: Install Remotion (Node.js)
+
+```bash
+cd /home/rongye/ProgramFiles/ViGent2/remotion
+
+# Install dependencies
+npm install
+```
+
+### Step 3: Restart the Backend Service
+
+```bash
+pm2 restart vigent2-backend
+```
+
+### Step 4: Verify the Installation
+
+```bash
+# Check that faster-whisper installed correctly
+cd /home/rongye/ProgramFiles/ViGent2/backend
+source venv/bin/activate
+python -c "from faster_whisper import WhisperModel; print('faster-whisper OK')"
+
+# Check that Remotion installed correctly
+cd /home/rongye/ProgramFiles/ViGent2/remotion
+npx remotion --version
+```
+
+---
+
+## File Structure
+
+### New Backend Files
+
+| File | Description |
+|------|------|
+| `backend/app/services/whisper_service.py` | Subtitle alignment service (built on faster-whisper) |
+| `backend/app/services/remotion_service.py` | Remotion rendering service |
+
+### Remotion Project Structure
+
+```
+remotion/
+├── package.json              # Node.js dependency manifest
+├── tsconfig.json             # TypeScript configuration
+├── render.ts                 # Server-side render script
+└── src/
+    ├── index.ts              # Remotion entry point
+    ├── Root.tsx              # Root component
+    ├── Video.tsx             # Main video component
+    ├── components/
+    │   ├── Title.tsx         # Intro title component
+    │   ├── Subtitles.tsx     # Per-character highlighted subtitle component
+    │   └── VideoLayer.tsx    # Video layer component
+    ├── utils/
+    │   └── captions.ts       # Subtitle data helpers
+    └── fonts/                # Font directory (optional)
+```
+
+---
+
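+## API Parameters
+
+The video generation API (`POST /api/videos/generate`) gains the following parameters:
+
+| Parameter | Type | Default | Description |
+|------|------|--------|------|
+| `title` | string | null | Video title (shown in the intro, optional) |
+| `enable_subtitles` | boolean | true | Whether to enable per-character highlighted subtitles |
+
+### Request Example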
+
+```json
+{
+  "material_path": "https://...",
+  "text": "大家好，欢迎来到我的频道",
+  "tts_mode": "edgetts",
+  "voice": "zh-CN-YunxiNeural",
+  "title": "今日分享",
+  "enable_subtitles": true
+}
+```
+
+---
+
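+## Video Generation Pipeline
+
+Progress allocation in the new pipeline:
+
+| Stage | Progress | Description |
+|------|------|------|
+| Download material | 0% → 5% | Download the input video from Supabase |
+| TTS audio generation | 5% → 25% | EdgeTTS or Qwen3-TTS generates the audio |
+| Lip sync | 25% → 80% | LatentSync inference |
+| Subtitle alignment | 80% → 85% | faster-whisper generates character-level timestamps |
+| Remotion rendering | 85% → 95% | Composite subtitles and title |
+| Upload result | 95% → 100% | Upload to Supabase Storage |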
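+
+The table amounts to a linear mapping from each stage's local progress onto a slice of the overall 0-100 range. The sketch below copies the ranges from the table; the function names are hypothetical, and it is written in TypeScript for a client that wants to label the current stage. It is not the backend's actual code.
+
+```typescript
+interface Stage {
+  name: string;
+  start: number; // overall progress (%) where the stage begins
+  end: number;   // overall progress (%) where the stage ends
+}
+
+// Ranges copied from the progress table.
+const STAGES: Stage[] = [
+  { name: "Download material", start: 0, end: 5 },
+  { name: "TTS audio generation", start: 5, end: 25 },
+  { name: "Lip sync", start: 25, end: 80 },
+  { name: "Subtitle alignment", start: 80, end: 85 },
+  { name: "Remotion rendering", start: 85, end: 95 },
+  { name: "Upload result", start: 95, end: 100 },
+];
+
+// Maps a stage-local fraction (0..1) onto overall progress.
+function overallProgress(stageIndex: number, fraction: number): number {
+  const stage = STAGES[stageIndex];
+  const clamped = Math.min(1, Math.max(0, fraction));
+  return Math.round(stage.start + (stage.end - stage.start) * clamped);
+}
+
+// Finds the stage label to display for an overall progress value.
+function stageForProgress(progress: number): string {
+  for (const stage of STAGES) {
+    if (progress < stage.end) return stage.name;
+  }
+  return STAGES[STAGES.length - 1].name;
+}
+```
+
+Lip sync takes the widest slice (55 points), which matches its role as the longest-running step in the table.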
+
+---
+
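+## Fallback Handling
+
+The system degrades automatically so that core functionality keeps working:
+
+| Scenario | Handling |
+|------|----------|
+| Subtitle alignment fails | Skip subtitles and continue generating the video |
+| Remotion not installed | Composite directly with FFmpeg |
+| Remotion rendering fails | Fall back to FFmpeg compositing |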
+
+---
+
+## Configuration
+
+### Subtitle Service
+
+The subtitle service lives in `backend/app/services/whisper_service.py`. Defaults:
+
+| Parameter | Default | Description |
+|------|--------|------|
+| `model_size` | large-v3 | Whisper model size |
+| `device` | cuda | Device to run on |
+| `compute_type` | float16 | Compute precision |
+
+To change these, edit the `WhisperService` constructor arguments in `whisper_service.py`.
+
+### Remotion
+
+Remotion render parameters are configured in `backend/app/services/remotion_service.py`:
+
+| Parameter | Default | Description |
+|------|--------|------|
+| `fps` | 25 | Output frame rate |
+| `title_duration` | 3.0 | Title display duration (seconds) |
+
+---
+
+## Troubleshooting
+
+### faster-whisper
+
+**Problem**: `ModuleNotFoundError: No module named 'faster_whisper'`
+
+```bash
+cd /home/rongye/ProgramFiles/ViGent2/backend
+source venv/bin/activate
+pip install "faster-whisper>=1.0.0" -i https://pypi.tuna.tsinghua.edu.cn/simple
+```
+
+**Problem**: not enough GPU VRAM
+
+Edit `whisper_service.py` to use a smaller model:
+```python
+WhisperService(model_size="medium", compute_type="int8")
+```
+
+### Remotion
+
+**Problem**: `node_modules not found`
+
+```bash
+cd /home/rongye/ProgramFiles/ViGent2/remotion
+npm install
+```
+
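+**Problem**: Remotion render fails with an `fs` module error
+
+Make sure `remotion/src/utils/captions.ts` does not use Node.js's `fs` module. Remotion bundles this code for a browser environment, where `fs` is unavailable.
+
+**Problem**: Remotion render fails with a video file read error (`file://` protocol)
+
+Make sure `render.ts` passes the `publicDir` option pointing at the directory that contains the video, and that `VideoLayer.tsx` loads the video with `staticFile()`: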
+
+```typescript
+// render.ts
+const publicDir = path.dirname(path.resolve(options.videoPath));
+const bundleLocation = await bundle({
+  entryPoint: path.resolve(__dirname, './src/index.ts'),
+  publicDir,  // the key setting
+});
+
+// VideoLayer.tsx
+const videoUrl = staticFile(videoSrc);  // load through staticFile
+```
+
+**Problem**: Remotion render fails
+
+Check the backend logs:
+```bash
+pm2 logs vigent2-backend
+```
+
+### Checking Service Health
+
+```bash
+# Subtitle service health check
+cd /home/rongye/ProgramFiles/ViGent2/backend
+source venv/bin/activate
+python -c "from app.services.whisper_service import whisper_service; import asyncio; print(asyncio.run(whisper_service.check_health()))"
+
+# Remotion health check
+python -c "from app.services.remotion_service import remotion_service; import asyncio; print(asyncio.run(remotion_service.check_health()))"
+```
+
+---
+
+## Optional Optimizations
+
+### Add a Chinese Font
+
+For better subtitle rendering, add a Chinese font:
+
+```bash
+# Download the Noto Sans SC font
+cd /home/rongye/ProgramFiles/ViGent2/remotion/src/fonts
+wget https://github.com/googlefonts/noto-cjk/raw/main/Sans/OTF/SimplifiedChinese/NotoSansSC-Regular.otf -O NotoSansSC.otf
+```
+
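+### Using GPU 0
+
+faster-whisper uses GPU 0 by default, separate from LatentSync (GPU 1), to avoid VRAM contention. To pick a specific GPU, note that faster-whisper's own `WhisperModel` selects the card through `device_index` together with `device="cuda"`, so the `cuda:N` form below relies on `WhisperService` translating it:
+
+```python
+# Change this in whisper_service.py
+WhisperService(device="cuda:0")  # or "cuda:1"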
+```
+
+---
+
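+## Changelog
+
+| Date | Version | Notes |
+|------|------|------|
+| 2026-01-29 | 1.0.0 | Initial version: per-character highlighted subtitles and intro title built with faster-whisper + Remotion |
+| 2026-01-30 | 1.0.1 | Refined subtitle highlight styling and title animation for a clearer look |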
--- a/Docs/SUPABASE_DEPLOY.md
+++ b/Docs/SUPABASE_DEPLOY.md
@@ -57,6 +57,10 @@ STUDIO_PORT=3003
 # 如果配置了 Nginx 反代: https://api.hbyrkj.top
 # 如果直连: http://8.148.25.142:8008
 API_EXTERNAL_URL=https://api.hbyrkj.top
+
+# Studio 公网 API 地址 (通过公网访问 Studio 时必须配置)
+# 用于 Studio 前端调用 API
+SUPABASE_PUBLIC_URL=https://api.hbyrkj.top
 ```

 ### 4. 启动服务
@@ -67,7 +71,51 @@ docker compose up -d

 ---

-## 第二部分：安全访问配置 (Nginx)
+## 第二部分：Storage 本地文件结构
+
+### 1. 存储路径
+
+Supabase Storage 使用本地文件系统存储，路径结构如下：
+
+```
+/home/rongye/ProgramFiles/Supabase/volumes/storage/stub/stub/
+├── materials/                    # 素材桶
+│   └── {user_id}/               # 用户目录 (隔离)
+│       └── {timestamp}_{filename}/
+│           └── {internal_uuid}  # 实际文件 (Supabase 内部 UUID)
+└── outputs/                      # 输出桶
+    └── {user_id}/
+        └── {task_id}_output.mp4/
+            └── {internal_uuid}
+```
+
+### 2. 用户隔离策略
+
+所有用户数据通过路径前缀实现隔离：
+
+| 资源类型 | 路径格式 | 示例 |
+|----------|----------|------|
+| 素材 | `{bucket}/{user_id}/{timestamp}_{filename}` | `materials/abc123/1737000001_video.mp4` |
+| 输出 | `{bucket}/{user_id}/{task_id}_output.mp4` | `outputs/abc123/uuid-xxx_output.mp4` |
+| Cookie | `cookies/{user_id}/{platform}.json` | `cookies/abc123/bilibili.json` |
+
+### 3. 直接访问本地文件
+
+后端可以直接读取本地文件（跳过 HTTP），提升发布等操作的效率：
+
+```python
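+# storage.py
+from pathlib import Path
+from typing import Optional
+
+SUPABASE_STORAGE_LOCAL_PATH = Path("/home/rongye/ProgramFiles/Supabase/volumes/storage/stub/stub")
+
+def get_local_file_path(self, bucket: str, path: str) -> Optional[str]:
+    # Each object is stored as a directory holding one file named by an internal UUID
+    dir_path = SUPABASE_STORAGE_LOCAL_PATH / bucket / path
+    if not dir_path.is_dir():
+        return None  # object is not present on local disk
+    files = list(dir_path.iterdir())
+    return str(files[0]) if files else None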
+```
+
+---
+
+## 第三部分：安全访问配置 (Nginx)

 建议在阿里云公网网关上配置 Nginx 反向代理，通过 Frp 隧道连接内网服务。

@@ -78,19 +126,36 @@ docker compose up -d
 ### 2. Nginx 配置示例

 ```nginx
-# Studio (需要密码保护)
+# Studio (需要密码保护，但静态资源和内部API需排除)
 server {
    server_name supabase.hbyrkj.top;
-    
+
    # SSL 配置略...

+    # 静态资源不需要认证
+    location ~ ^/(favicon|_next|static)/ {
+        auth_basic off;
+        proxy_pass http://127.0.0.1:3003;
+        proxy_set_header Host $host;
+        proxy_http_version 1.1;
+    }
+
+    # Studio 内部 API 调用不需要认证
+    location /api/ {
+        auth_basic off;
+        proxy_pass http://127.0.0.1:3003;
+        proxy_set_header Host $host;
+        proxy_http_version 1.1;
+        proxy_set_header Upgrade $http_upgrade;
+        proxy_set_header Connection "upgrade";
+    }
+
+    # 其他路径需要 Basic Auth 认证
    location / {
-        # Basic Auth 保护后台
        auth_basic "Restricted Studio";
        auth_basic_user_file /etc/nginx/.htpasswd;
-
        proxy_pass http://127.0.0.1:3003;
-        
+
        # WebSocket 支持 (Realtime 必须)
        proxy_http_version 1.1;
        proxy_set_header Upgrade $http_upgrade;
@@ -101,23 +166,39 @@ server {
 # API (公开访问)
 server {
    server_name api.hbyrkj.top;
-    
+
    # SSL 配置略...

+    # ⚠️ 重要：解除上传大小限制
+    client_max_body_size 0;
+
    location / {
        proxy_pass http://127.0.0.1:8008;
-        
+
        # 允许 WebSocket
        proxy_http_version 1.1;
        proxy_set_header Upgrade $http_upgrade;
        proxy_set_header Connection "upgrade";
+
+        # 大文件上传超时设置
+        proxy_read_timeout 600s;
+        proxy_send_timeout 600s;
    }
 }
 ```

+### 3. 关键配置说明
+
+| 配置项 | 作用 | 必要性 |
+|--------|------|--------|
+| `client_max_body_size 0` | 解除上传大小限制（默认 1MB） | **必须** |
+| `proxy_read_timeout 600s` | 大文件上传/下载超时 | 推荐 |
+| `proxy_http_version 1.1` | WebSocket 支持 | Realtime 必须 |
+| `auth_basic` | Studio 访问保护 | 推荐 |
+
 ---

-## 第三部分：数据库与认证配置 (Database & Auth)
+## 第四部分：数据库与认证配置 (Database & Auth)

 ### 1. 初始化表结构 (Schema)

@@ -184,7 +265,7 @@ JWT_EXPIRE_HOURS=168

 ---

-## 第四部分：常用维护命令
+## 第五部分：常用维护命令

 **查看服务状态**:
 ```bash
--- a/Docs/implementation_plan.md
+++ b/Docs/implementation_plan.md
@@ -6,6 +6,7 @@
 - 上传静态人物视频 → 生成口播视频（唇形同步）
 - TTS 配音或声音克隆
 - 字幕自动生成与渲染
+- AI 自动生成标题与标签
 - 一键发布到多个社交平台

 ---
@@ -41,17 +42,28 @@

 | 模块 | 技术选择 | 备选方案 |
 |------|----------|----------|
-| **前端框架** | Next.js 14 | Vue 3 + Vite |
-| **UI 组件库** | Tailwind + shadcn/ui | Ant Design |
-| **后端框架** | FastAPI | Flask |
-| **任务队列** | Celery + Redis | RQ / Dramatiq |
-| **唇形同步** | **LatentSync 1.6** | MuseTalk / Wav2Lip |
-| **TTS 配音** | EdgeTTS | CosyVoice |
-| **声音克隆** | GPT-SoVITS (可选) | - |
-| **视频处理** | FFmpeg | MoviePy |
-| **自动发布** | social-auto-upload | 自行实现 |
-| **数据库** | SQLite → PostgreSQL | MySQL |
-| **文件存储** | 本地 / MinIO | 阿里云 OSS |
+| **前端框架** | Next.js 16 | Vue 3 + Vite |
+| **UI 组件库** | TailwindCSS (自定义组件) | Ant Design |
+| **后端框架** | FastAPI | Flask |
+| **任务队列** | FastAPI BackgroundTasks (asyncio) | Celery + Redis |
+| **唇形同步** | **LatentSync 1.6** | MuseTalk / Wav2Lip |
+| **TTS 配音** | EdgeTTS | CosyVoice |
+| **声音克隆** | **Qwen3-TTS 1.7B** ✅ | GPT-SoVITS |
+| **视频处理** | FFmpeg | MoviePy |
+| **自动发布** | Playwright | 自行实现 |
+| **数据库** | Supabase (PostgreSQL) | MySQL |
+| **文件存储** | Supabase Storage | 阿里云 OSS |
+
+> **修正 (18:10)**：当前实现采用 Next.js 16、FastAPI BackgroundTasks 与 Supabase Storage/Auth，自动发布基于 Playwright。
+
+---
+
+## ✅ 现状补充 (Day 17)
+
+- 前端已拆分为组件化结构（`features/home/ui/`），主页面逻辑集中。
+- 通用工具 `media.ts` 统一处理 API Base / 资源 URL / 日期格式化。
+- 作品预览弹窗统一样式，并支持素材/发布预览复用。
+- 标题/字幕预览按素材分辨率缩放，效果更接近成片。

 ---

@@ -59,24 +71,11 @@

 ### 阶段一：核心功能验证 (MVP)

-> **目标**：验证 MuseTalk + EdgeTTS 效果，跑通端到端流程
+> **目标**：验证 LatentSync + EdgeTTS 效果，跑通端到端流程

-#### 1.1 环境搭建
-
-```bash
-# 创建项目目录
-mkdir TalkingHeadAgent
-cd TalkingHeadAgent
-
-# 克隆 MuseTalk
-git clone https://github.com/TMElyralab/MuseTalk.git
-
-# 安装依赖
-cd MuseTalk
-pip install -r requirements.txt
-
-# 下载模型权重 (按官方文档)
-```
+#### 1.1 环境搭建
+
+参考 `models/LatentSync/DEPLOY.md` 完成 LatentSync 环境与权重部署。

 #### 1.2 集成 EdgeTTS

@@ -97,13 +96,13 @@ async def text_to_speech(text: str, voice: str = "zh-CN-YunxiNeural", output_pat
 # test_pipeline.py
 """
 1. 文案 → EdgeTTS → 音频
-2. 静态视频 + 音频 → MuseTalk → 口播视频
+2. 静态视频 + 音频 → LatentSync → 口播视频
 3. 添加字幕 → FFmpeg → 最终视频
 """
 ```

 #### 1.4 验证标准
- [ ] MuseTalk 能正常推理
+- [ ] LatentSync 能正常推理
 - [ ] 唇形与音频同步率 > 90%
 - [ ] 单个视频生成时间 < 2 分钟

@@ -141,25 +140,19 @@ backend/

 | 端点 | 方法 | 功能 |
 |------|------|------|
-| `/api/materials` | POST | 上传素材视频 | ✅ |
+| `/api/materials` | POST | 上传视频素材 | ✅ |
 | `/api/materials` | GET | 获取素材列表 | ✅ |
 | `/api/videos/generate` | POST | 创建视频生成任务 | ✅ |
-| `/api/tasks/{id}` | GET | 查询任务状态 | ✅ |
-| `/api/videos/{id}/download` | GET | 下载生成的视频 | ✅ |
+| `/api/videos/tasks/{id}` | GET | 查询任务状态 | ✅ |
+| `/api/videos/generated` | GET | 获取历史作品列表 | ✅ |
 | `/api/publish` | POST | 发布到社交平台 | ✅ |

-#### 2.3 Celery 任务定义
-
-```python
-# tasks/celery_tasks.py
-@celery.task
-def generate_video_task(material_id: str, text: str, voice: str):
-    # 1. TTS 生成音频
-    # 2. MuseTalk 唇形同步
-    # 3. FFmpeg 添加字幕
-    # 4. 保存并返回视频 URL
-    pass
-```
+#### 2.3 BackgroundTasks 任务定义
+
+```python
+# app/api/videos.py
+background_tasks.add_task(_process_video_generation, task_id, req, user_id)
+```

 ---

@@ -171,7 +164,7 @@ def generate_video_task(material_id: str, text: str, voice: str):

 | 页面 | 功能 |
 |------|------|
-| **素材库** | 上传/管理多场景素材视频 |
+| **素材库** | 上传/管理多场景视频素材 |
 | **生成视频** | 输入文案、选择素材、生成预览 |
 | **任务中心** | 查看生成进度、下载视频 |
 | **发布管理** | 绑定平台、一键发布、定时发布 |
@@ -182,9 +175,9 @@ def generate_video_task(material_id: str, text: str, voice: str):
 # 创建 Next.js 项目
 npx create-next-app@latest frontend --typescript --tailwind --app

-# 安装依赖
-cd frontend
-npm install @tanstack/react-query axios
+# 安装依赖
+cd frontend
+npm install axios swr
 ```

 ---
@@ -219,6 +212,7 @@ cp -r SuperIPAgent/social-auto-upload backend/social_upload
 | 功能 | 实现方式 |
 |------|----------|
 | **声音克隆** | 集成 GPT-SoVITS，用自己的声音 |
+| **AI 标题/标签生成** | 调用大模型 API 自动生成标题与标签 ✅ |
 | **批量生成** | 上传 Excel/CSV，批量生成视频 |
 | **字幕编辑器** | 可视化调整字幕样式、位置 |
 | **Docker 部署** | 一键部署到云服务器 | ✅ |
@@ -323,25 +317,42 @@ cp -r SuperIPAgent/social-auto-upload backend/social_upload
 - [x] 端口冲突解决 (3003/8008/8444)
 - [x] Basic Auth 管理后台保护

+### 阶段十七：声音克隆功能集成 (Day 13) ✅
+
+> **目标**：实现用户自定义声音克隆能力
+
+- [x] Qwen3-TTS HTTP 服务 (独立 FastAPI，端口 8009)
+- [x] 声音克隆服务封装 (voice_clone_service.py)
+- [x] 参考音频管理 API (上传/列表/删除)
+- [x] 前端 TTS 模式选择 UI
+- [x] Supabase ref-audios Bucket 配置
+- [x] 端到端测试验证
+
+### 阶段十八：手机号登录迁移 (Day 15) ✅
+
+> **目标**：将认证系统从邮箱迁移到手机号
+
+- [x] 数据库 Schema 迁移 (email → phone)
+- [x] 后端 API 适配 (auth.py/admin.py)
+- [x] 11位手机号校验 (正则验证)
+- [x] 修改密码功能 (/api/auth/change-password)
+- [x] 账户设置下拉菜单 (修改密码 + 有效期显示 + 退出)
+- [x] 前端登录/注册页面更新
+- [x] 数据库迁移脚本 (migrate_to_phone.sql)
+
+### 阶段十九：深度性能优化与服务守护 (Day 16) ✅
+
+> **目标**：提升系统响应速度与服务稳定性
+
+- [x] Flash Attention 2 集成 (Qwen3-TTS 加速 5x)
+- [x] LatentSync 性能调优 (OMP 线程限制 + 原生 Flash Attn)
+- [x] Watchdog 服务守护 (自动重启僵死服务)
+- [x] 文档体系更新 (部署手册与运维指南)
+
 ---

 ## 项目目录结构 (最终)

-```
-TalkingHeadAgent/
-├── frontend/                # Next.js 前端
-│   ├── app/
-│   ├── components/
-│   └── package.json
-├── backend/                 # FastAPI 后端
-│   ├── app/
-│   ├── MuseTalk/            # 唇形同步模型
-│   ├── social_upload/       # 社交发布模块
-│   └── requirements.txt
-├── docker-compose.yml       # 一键部署
-└── README.md
-```
-
 ---

 ## 开发时间估算
--- a/Docs/task_complete.md
+++ b/Docs/task_complete.md
@@ -1,321 +1,109 @@
-# ViGent 数字人口播系统 - 开发任务清单
+# ViGent2 开发任务清单 (Task Log)

-**项目**：ViGent2 数字人口播视频生成系统  
-**服务器**：Dell R730 (2× RTX 3090 24GB)  
-**更新时间**：2026-01-26  
-**整体进度**：100%（Day 10 HTTPS 部署与细节完善）
-
-## 📖 快速导航
-
-| 章节 | 说明 |
-|------|------|
-| [已完成任务](#-已完成任务) | Day 1-4 完成的功能 |
-| [后续规划](#️-后续规划) | 待办项目 |
-| [进度统计](#-进度统计) | 各模块完成度 |
-| [里程碑](#-里程碑) | 关键节点 |
-| [时间线](#-时间线) | 开发历程 |
-
-**相关文档**：
- [Day 日志](file:///d:/CodingProjects/Antigravity/ViGent2/Docs/DevLogs/) (Day1-Day9)
- [部署指南](file:///d:/CodingProjects/Antigravity/ViGent2/Docs/DEPLOY_MANUAL.md)
+**项目**: ViGent2 数字人口播视频生成系统  
+**进度**: 100% (Day 17 - 前端重构与体验优化)  
+**更新时间**: 2026-02-04

 ---

-## ✅ 已完成任务
+## 📅 对话历史与开发日志

-### 阶段一：核心功能验证
- [x] EdgeTTS 配音集成
- [x] FFmpeg 视频合成
- [x] MuseTalk 唇形同步 (代码集成)
- [x] 端到端流程验证
+> 这里记录了每一天的核心开发内容与里程碑。

-### 阶段二：后端 API 开发
- [x] FastAPI 项目搭建
- [x] 视频生成 API
- [x] 素材管理 API
- [x] 文件存储管理
+### Day 17: 前端重构与体验优化 (Current) 🚀
+- [x] **UI 组件拆分**: 首页拆分为独立组件，降低 `page.tsx` 复杂度。
+- [x] **轻量 FSD 迁移**: `app` 页面轻量化，逻辑集中到 `features/*/model`，通用能力下沉 `shared/*`。
+- [x] **Controller Hooks**: Home/Publish 页面逻辑集中到 Controller Hook，Page 仅组合渲染。
+- [x] **通用工具抽取**: `media.ts` 统一 API Base / URL / 日期格式化。
+- [x] **交互优化**: 选择项持久化、列表内定位、刷新回顶部、最新作品优先预览。
+- [x] **发布页改造**: 作品列表卡片化 + 搜索 + 预览弹窗。
+- [x] **预览体验**: 预览弹窗统一头部样式与提示文案。
+- [x] **预览一致性**: 标题/字幕预览按素材分辨率缩放。
+- [x] **标题同步与限制**: 片头标题同步发布标题，输入法合成态兼容，限制 15 字。
+- [x] **样式默认与持久化**: 默认样式与字号调整，刷新保留用户选择。
+- [x] **性能微优化**: 列表渲染优化 + 并行请求 + localStorage 防抖。
+- [x] **资源能力**: 字体/BGM 资源库 + `/api/assets` 接入。
+- [x] **音频与字幕修复**: BGM 混音稳定性与字幕断句优化。
+- [x] **持久化修复**: 接入 `useHomePersistence`，恢复 `isRestored` 逻辑并通过构建。
+- [x] **预览与选择修复**: 发布预览兼容签名 URL，音频试听路径解析，素材/BGM 回退有效项。
+- [x] **体验细节优化**: 录音预览 URL 回收，预览弹窗滚动恢复，全局任务提示挂载。
+
+### Day 16: 深度性能优化
+- [x] **Qwen3-TTS 加速**: 集成 Flash Attention 2，模型加载时间缩短至 8.9s。
+- [x] **服务守护**: 开发 `Watchdog` 看门狗机制，自动监控并重启僵死服务。
+- [x] **LatentSync 性能确认**: 验证 DeepCache + 原生 Flash Attn 生效。
+- [x] **文档重构**: 全面更新 README、部署手册及后端文档。

-### 阶段三：前端 Web UI
- [x] Next.js 项目初始化
- [x] 视频生成页面
- [x] 发布管理页面
- [x] 任务状态展示
+### Day 15: 手机号认证迁移
+- [x] **认证系统升级**: 从邮箱迁移至 11 位手机号注册/登录。
+- [x] **账户管理**: 新增修改密码、有效期显示、安全退出功能。
+- [x] **AI 文案助手**: 升级 GLM-4.7-Flash，支持 B站/抖音链接提取与洗稿。

-### 阶段四：社交媒体发布
- [x] Playwright 自动化框架
- [x] Cookie 管理功能
- [x] 多平台发布 UI
- [x] 定时发布功能 (Day 7)
- [x] QR码自动登录 (Day 7)
+### Day 14: AI 增强与体验优化
+- [x] **AI 标题/标签**: 集成 GLM-4 API 自动生成视频元数据。
+- [x] **字幕升级**: Remotion 逐字高亮字幕 (卡拉OK效果) 及动画片头。
+- [x] **模型升级**: Qwen3-TTS 升级至 1.7B-Base 版本。

-### 阶段五：部署与文档
- [x] 手动部署指南 (DEPLOY_MANUAL.md)
- [x] 一键部署脚本 (deploy.sh)
- [x] 环境配置模板 (.env.example)
- [x] 项目文档 (README.md)
- [x] 端口配置 (8006/3002)
+### Day 13: 声音克隆集成
+- [x] **声音克隆微服务**: 封装 Qwen3-TTS 为独立 API (8009端口)。
+- [x] **参考音频管理**: Supabase 存储桶配置与管理接口。
+- [x] **多模态 TTS**: 前端支持 EdgeTTS / Clone Voice 切换。

-### 阶段六：MuseTalk 服务器部署 (Day 2-3)
- [x] conda 环境配置 (musetalk)
- [x] 模型权重下载 (~7GB)
- [x] subprocess 调用方式实现
- [x] 健康检查功能
- [x] 实际推理调用验证 (Day 3 修复)
+### Day 12: 移动端适配
+- [x] **iOS 兼容**: 修复 Safari 安全区域、状态栏颜色、Cookie 拦截问题。
+- [x] **响应式 UI**: 移动端 Header 与发布页重构。

-### 阶段七：MuseTalk 完整修复 (Day 4)
- [x] 权重检测路径修复 (软链接)
- [x] 音视频长度不匹配修复 (audio_processor.py)
- [x] 推理脚本错误日志增强 (inference.py)
- [x] 视频合成 MP4 生成验证
- [x] 端到端流程完整测试
+### Day 11: 上传架构重构
+- [x] **直传优化**: 前端直传 Supabase Storage，解决 Nginx 30s 超时问题。
+- [x] **数据隔离**: 用户素材/视频按 UserID 物理隔离。

-### 阶段八：前端功能增强 (Day 5)
- [x] Web 视频上传功能
- [x] 上传进度显示
- [x] 自动刷新素材列表
+### Day 10: HTTPS 与安全
+- [x] **HTTPS 部署**: 配置 SSL 证书与 Nginx 反向代理。
+- [x] **安全加固**: Supabase Studio 增加 Basic Auth 保护。

-### 阶段九：唇形同步模型升级 (Day 6)
- [x] MuseTalk → LatentSync 1.6 迁移
- [x] 后端代码适配 (config.py, lipsync_service.py)
- [x] Conda 环境配置 (latentsync)
- [x] 模型权重部署指南
- [x] 服务器端到端验证
+### Day 9: 认证系统与发布闭环
+- [x] **用户系统**: 基于 Supabase Auth 实现 JWT 认证。
+- [x] **发布闭环**: 验证 B站/抖音/小红书 自动发布流程。
+- [x] **服务自愈**: 配置 PM2 进程守护。

-### 阶段十：性能优化 (Day 6)
- [x] 视频预压缩优化 (高分辨率自动压缩到720p)
- [x] 进度更新细化 (5% → 10% → 25% → ... → 100%)
- [x] LipSync 服务单例缓存
- [x] 健康检查缓存 (5分钟)
- [x] 异步子进程修复 (subprocess.run → asyncio)
- [x] 预加载模型服务 (常驻 Server + FastAPI)
- [x] 批量队列处理 (GPU 并发控制)
-
-### 阶段十一：社交媒体发布完善 (Day 7)
- [x] QR码自动登录 (Playwright headless)
- [x] 多平台上传器架构 (B站/抖音/小红书)
- [x] B站发布 (biliup官方库)
- [x] 抖音/小红书发布 (Playwright)
- [x] 定时发布功能
- [x] 前端发布UI优化
- [x] Cookie自动管理
- [x] UI一致性修复 (导航栏对齐、滚动条隐藏)
- [x] QR登录超时修复 (Stealth模式、多选择器fallback)
- [x] 文档规则优化 (智能修改标准、工具使用规范)
-
-### 阶段十二：用户体验优化 (Day 8)
- [x] 文件名保留 (时间戳前缀 + 原始名称)
- [x] 视频持久化 (从文件系统读取历史)
- [x] 历史视频列表组件
- [x] 素材/视频删除功能
- [x] 登出功能 (Logout API + 前端按钮)
- [x] 前端 SWR 轮询优化
- [x] QR 登录状态检测修复
-
-### 阶段十三：发布模块优化 (Day 9)
- [x] B站/抖音发布验证通过
- [x] 资源清理保障 (try-finally)
- [x] 超时保护 (消除无限循环)
- [x] 小红书 headless 模式修复
- [x] API 输入验证
- [x] 完整类型提示
- [x] 扫码登录等待界面 (加载动画)
- [x] 抖音/B站登录策略优化 (Text优先)
- [x] 发布成功审核提示
-
-### 阶段十四：用户认证系统 (Day 9)
- [x] Supabase 数据库表设计与部署
- [x] JWT 认证 (HttpOnly Cookie)
- [x] 用户注册/登录/登出 API
- [x] 管理员权限控制 (is_active)
- [x] 单设备登录限制 (Session Token)
- [x] 防止 Supabase 暂停 (GitHub Actions/Crontab)
- [x] 认证部署文档 (AUTH_DEPLOY.md)
-
-### 阶段十五：部署稳定性优化 (Day 9)
- [x] 后端依赖修复 (bcrypt/email-validator)
- [x] 前端生产环境构建修复 (npm run build)
- [x] LatentSync 性能卡顿修复 (OMP_NUM_THREADS限制)
- [x] 部署服务自愈 (PM2 配置优化)
- [x] 部署手册全量更新 (DEPLOY_MANUAL.md)
-
-### 阶段十六：HTTPS 部署与细节完善 (Day 10)
- [x] 隧道访问修复 (StaticFiles 挂载 + Rewrite)
- [x] 平台账号列表 500 错误修复 (paths.py)
- [x] Nginx HTTPS 配置 (反向代理 + SSL)
- [x] 浏览器标题修改 (ViGent)
- [x] 代码自适应 HTTPS 验证
- [x] **Supabase 自托管部署** (Docker, 3003/8008端口)
- [x] **安全加固** (Basic Auth 保护后台)
- [x] **端口冲突解决** (迁移 Analytics/Kong)
+### Day 1-8: 核心功能构建
+- [x] **Day 8**: 历史记录持久化与文件管理。
+- [x] **Day 7**: 社交媒体自动登录与多平台发布。
+- [x] **Day 6**: **LatentSync 1.6** 升级与服务器部署。
+- [x] **Day 5**: 前端视频上传与进度反馈。
+- [x] **Day 4**: MuseTalk (旧版) 口型同步修复。
+- [x] **Day 3**: 服务器环境配置与模型权重下载。
+- [x] **Day 1-2**: 项目基础框架 (FastAPI + Next.js) 搭建。

 ---

-## 🛤️ 后续规划
+## 🛤️ 后续规划 (Roadmap)

 ### 🔴 优先待办
- [ ] 批量视频生成架构设计
- [ ] 字幕样式编辑器集成
-
-### 🟠 功能完善
- [x] 定时发布功能 ✅ Day 7 完成
- [ ] 批量视频生成
- [ ] 字幕样式编辑器
+- [ ] **批量生成架构**: 支持 Excel 导入，批量生产视频。
+- [ ] **定时任务后台化**: 迁移前端触发的定时发布到后端 APScheduler。

 ### 🔵 长期探索
- [ ] 声音克隆 (GPT-SoVITS)
- [ ] Docker 容器化
- [ ] Celery 分布式任务队列
+- [ ] **容器化交付**: 提供完整的 Docker Compose 一键部署包。
+- [ ] **分布式队列**: 引入 Celery + Redis 处理超高并发任务。

 ---

-## 📊 进度统计
-
-### 总体进度
-```
-████████████████████ 100%
-```
-
-### 各模块进度
+## 📊 模块完成度

 | 模块 | 进度 | 状态 |
 |------|------|------|
-| 后端 API | 100% | ✅ 完成 |
-| 前端 UI | 100% | ✅ 完成 |
-| TTS 配音 | 100% | ✅ 完成 |
-| 视频合成 | 100% | ✅ 完成 |
-| 唇形同步 | 100% | ✅ LatentSync 1.6 升级完成 |
-| 社交发布 | 100% | ✅ Day 9 验证通过 |
-| 用户认证 | 100% | ✅ Day 9 Supabase+JWT |
-| 服务器部署 | 100% | ✅ Day 9 稳定性优化完成 |
+| **核心 API** | 100% | ✅ 稳定 |
+| **Web UI** | 100% | ✅ 稳定 (移动端适配) |
+| **唇形同步** | 100% | ✅ LatentSync 1.6 |
+| **TTS 配音** | 100% | ✅ EdgeTTS + Qwen3 |
+| **自动发布** | 100% | ✅ B站/抖音/小红书 |
+| **用户认证** | 100% | ✅ 手机号 + JWT |
+| **部署运维** | 100% | ✅ PM2 + Watchdog |

 ---

-## 🎯 里程碑
-
-### Milestone 1: 项目框架搭建 ✅
-**完成时间**: Day 1  
-**成果**: 
- FastAPI 后端 + Next.js 前端
- EdgeTTS + FFmpeg 集成
- 视频生成端到端验证
-
-### Milestone 2: 服务器部署 ✅
-**完成时间**: Day 3  
-**成果**: 
- PyTorch 2.0.1 + MMLab 环境修复
- 模型目录重组与权重补全
- MuseTalk 推理成功运行
-
-### Milestone 3: 口型同步完整修复 ✅
-**完成时间**: Day 4  
-**成果**: 
- 权重检测路径修复 (软链接)
- 音视频长度不匹配修复
- 视频合成 MP4 验证通过 (28MB → 3.8MB)
-
-### Milestone 4: LatentSync 1.6 升级 ✅
-**完成时间**: Day 6  
-**成果**: 
- MuseTalk → LatentSync 1.6 迁移
- 512×512 高分辨率唇形同步
- Latent Diffusion 架构升级
- 性能优化 (视频预压缩、进度更新)
-
-### Milestone 5: 用户认证系统 ✅
-**完成时间**: Day 9
-**成果**:
- Supabase 云数据库集成
- 安全的 JWT + HttpOnly Cookie 认证
- 管理员后台与用户隔离
- 完善的部署与保活方案
-
-### Milestone 6: 生产环境部署稳定化 ✅
-**完成时间**: Day 9
-**成果**:
- 修复了后端 (bcrypt) 和前端 (build) 的启动崩溃问题
- 解决了 LatentSync 占用全量 CPU 导致服务器卡顿的严重问题
- 完善了部署手册，记录了关键的 Troubleshooting 步骤
- 实现了服务 Long-term 稳定运行 (Reset PM2 counter)
-
---
-
-## 📅 时间线
-
-```
-Day 1: 项目初始化 + 核心功能   ✅ 完成
-       - 后端 API 框架
-       - 前端 UI
-       - TTS + 视频合成
-       - 社交发布框架
-       - 部署文档
-
-Day 2: 服务器部署 + MuseTalk   ✅ 完成
-       - 端口配置 (8006/3002)
-       - MuseTalk conda 环境初始化
-       - subprocess 调用实现
-       - 健康检查验证
-
-Day 3: 环境修复与验证          ✅ 完成
-       - PyTorch 降级 (2.5 -> 2.0.1)
-       - MMLab 依赖全量安装
-       - 模型权重补全 (dwpose, syncnet)
-       - 目录结构修复 (symlinks)
-       - 推理脚本验证 (生成593帧)
-
-Day 4: 口型同步完整修复        ✅ 完成
-       - 权重检测路径修复 (软链接)
-       - audio_processor.py 音视频长度修复
-       - inference.py 错误日志增强
-       - MP4 视频合成验证通过
-
-Day 5: 前端功能增强            ✅ 完成
-       - Web 视频上传功能
-       - 上传进度显示
-       - 自动刷新素材列表
-
-Day 6: LatentSync 1.6 升级     ✅ 完成
-       - MuseTalk → LatentSync 迁移
-       - 后端代码适配
-       - 模型部署指南
-       - 服务器部署验证
-       - 性能优化 (视频预压缩、进度更新)
-
-Day 7: 社交媒体发布完善     ✅ 完成
-       - QR码自动登录 (B站/抖音验证通过)
-       - 智能定位策略 (CSS/Text并行)
-       - 多平台发布 (B站/抖音/小红书)
-       - UI 一致性优化
-       - 文档规则体系优化
-
-Day 8: 用户体验优化          ✅ 完成
-       - 文件名保留 (时间戳前缀)
-       - 视频持久化 (历史视频API)
-       - 历史视频列表组件
-       - 素材/视频删除功能
-
-Day 9: 发布模块优化          ✅ 完成
-       - B站/抖音登录+发布验证通过
-       - 资源清理保障 (try-finally)
-       - 超时保护 (消除无限循环)
-       - 小红书 headless 模式修复
-       - 扫码登录等待界面 (加载动画)
-       - 抖音/B站登录策略优化 (Text优先)
-       - 发布成功审核提示
-       - 用户认证系统规划 (FastAPI+Supabase)
-       - Supabase 表结构设计 (users/sessions)
-       - 后端 JWT 认证实现 (auth.py/deps.py)
-       - 数据库配置与 SQL 部署
-       - 独立认证部署文档 (AUTH_DEPLOY.md)
-       - 自动保活机制 (Crontab/Actions)
-       - 部署稳定性优化 (Backend依赖修复)
-       - 前端生产构建流程修复
-       - LatentSync 严重卡顿修复 (线程数限制)
-       - 部署手册全量更新
-
-Day 10: HTTPS 部署与细节完善 ✅ 完成
-       - 隧道访问视频修正 (挂载 uploads)
-       - 账号列表 Bug 修复 (paths.py 白名单)
-       - 阿里云 Nginx HTTPS 部署
-       - UI 细节优化 (Title 更新)
-```
+## 📎 相关文档

+- [详细开发日志 (DevLogs)](file:///d:/CodingProjects/Antigravity/ViGent2/Docs/DevLogs/)
+- [部署手册 (DEPLOY_MANUAL)](file:///d:/CodingProjects/Antigravity/ViGent2/Docs/DEPLOY_MANUAL.md)
--- a/README.md
+++ b/README.md
@@ -1,32 +1,67 @@
 # ViGent2 - 数字人口播视频生成系统

-基于 **LatentSync 1.6 + EdgeTTS** 的开源数字人口播视频生成系统。
+<div align="center">

-> 📹 上传静态人物视频 → 🎙️ 输入口播文案 → 🎬 自动生成唇形同步视频
+> 📹 **上传人物** · 🎙️ **输入文案** · 🎬 **一键成片**
+
+基于 **LatentSync 1.6 + EdgeTTS** 的开源数字人口播视频生成系统。
+集成 **Qwen3-TTS** 声音克隆与自动社交媒体发布功能。
+
+[功能特性](#-功能特性) • [技术栈](#-技术栈) • [文档中心](#-文档中心) • [部署指南](Docs/DEPLOY_MANUAL.md)
+
+</div>

 ---

 ## ✨ 功能特性

- 🎬 **唇形同步** - LatentSync 1.6 驱动，512×512 高分辨率 Diffusion 模型
- 🎙️ **TTS 配音** - EdgeTTS 多音色支持（云溪、晓晓等）
- 📱 **全自动发布** - 扫码登录 + Cookie持久化，支持多平台(B站/抖音/小红书)定时发布
- 🖥️ **Web UI** - Next.js 现代化界面
- 🔐 **用户系统** - Supabase + JWT 认证，支持管理员后台、注册/登录、账号隔离
- 🚀 **性能优化** - 视频预压缩、常驻模型服务 (0s加载)
+### 核心能力
+- 🎬 **高清唇形同步** - LatentSync 1.6 驱动，512×512 高分辨率 Latent Diffusion 模型。
+- 🎙️ **多模态配音** - 支持 **EdgeTTS** (微软超自然语音) 和 **Qwen3-TTS** (3秒极速声音克隆)。
+- 📝 **智能字幕** - 集成 faster-whisper + Remotion，自动生成逐字高亮 (卡拉OK效果) 字幕。
+- 🎨 **样式预设** - 标题/字幕样式选择 + 预览 + 字号调节，支持自定义字体库。
+- 🖼️ **作品预览一致性** - 标题/字幕预览按素材分辨率缩放，效果更接近成片。
+- 💾 **用户偏好持久化** - 首页状态统一恢复/保存，刷新后延续上次配置。
+- 🎵 **背景音乐** - 试听 + 音量控制 + 混音，保持配音音量稳定。
+- 🤖 **AI 辅助创作** - 内置 GLM-4.7-Flash，支持 B站/抖音链接文案提取、AI 洗稿、标题/标签自动生成。
+
+### 平台化功能
+- 📱 **全自动发布** - 支持 B站、抖音、小红书定时发布，扫码登录 + Cookie 持久化。
+- 🖥️ **发布管理预览** - 支持签名 URL / 相对路径作品预览，确保可直接播放。
+- 🔐 **认证与隔离** - 基于 Supabase 的用户隔离，支持手机号注册/登录、密码管理。
+- 🛡️ **服务守护** - 内置 Watchdog 看门狗机制，自动监控并重启僵死服务，确保 7x24h 稳定运行。
+- 🚀 **性能优化** - 视频预压缩、模型常驻服务（近实时加载）、双 GPU 流水线并发。
+
+---

 ## 🛠️ 技术栈

-| 模块 | 技术 |
-|------|------|
-| 前端 | Next.js 14 + TypeScript + TailwindCSS |
-| 后端 | FastAPI + Python 3.10 |
-| 数据库 | **Supabase** (PostgreSQL) Local Docker |
-| 认证 | **JWT** + HttpOnly Cookie |
-| 唇形同步 | **LatentSync 1.6** (Latent Diffusion, 512×512) |
-| TTS | EdgeTTS |
-| 视频处理 | FFmpeg |
-| 自动发布 | Playwright |
+| 领域 | 核心技术 | 说明 |
+|------|----------|------|
+| **前端** | Next.js 16 | TypeScript, TailwindCSS, SWR |
+| **后端** | FastAPI | Python 3.10, AsyncIO, PM2 |
+| **数据库** | Supabase | PostgreSQL, Storage (本地/S3), Auth |
+| **唇形同步** | LatentSync 1.6 | PyTorch 2.5, Diffusers, DeepCache |
+| **声音克隆** | Qwen3-TTS | 1.7B 参数量，Flash Attention 2 加速 |
+| **自动化** | Playwright | 社交媒体无头浏览器自动化 |
+| **部署** | Docker & PM2 | 混合部署架构 |
+
+---
+
+## 📖 文档中心
+
+我们提供了详尽的开发与部署文档：
+
+### 部署运维
+- **[部署手册 (DEPLOY_MANUAL.md)](Docs/DEPLOY_MANUAL.md)** - 👈 **部署请看这里**！包含完整的环境搭建步骤。
+- [参考音频服务部署 (QWEN3_TTS_DEPLOY.md)](Docs/QWEN3_TTS_DEPLOY.md) - 声音克隆模型部署指南。
+- [LatentSync 部署指南](models/LatentSync/DEPLOY.md) - 唇形同步模型独立部署。
+- [用户认证部署 (AUTH_DEPLOY.md)](Docs/AUTH_DEPLOY.md) - Supabase 与 Auth 系统配置。
+
+### 开发文档
+- [后端开发指南](Docs/BACKEND_README.md) - 接口规范与开发流程。
+- [前端开发指南](Docs/FRONTEND_DEV.md) - UI 组件与页面规范。
+- [开发日志 (DevLogs)](Docs/DevLogs/) - 每日开发进度与技术决策记录。

 ---

@@ -34,138 +69,33 @@

 ```
 ViGent2/
-├── backend/              # FastAPI 后端
-│   ├── app/
-│   │   ├── api/          # API 路由
-│   │   ├── services/     # 核心服务 (TTS, LipSync, Video)
-│   │   └── core/         # 配置
-│   ├── requirements.txt
-│   └── .env.example
-├── frontend/             # Next.js 前端
-│   └── src/app/
-├── models/               # AI 模型
-│   └── LatentSync/       # 唇形同步模型
-│       └── DEPLOY.md     # LatentSync 部署指南
-└── Docs/                 # 文档
-    ├── DEPLOY_MANUAL.md  # 部署手册
-    ├── AUTH_DEPLOY.md    # 认证部署指南
-    ├── task_complete.md
-    └── DevLogs/
+├── backend/              # FastAPI 后端服务
+│   ├── app/              # 核心业务逻辑
+│   ├── scripts/          # 运维脚本 (Watchdog 等)
+│   └── tests/            # 测试用例
+├── frontend/             # Next.js 前端应用
+├── models/               # AI 模型仓库
+│   ├── LatentSync/       # 唇形同步服务
+│   └── Qwen3-TTS/        # 声音克隆服务
+└── Docs/                 # 项目文档
 ```

 ---

-## 🚀 快速开始
+## 🌐 服务架构

-### 1. 克隆项目
+系统采用微服务架构设计，各组件独立运行：

-```bash
-git clone <仓库地址> /home/rongye/ProgramFiles/ViGent2
-cd /home/rongye/ProgramFiles/ViGent2
-```
-
-### 2. 安装后端
-
-```bash
-cd backend
-python -m venv venv
-source venv/bin/activate  # Windows: venv\Scripts\activate
-pip install -r requirements.txt
-cp .env.example .env
-```
-
-### 3. 安装前端
-
-```bash
-cd frontend
-npm install
-```
-
-### 4. 安装 LatentSync (服务器)
-
-详见 [models/LatentSync/DEPLOY.md](models/LatentSync/DEPLOY.md)
-
-```bash
-# 创建独立 Conda 环境
-conda create -n latentsync python=3.10.13
-conda activate latentsync
-
-# 安装依赖并下载权重
-cd models/LatentSync
-pip install -r requirements.txt
-huggingface-cli download ByteDance/LatentSync-1.6 --local-dir checkpoints
-```
-
-### 5. 启动服务
-
-```bash
-# 终端 1: 后端 (端口 8006)
-cd backend && source venv/bin/activate
-uvicorn app.main:app --host 0.0.0.0 --port 8006
-
-# 终端 2: 前端 (端口 3002)
-cd frontend
-npm run dev -- -p 3002
-
-# 终端 3: LatentSync 服务 (端口 8007, 推荐启动)
-cd models/LatentSync
-nohup python -m scripts.server > server.log 2>&1 &
-```
+| 服务名称 | 端口 | 用途 |
+|----------|------|------|
+| **Web UI** | 3002 | 用户访问入口 (Next.js) |
+| **Backend API** | 8006 | 核心业务接口 (FastAPI) |
+| **LatentSync** | 8007 | 唇形同步推理服务 |
+| **Qwen3-TTS** | 8009 | 声音克隆推理服务 |
+| **Supabase** | 8008 | 数据库与认证网关 |

 ---

-## 🖥️ 服务器配置
+## ⚖️ License

-**目标服务器**: Dell PowerEdge R730
-
-| 配置 | 规格 |
-|------|------|
-| CPU | 2× Intel Xeon E5-2680 v4 (56 线程) |
-| 内存 | 192GB DDR4 |
-| GPU | 2× NVIDIA RTX 3090 24GB |
-| 存储 | 4.47TB |
-
-**GPU 分配**:
- GPU 0: 其他服务
- GPU 1: **LatentSync** 唇形同步 (~18GB VRAM)
-
---
-
-## 🌐 访问地址
-
-| 服务 | 地址 | 说明 |
-|------|------|------|
-| **视频生成 (UI)** | `https://vigent.hbyrkj.top` | 用户访问入口 |
-| **API 服务** | `http://<服务器IP>:8006` | 后端 Swagger |
-| **认证管理 (Studio)** | `https://supabase.hbyrkj.top` | 需要 Basic Auth |
-| **认证 API (Kong)** | `https://api.hbyrkj.top` | Supabase 接口 |
-| **模型服务** | `http://<服务器IP>:8007` | LatentSync |
-
---
-
-## 📖 文档
-
- [LatentSync 部署指南](models/LatentSync/DEPLOY.md)
- [手动部署指南](Docs/DEPLOY_MANUAL.md)
- [LatentSync 部署指南](models/LatentSync/DEPLOY.md)
- [手动部署指南](Docs/DEPLOY_MANUAL.md)
- [Supabase 部署指南](Docs/SUPABASE_DEPLOY.md)
- [开发日志](Docs/DevLogs/)
- [任务进度](Docs/task_complete.md)
-
---
-
-## 🆚 与 ViGent 的区别
-
-| 特性 | ViGent (v1) | ViGent2 |
-|------|-------------|---------|
-| 唇形同步模型 | MuseTalk v1.5 | **LatentSync 1.6** |
-| 分辨率 | 256×256 | **512×512** |
-| 架构 | GAN | **Latent Diffusion** |
-| 视频预处理 | 无 | **自动压缩优化** |
-
---
-
-## 📄 License
-
-MIT
+[MIT License](LICENSE) © 2026 ViGent Team
--- a/backend/.env.example
+++ b/backend/.env.example
@@ -13,28 +13,29 @@ DEFAULT_TTS_VOICE=zh-CN-YunxiNeural

 # =============== LatentSync 配置 ===============
 # GPU 选择 (0=第一块GPU, 1=第二块GPU)
-LATENTSYNC_GPU_ID=0
+LATENTSYNC_GPU_ID=1

+# 使用本地模式 (true) 或远程 API (false)
 # 使用本地模式 (true) 或远程 API (false)
 LATENTSYNC_LOCAL=true

 # 使用常驻服务 (Persistent Server) 加速
-LATENTSYNC_USE_SERVER=false
+LATENTSYNC_USE_SERVER=true

 # 远程 API 地址 (常驻服务默认端口 8007)
 # LATENTSYNC_API_URL=http://localhost:8007

 # 推理步数 (20-50, 越高质量越好，速度越慢)
-LATENTSYNC_INFERENCE_STEPS=20
+LATENTSYNC_INFERENCE_STEPS=40

 # 引导系数 (1.0-3.0, 越高唇同步越准，但可能抖动)
-LATENTSYNC_GUIDANCE_SCALE=1.5
+LATENTSYNC_GUIDANCE_SCALE=2.0

 # 启用 DeepCache 加速 (推荐开启)
 LATENTSYNC_ENABLE_DEEPCACHE=true

 # 随机种子 (设为 -1 则随机)
-LATENTSYNC_SEED=-1
+LATENTSYNC_SEED=1247

 # =============== 上传配置 ===============
 # 最大上传文件大小 (MB)
@@ -46,16 +47,22 @@ MAX_UPLOAD_SIZE_MB=500

 # =============== Supabase 配置 ===============
 # 从 Supabase 项目设置 > API 获取
-SUPABASE_URL=your_supabase_url_here
-SUPABASE_KEY=your_supabase_anon_key_here
+SUPABASE_URL=http://localhost:8008/
+SUPABASE_PUBLIC_URL=https://api.hbyrkj.top
+SUPABASE_KEY=your_supabase_service_role_key_here

 # =============== JWT 配置 ===============
 # 用于签名 JWT Token 的密钥 (请更换为随机字符串)
-JWT_SECRET_KEY=generate_your_secure_random_key_here
+JWT_SECRET_KEY=replace_with_a_long_random_secret
 JWT_ALGORITHM=HS256
 JWT_EXPIRE_HOURS=168

 # =============== 管理员配置 ===============
 # 服务启动时自动创建的管理员账号
-ADMIN_EMAIL=admin@example.com
-ADMIN_PASSWORD=change_this_password_immediately
+ADMIN_PHONE=13800000000
+ADMIN_PASSWORD=change_me_before_deploying
+
+# =============== GLM AI 配置 ===============
+# 智谱 GLM API 配置 (用于生成标题和标签)
+GLM_API_KEY=your_glm_api_key_here
+GLM_MODEL=glm-4.7-flash
--- a/backend/app/api/admin.py
+++ b/backend/app/api/admin.py
@@ -14,7 +14,7 @@ router = APIRouter(prefix="/api/admin", tags=["管理"])

 class UserListItem(BaseModel):
    id: str
-    email: str
+    phone: str
    username: Optional[str]
    role: str
    is_active: bool
@@ -36,7 +36,7 @@ async def list_users(admin: dict = Depends(get_current_admin)):
        return [
            UserListItem(
                id=u["id"],
-                email=u["email"],
+                phone=u["phone"],
                username=u.get("username"),
                role=u["role"],
                is_active=u["is_active"],
@@ -87,7 +87,7 @@ async def activate_user(
                detail="用户不存在"
            )
        
-        logger.info(f"管理员 {admin['email']} 激活用户 {user_id}, 有效期: {request.expires_days or '永久'} 天")
+        logger.info(f"管理员 {admin['phone']} 激活用户 {user_id}, 有效期: {request.expires_days or '永久'} 天")
        
        return {
            "success": True,
@@ -128,7 +128,7 @@ async def deactivate_user(
        # 清除用户 session
        supabase.table("user_sessions").delete().eq("user_id", user_id).execute()
        
-        logger.info(f"管理员 {admin['email']} 停用用户 {user_id}")
+        logger.info(f"管理员 {admin['phone']} 停用用户 {user_id}")
        
        return {"success": True, "message": "用户已停用"}
    except HTTPException:
@@ -171,7 +171,7 @@ async def extend_user(
            "expires_at": expires_at
        }).eq("id", user_id).execute()
        
-        logger.info(f"管理员 {admin['email']} 延长用户 {user_id} 授权 {request.expires_days or '永久'} 天")
+        logger.info(f"管理员 {admin['phone']} 延长用户 {user_id} 授权 {request.expires_days or '永久'} 天")
        
        return {
            "success": True,
--- a/backend/app/api/ai.py
+++ b/backend/app/api/ai.py
@@ -0,0 +1,45 @@
+"""
+AI 相关 API 路由
+"""
+
+from fastapi import APIRouter, HTTPException
+from pydantic import BaseModel
+from loguru import logger
+
+from app.services.glm_service import glm_service
+
+
+router = APIRouter(prefix="/api/ai", tags=["AI"])
+
+
+class GenerateMetaRequest(BaseModel):
+    """生成标题标签请求"""
+    text: str
+
+
+class GenerateMetaResponse(BaseModel):
+    """生成标题标签响应"""
+    title: str
+    tags: list[str]
+
+
+@router.post("/generate-meta", response_model=GenerateMetaResponse)
+async def generate_meta(req: GenerateMetaRequest):
+    """
+    AI 生成视频标题和标签
+
+    根据口播文案自动生成吸引人的标题和相关标签
+    """
+    if not req.text or not req.text.strip():
+        raise HTTPException(status_code=400, detail="口播文案不能为空")
+
+    try:
+        logger.info(f"Generating meta for text: {req.text[:50]}...")
+        result = await glm_service.generate_title_tags(req.text)
+        return GenerateMetaResponse(
+            title=result.get("title", ""),
+            tags=result.get("tags", [])
+        )
+    except Exception as e:
+        logger.error(f"Generate meta failed: {e}")
+        raise HTTPException(status_code=500, detail=str(e))
--- a/backend/app/api/assets.py
+++ b/backend/app/api/assets.py
@@ -0,0 +1,22 @@
+from fastapi import APIRouter, Depends
+
+from app.core.deps import get_current_user
+from app.services.assets_service import list_styles, list_bgm
+
+
+router = APIRouter()
+
+
+@router.get("/subtitle-styles")
+async def list_subtitle_styles(current_user: dict = Depends(get_current_user)):
+    return {"styles": list_styles("subtitle")}
+
+
+@router.get("/title-styles")
+async def list_title_styles(current_user: dict = Depends(get_current_user)):
+    return {"styles": list_styles("title")}
+
+
+@router.get("/bgm")
+async def list_bgm_items(current_user: dict = Depends(get_current_user)):
+    return {"bgm": list_bgm()}
--- a/backend/app/api/auth.py
+++ b/backend/app/api/auth.py
@@ -1,8 +1,8 @@
 """
-认证 API：注册、登录、登出
+认证 API：注册、登录、登出、修改密码
 """
 from fastapi import APIRouter, HTTPException, Response, status, Request
-from pydantic import BaseModel, EmailStr
+from pydantic import BaseModel, field_validator
 from app.core.supabase import get_supabase
 from app.core.security import (
    get_password_hash,
@@ -15,27 +15,55 @@ from app.core.security import (
 )
 from loguru import logger
 from typing import Optional
+import re

 router = APIRouter(prefix="/api/auth", tags=["认证"])


 class RegisterRequest(BaseModel):
-    email: EmailStr
+    phone: str
    password: str
    username: Optional[str] = None

+    @field_validator('phone')
+    @classmethod
+    def validate_phone(cls, v):
+        if not re.fullmatch(r'[0-9]{11}', v):
+            raise ValueError('手机号必须是11位数字')
+        return v
+

 class LoginRequest(BaseModel):
-    email: EmailStr
+    phone: str
    password: str

+    @field_validator('phone')
+    @classmethod
+    def validate_phone(cls, v):
+        if not re.fullmatch(r'[0-9]{11}', v):
+            raise ValueError('手机号必须是11位数字')
+        return v
+
+
+class ChangePasswordRequest(BaseModel):
+    old_password: str
+    new_password: str
+
+    @field_validator('new_password')
+    @classmethod
+    def validate_new_password(cls, v):
+        if len(v) < 6:
+            raise ValueError('新密码长度至少6位')
+        return v
+

 class UserResponse(BaseModel):
    id: str
-    email: str
+    phone: str
    username: Optional[str]
    role: str
    is_active: bool
+    expires_at: Optional[str] = None


@router.post("/register")
@@ -48,29 +76,29 @@ async def register(request: RegisterRequest):
    try:
        supabase = get_supabase()
        
-        # 检查邮箱是否已存在
+        # 检查手机号是否已存在
        existing = supabase.table("users").select("id").eq(
-            "email", request.email
+            "phone", request.phone
        ).execute()
        
        if existing.data:
            raise HTTPException(
                status_code=status.HTTP_400_BAD_REQUEST,
-                detail="该邮箱已注册"
+                detail="该手机号已注册"
            )
        
        # 创建用户
        password_hash = get_password_hash(request.password)
        
        result = supabase.table("users").insert({
-            "email": request.email,
+            "phone": request.phone,
            "password_hash": password_hash,
-            "username": request.username or request.email.split("@")[0],
+            "username": request.username or f"用户{request.phone[-4:]}",
            "role": "pending",
            "is_active": False
        }).execute()
        
-        logger.info(f"新用户注册: {request.email}")
+        logger.info(f"新用户注册: {request.phone}")
        
        return {
            "success": True,
@@ -100,21 +128,21 @@ async def login(request: LoginRequest, response: Response):
        
        # 查找用户
        user_result = supabase.table("users").select("*").eq(
-            "email", request.email
+            "phone", request.phone
        ).single().execute()
        
        user = user_result.data
        if not user:
            raise HTTPException(
                status_code=status.HTTP_401_UNAUTHORIZED,
-                detail="邮箱或密码错误"
+                detail="手机号或密码错误"
            )
        
        # 验证密码
        if not verify_password(request.password, user["password_hash"]):
            raise HTTPException(
                status_code=status.HTTP_401_UNAUTHORIZED,
-                detail="邮箱或密码错误"
+                detail="手机号或密码错误"
            )
        
        # 检查是否激活
@@ -154,17 +182,18 @@ async def login(request: LoginRequest, response: Response):
        # 设置 HttpOnly Cookie
        set_auth_cookie(response, token)
        
-        logger.info(f"用户登录: {request.email}")
+        logger.info(f"用户登录: {request.phone}")
        
        return {
            "success": True,
            "message": "登录成功",
            "user": UserResponse(
                id=user["id"],
-                email=user["email"],
+                phone=user["phone"],
                username=user.get("username"),
                role=user["role"],
-                is_active=user["is_active"]
+                is_active=user["is_active"],
+                expires_at=user.get("expires_at")
            )
        }
    except HTTPException:
@@ -184,6 +213,91 @@ async def logout(response: Response):
    return {"success": True, "message": "已登出"}


+@router.post("/change-password")
+async def change_password(request: ChangePasswordRequest, req: Request, response: Response):
+    """
+    修改密码
+    
+    - 验证当前密码
+    - 设置新密码
+    - 重新生成 session token
+    """
+    # 从 Cookie 获取用户
+    token = req.cookies.get("access_token")
+    if not token:
+        raise HTTPException(
+            status_code=status.HTTP_401_UNAUTHORIZED,
+            detail="未登录"
+        )
+    
+    token_data = decode_access_token(token)
+    if not token_data:
+        raise HTTPException(
+            status_code=status.HTTP_401_UNAUTHORIZED,
+            detail="Token 无效"
+        )
+    
+    try:
+        supabase = get_supabase()
+        
+        # 获取用户信息
+        user_result = supabase.table("users").select("*").eq(
+            "id", token_data.user_id
+        ).single().execute()
+        
+        user = user_result.data
+        if not user:
+            raise HTTPException(
+                status_code=status.HTTP_401_UNAUTHORIZED,
+                detail="用户不存在"
+            )
+        
+        # 验证当前密码
+        if not verify_password(request.old_password, user["password_hash"]):
+            raise HTTPException(
+                status_code=status.HTTP_400_BAD_REQUEST,
+                detail="当前密码错误"
+            )
+        
+        # 更新密码
+        new_password_hash = get_password_hash(request.new_password)
+        supabase.table("users").update({
+            "password_hash": new_password_hash
+        }).eq("id", user["id"]).execute()
+        
+        # 生成新的 session token，使旧 token 失效
+        new_session_token = generate_session_token()
+        
+        supabase.table("user_sessions").delete().eq(
+            "user_id", user["id"]
+        ).execute()
+        
+        supabase.table("user_sessions").insert({
+            "user_id": user["id"],
+            "session_token": new_session_token,
+            "device_info": None
+        }).execute()
+        
+        # 生成新的 JWT Token
+        new_token = create_access_token(user["id"], new_session_token)
+        set_auth_cookie(response, new_token)
+        
+        logger.info(f"用户修改密码: {user['phone']}")
+        
+        return {
+            "success": True,
+            "message": "密码修改成功"
+        }
+    except HTTPException:
+        raise
+    except Exception as e:
+        logger.error(f"修改密码失败: {e}")
+        raise HTTPException(
+            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
+            detail="修改密码失败，请稍后重试"
+        )
+
+
@router.get("/me")
 async def get_me(request: Request):
    """获取当前用户信息"""
@@ -216,8 +330,9 @@ async def get_me(request: Request):
    
    return UserResponse(
        id=user["id"],
-        email=user["email"],
+        phone=user["phone"],
        username=user.get("username"),
        role=user["role"],
-        is_active=user["is_active"]
+        is_active=user["is_active"],
+        expires_at=user.get("expires_at")
    )
--- a/backend/app/api/materials.py
+++ b/backend/app/api/materials.py
@@ -1,100 +1,338 @@
-from fastapi import APIRouter, UploadFile, File, HTTPException
+from fastapi import APIRouter, UploadFile, File, HTTPException, Request, BackgroundTasks, Depends
 from app.core.config import settings
-import shutil
+from app.core.deps import get_current_user
+from app.services.storage import storage_service
 import re
 import time
+import traceback
+import os
+import aiofiles
 from pathlib import Path
+from loguru import logger
+from pydantic import BaseModel
+from typing import Optional
+import httpx
+

 router = APIRouter()

-
 def sanitize_filename(filename: str) -> str:
-    """清理文件名，移除不安全字符"""
-    # 移除路径分隔符和特殊字符
    safe_name = re.sub(r'[<>:"/\\|?*]', '_', filename)
-    # 限制长度
    if len(safe_name) > 100:
        ext = Path(safe_name).suffix
        safe_name = safe_name[:100 - len(ext)] + ext
    return safe_name

+async def process_and_upload(temp_file_path: str, original_filename: str, content_type: str, user_id: str):
+    """Background task to strip multipart headers and upload to Supabase"""
+    try:
+        logger.info(f"Processing raw upload: {temp_file_path} for user {user_id}")
+
+        # 1. Analyze file to find actual video content (strip multipart boundaries)
+        # This is a simplified manual parser for a SINGLE file upload.
+        # Structure:
+        # --boundary
+        # Content-Disposition: form-data; name="file"; filename="..."
+        # Content-Type: video/mp4
+        # \r\n\r\n
+        # [DATA]
+        # \r\n--boundary--
+
+        # We need to read the first few KB to find the header end
+        start_offset = 0
+        end_offset = 0
+        boundary = b""
+
+        file_size = os.path.getsize(temp_file_path)
+
+        with open(temp_file_path, 'rb') as f:
+            # Read first 4KB to find header
+            head = f.read(4096)
+
+            # Find boundary
+            first_line_end = head.find(b'\r\n')
+            if first_line_end == -1:
+                raise Exception("Could not find boundary in multipart body")
+
+            boundary = head[:first_line_end] # e.g. --boundary123
+            logger.info(f"Detected boundary: {boundary}")
+
+            # Find end of headers (\r\n\r\n)
+            header_end = head.find(b'\r\n\r\n')
+            if header_end == -1:
+                raise Exception("Could not find end of multipart headers")
+
+            start_offset = header_end + 4
+            logger.info(f"Video data starts at offset: {start_offset}")
+
+            # Find end boundary (read from end of file)
+            # It should be \r\n + boundary + -- + \r\n
+            # We seek to end-200 bytes
+            f.seek(max(0, file_size - 200))
+            tail = f.read()
+
+            # The closing boundary is usually --boundary--
+            # We look for the last occurrence of the boundary
+            last_boundary_pos = tail.rfind(boundary)
+            if last_boundary_pos != -1:
+                # The payload ends just before the CRLF that precedes the closing boundary
+                end_pos_in_tail = last_boundary_pos
+                if end_pos_in_tail >= 2 and tail[end_pos_in_tail-2:end_pos_in_tail] == b'\r\n':
+                    end_pos_in_tail -= 2
+
+                # Convert the tail-relative position to an absolute file offset
+                end_offset = max(0, file_size - 200) + end_pos_in_tail
+            else:
+                logger.warning("Could not find closing boundary, assuming EOF")
+                end_offset = file_size
+
+            logger.info(f"Video data ends at offset: {end_offset}. Total video size: {end_offset - start_offset}")
+
+        # 2. Extract the video slice to a separate temp file
+        # Only the bytes between start_offset and end_offset belong to the video, so the
+        # slice is copied in chunks instead of being sliced in memory.
+
+        video_path = temp_file_path + "_video.mp4"
+        with open(temp_file_path, 'rb') as src, open(video_path, 'wb') as dst:
+            src.seek(start_offset)
+            # Copy in chunks
+            bytes_to_copy = end_offset - start_offset
+            copied = 0
+            while copied < bytes_to_copy:
+                chunk_size = min(1024*1024*10, bytes_to_copy - copied) # 10MB chunks
+                chunk = src.read(chunk_size)
+                if not chunk:
+                    break
+                dst.write(chunk)
+                copied += len(chunk)
+
+        logger.info(f"Extracted video content to {video_path}")
+
+        # 3. Upload to Supabase with user isolation
+        timestamp = int(time.time())
+        safe_name = re.sub(r'[^a-zA-Z0-9._-]', '', original_filename)
+        # Use user_id as the directory prefix to isolate each user's files
+        storage_path = f"{user_id}/{timestamp}_{safe_name}"
+
+        # Use storage service (this calls Supabase which might do its own http request)
+        # We read the cleaned video file
+        with open(video_path, 'rb') as f:
+            file_content = f.read() # Still reading into memory for simple upload call, but server has 32GB RAM so ok for 500MB
+            await storage_service.upload_file(
+                bucket=storage_service.BUCKET_MATERIALS,
+                path=storage_path,
+                file_data=file_content,
+                content_type=content_type
+            )
+
+        logger.info(f"Upload to Supabase complete: {storage_path}")
+
+        # Cleanup
+        os.remove(temp_file_path)
+        os.remove(video_path)
+
+        return storage_path
+
+    except Exception as e:
+        logger.error(f"Background upload processing failed: {e}\n{traceback.format_exc()}")
+        raise
+

@router.post("")
-async def upload_material(file: UploadFile = File(...)):
-    if not file.filename.lower().endswith(('.mp4', '.mov', '.avi')):
-        raise HTTPException(400, "Invalid format")
+async def upload_material(
+    request: Request,
+    background_tasks: BackgroundTasks,
+    current_user: dict = Depends(get_current_user)
+):
+    user_id = current_user["id"]
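+    # Log only non-sensitive headers; request.headers also carries the Authorization bearer token
+    logger.info(
+        f"ENTERED upload_material (Streaming Mode) for user {user_id}. "
+        f"Content-Type: {request.headers.get('content-type')}, "
+        f"Content-Length: {request.headers.get('content-length')}"
+    )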
    
-    # 使用时间戳+原始文件名（保留原始名称，避免冲突）
+    filename = "unknown_video.mp4" # Fallback
+    content_type = "video/mp4"
+    
+    # The frontend sends a standard multipart/form-data body, but it is read here as a raw
+    # stream, so the real filename is extracted from the part headers after the body is saved.
+    
+    # Create temp file
    timestamp = int(time.time())
-    safe_name = sanitize_filename(file.filename)
-    save_path = settings.UPLOAD_DIR / "materials" / f"{timestamp}_{safe_name}"
-    
-    # Save file
-    with open(save_path, "wb") as buffer:
-        shutil.copyfileobj(file.file, buffer)
-    
-    # Calculate size
-    size_mb = save_path.stat().st_size / (1024 * 1024)
-    
-    # 提取显示名称（去掉时间戳前缀）
-    display_name = safe_name
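+    # A random suffix keeps concurrent uploads within the same second from sharing a temp file
+    temp_filename = f"upload_{timestamp}_{os.urandom(4).hex()}.raw"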
+    temp_path = os.path.join("/tmp", temp_filename) # Use /tmp on Linux
+    # Ensure /tmp exists (it does) but verify paths
+    if os.name == 'nt': # Local dev
+        temp_path = f"d:/tmp/{temp_filename}"
+        os.makedirs("d:/tmp", exist_ok=True)
+
+    try:
+        total_size = 0
+        last_log = 0
        
-    return {
-        "id": save_path.stem,
-        "name": display_name, 
-        "path": f"uploads/materials/{save_path.name}",
-        "size_mb": size_mb,
-        "type": "video"
-    }
+        async with aiofiles.open(temp_path, 'wb') as f:
+            async for chunk in request.stream():
+                await f.write(chunk)
+                total_size += len(chunk)
+                
+                # Log progress every 20MB
+                if total_size - last_log > 20 * 1024 * 1024:
+                    logger.info(f"Receiving stream... Processed {total_size / (1024*1024):.2f} MB")
+                    last_log = total_size
+                    
+        logger.info(f"Stream reception complete. Total size: {total_size} bytes. Saved to {temp_path}")
+        
+        if total_size == 0:
+            raise HTTPException(400, "Received empty body")
+
+        # Processing (stripping the multipart framing and uploading to Supabase) is awaited
+        # rather than deferred, so the file already exists when the client refreshes the
+        # materials list.
+
+        # Extract the original filename from the multipart headers in the first 4 KB
+        with open(temp_path, 'rb') as f:
+            head = f.read(4096).decode('utf-8', errors='ignore')
+            match = re.search(r'filename="([^"]+)"', head)
+            if match:
+                filename = match.group(1)
+                logger.info(f"Extracted filename from body: {filename}")
+            
+        # Run processing sync (in await)
+        storage_path = await process_and_upload(temp_path, filename, content_type, user_id)
+
+        # Get signed URL (it exists now)
+        signed_url = await storage_service.get_signed_url(
+            bucket=storage_service.BUCKET_MATERIALS,
+            path=storage_path
+        )
+        
+        size_mb = total_size / (1024 * 1024) # Approximate (includes headers)
+
+        # Derive the display name from storage_path
+        display_name = storage_path.split('/')[-1]  # drop the user_id prefix
+        if '_' in display_name:
+            parts = display_name.split('_', 1)
+            if parts[0].isdigit():
+                display_name = parts[1]
+
+        return {
+            "id": storage_path,
+            "name": display_name,
+            "path": signed_url,
+            "size_mb": size_mb,
+            "type": "video"
+        }
+        
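+    except HTTPException:
+        # Keep intended status codes (e.g. 400 for an empty body) instead of masking them as 500
+        if os.path.exists(temp_path):
+            os.remove(temp_path)
+        raise
+    except Exception as e: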
+        error_msg = f"Streaming upload failed: {str(e)}"
+        detail_msg = f"Exception: {repr(e)}\nArgs: {e.args}\n{traceback.format_exc()}"
+        logger.error(error_msg + "\n" + detail_msg)
+        
+        # Write to debug file
+        try:
+            with open("debug_upload.log", "a") as logf:
+                logf.write(f"\n--- Error at {time.ctime()} ---\n")
+                logf.write(detail_msg)
+                logf.write("\n-----------------------------\n")
+        except OSError:
+            pass
+
+        if os.path.exists(temp_path):
+            try:
+                os.remove(temp_path)
+            except OSError:
+                pass
+        raise HTTPException(500, f"Upload failed. Check server logs. Error: {str(e)}")
+

@router.get("")
-async def list_materials():
-    materials_dir = settings.UPLOAD_DIR / "materials"
-    files = []
-    if materials_dir.exists():
-        for f in materials_dir.glob("*"):
-            try:
-                stat = f.stat()
-                # 提取显示名称：去掉时间戳前缀 (格式: {timestamp}_{原始文件名})
-                display_name = f.name
-                if '_' in f.name:
-                    parts = f.name.split('_', 1)
-                    if parts[0].isdigit():
-                        display_name = parts[1]  # 原始文件名
-                
-                files.append({
-                    "id": f.stem,
-                    "name": display_name,
-                    "path": f"uploads/materials/{f.name}",
-                    "size_mb": stat.st_size / (1024 * 1024),
-                    "type": "video",
-                    "created_at": stat.st_ctime
-                })
-            except Exception:
-                continue
-    # Sort by creation time desc
-    files.sort(key=lambda x: x.get("created_at", 0), reverse=True)
-    return {"materials": files}
-
-
-@router.delete("/{material_id}")
-async def delete_material(material_id: str):
-    """删除素材文件"""
-    materials_dir = settings.UPLOAD_DIR / "materials"
-    
-    # 查找匹配的文件（ID 是文件名不含扩展名）
-    found = None
-    for f in materials_dir.glob("*"):
-        if f.stem == material_id:
-            found = f
-            break
-    
-    if not found:
-        raise HTTPException(404, "Material not found")
-    
+async def list_materials(current_user: dict = Depends(get_current_user)):
+    user_id = current_user["id"]
    try:
-        found.unlink()
+        # List only files under the current user's directory
+        files_obj = await storage_service.list_files(
+            bucket=storage_service.BUCKET_MATERIALS,
+            path=user_id
+        )
+        materials = []
+        for f in files_obj:
+            name = f.get('name')
+            if not name or name == '.emptyFolderPlaceholder':
+                continue
+            display_name = name
+            if '_' in name:
+                parts = name.split('_', 1)
+                if parts[0].isdigit():
+                    display_name = parts[1]
+            # The full path includes user_id
+            full_path = f"{user_id}/{name}"
+            signed_url = await storage_service.get_signed_url(
+                bucket=storage_service.BUCKET_MATERIALS,
+                path=full_path
+            )
+            # metadata can be null for placeholder entries, so fall back to an empty dict
+            metadata = f.get('metadata') or {}
+            size = metadata.get('size', 0)
+            # created_at is a top-level field holding an ISO 8601 string
+            created_at_str = f.get('created_at', '')
+            created_at = 0
+            if created_at_str:
+                from datetime import datetime
+                try:
+                    dt = datetime.fromisoformat(created_at_str.replace('Z', '+00:00'))
+                    created_at = int(dt.timestamp())
+                except ValueError:
+                    pass
+            materials.append({
+                "id": full_path,  # the ID is the full storage path
+                "name": display_name,
+                "path": signed_url,
+                "size_mb": size / (1024 * 1024),
+                "type": "video",
+                "created_at": created_at
+            })
+        materials.sort(key=lambda x: x['id'], reverse=True)
+        return {"materials": materials}
+    except Exception as e:
+        logger.error(f"List materials failed: {e}")
+        return {"materials": []}
+
+
+@router.delete("/{material_id:path}")
+async def delete_material(material_id: str, current_user: dict = Depends(get_current_user)):
+    user_id = current_user["id"]
+    # Verify that material_id belongs to the current user
+    if not material_id.startswith(f"{user_id}/"):
+        raise HTTPException(403, "无权删除此素材")
+    try:
+        await storage_service.delete_file(
+            bucket=storage_service.BUCKET_MATERIALS,
+            path=material_id
+        )
        return {"success": True, "message": "素材已删除"}
    except Exception as e:
        raise HTTPException(500, f"删除失败: {str(e)}")

+
+
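The offset arithmetic in `process_and_upload` can be checked in isolation. Below is a minimal in-memory sketch, assuming a body that contains exactly one file part, as the code above does. `find_multipart_payload` is a hypothetical helper and not part of the commit. It locates the payload between the end of the part headers and the CRLF that precedes the closing boundary.

```python
from typing import Tuple


def find_multipart_payload(body: bytes) -> Tuple[int, int]:
    """Return (start, end) byte offsets of the single file part in a multipart body."""
    first_line_end = body.find(b"\r\n")
    if first_line_end == -1:
        raise ValueError("no boundary line found")
    boundary = body[:first_line_end]  # e.g. b"--boundary123"

    # Part headers end at the first blank line
    header_end = body.find(b"\r\n\r\n")
    if header_end == -1:
        raise ValueError("no end of part headers found")
    start = header_end + 4

    # The payload ends at the CRLF that precedes the closing delimiter
    closing = body.rfind(b"\r\n" + boundary, start)
    end = closing if closing != -1 else len(body)
    return start, end


body = (
    b"--boundary123\r\n"
    b'Content-Disposition: form-data; name="file"; filename="clip.mp4"\r\n'
    b"Content-Type: video/mp4\r\n"
    b"\r\n"
    b"VIDEO-BYTES"
    b"\r\n--boundary123--\r\n"
)
start, end = find_multipart_payload(body)
payload = body[start:end]
```

The sketch searches for `\r\n` plus the boundary as one unit, so no separate CRLF correction is needed. The on-disk version above reaches the same offsets by reading only the first 4 KB and the last 200 bytes of the file.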
--- a/backend/app/api/ref_audios.py
+++ b/backend/app/api/ref_audios.py
@@ -0,0 +1,411 @@
+"""
+Reference audio management API
+Supports uploading, listing, and deleting reference audio for Qwen3-TTS voice cloning
+"""
+from fastapi import APIRouter, UploadFile, File, Form, HTTPException, Depends
+from pydantic import BaseModel
+from typing import List, Optional
+from pathlib import Path
+from loguru import logger
+import time
+import json
+import subprocess
+import tempfile
+import os
+import re
+
+from app.core.deps import get_current_user
+from app.services.storage import storage_service
+
+router = APIRouter()
+
+# Supported audio formats
+ALLOWED_AUDIO_EXTENSIONS = {'.wav', '.mp3', '.m4a', '.webm', '.ogg', '.flac', '.aac'}
+
+# Storage bucket for reference audio
+BUCKET_REF_AUDIOS = "ref-audios"
+
+
+class RefAudioResponse(BaseModel):
+    id: str
+    name: str
+    path: str  # signed URL for playback
+    ref_text: str
+    duration_sec: float
+    created_at: int
+
+
+class RefAudioListResponse(BaseModel):
+    items: List[RefAudioResponse]
+
+
+def sanitize_filename(filename: str) -> str:
+    """清理文件名，移除特殊字符"""
+    safe_name = re.sub(r'[<>:"/\\|?*\s]', '_', filename)
+    if len(safe_name) > 50:
+        ext = Path(safe_name).suffix
+        safe_name = safe_name[:50 - len(ext)] + ext
+    return safe_name
+
+
+def get_audio_duration(file_path: str) -> float:
+    """获取音频时长 (秒)"""
+    try:
+        result = subprocess.run(
+            ['ffprobe', '-v', 'quiet', '-show_entries', 'format=duration',
+             '-of', 'csv=p=0', file_path],
+            capture_output=True, text=True, timeout=10
+        )
+        return float(result.stdout.strip())
+    except Exception as e:
+        logger.warning(f"获取音频时长失败: {e}")
+        return 0.0
+
+
+def convert_to_wav(input_path: str, output_path: str) -> bool:
+    """Convert audio to WAV (16 kHz, mono)."""
+    try:
+        subprocess.run([
+            'ffmpeg', '-y', '-i', input_path,
+            '-ar', '16000',  # 16 kHz sample rate
+            '-ac', '1',      # mono
+            '-acodec', 'pcm_s16le',  # 16-bit PCM
+            output_path
+        ], capture_output=True, timeout=60, check=True)
+        return True
+    except Exception as e:
+        logger.error(f"音频转换失败: {e}")
+        return False
+
+
+@router.post("", response_model=RefAudioResponse)
+async def upload_ref_audio(
+    file: UploadFile = File(...),
+    ref_text: str = Form(...),
+    user: dict = Depends(get_current_user)
+):
+    """
+    上传参考音频
+
+    - file: 音频文件 (支持 wav, mp3, m4a, webm 等)
+    - ref_text: 参考音频的转写文字 (必填)
+    """
+    user_id = user["id"]
+
+    # 验证文件扩展名
+    ext = Path(file.filename).suffix.lower()
+    if ext not in ALLOWED_AUDIO_EXTENSIONS:
+        raise HTTPException(
+            status_code=400,
+            detail=f"不支持的音频格式: {ext}。支持的格式: {', '.join(ALLOWED_AUDIO_EXTENSIONS)}"
+        )
+
+    # 验证 ref_text
+    if not ref_text or len(ref_text.strip()) < 2:
+        raise HTTPException(status_code=400, detail="参考文字不能为空")
+
+    try:
+        # 创建临时文件
+        with tempfile.NamedTemporaryFile(delete=False, suffix=ext) as tmp_input:
+            content = await file.read()
+            tmp_input.write(content)
+            tmp_input_path = tmp_input.name
+
+        # Convert to standard WAV (16 kHz, mono); WAV uploads are normalized too
+        tmp_wav_path = tmp_input_path + ".wav"
+        if not convert_to_wav(tmp_input_path, tmp_wav_path):
+            raise HTTPException(status_code=500, detail="音频格式转换失败")
+
+        # 获取音频时长
+        duration = get_audio_duration(tmp_wav_path)
+        if duration < 1.0:
+            raise HTTPException(status_code=400, detail="音频时长过短，至少需要 1 秒")
+        if duration > 60.0:
+            raise HTTPException(status_code=400, detail="音频时长过长，最多 60 秒")
+
+
+        # Duplicate display names
+        # There is no database table for metadata, so reading every existing display name
+        # would require downloading each JSON sidecar. As a lightweight approximation, count
+        # the stored files named "{timestamp}_{safe_name}.wav" for the same sanitized stem
+        # and append "(n)" to the display name when any exist. Storage paths stay unique
+        # because of the timestamp prefix.
+        original_name = file.filename
+        safe_name = sanitize_filename(Path(original_name).stem)
+        existing_files = await storage_service.list_files(BUCKET_REF_AUDIOS, user_id)
+
+        dup_pattern = re.compile(rf"^\d+_{re.escape(safe_name)}\.wav$")
+        dup_count = sum(1 for f in existing_files if dup_pattern.match(f.get('name', '')))
+
+        final_display_name = original_name
+        if dup_count > 0:
+            name_stem = Path(original_name).stem
+            name_ext = Path(original_name).suffix
+            final_display_name = f"{name_stem}({dup_count}){name_ext}"
+            
+        # 生成存储路径 (唯一ID)
+        timestamp = int(time.time())
+        safe_name = sanitize_filename(Path(file.filename).stem)
+        storage_path = f"{user_id}/{timestamp}_{safe_name}.wav"
+
+        # 上传 WAV 文件到 Supabase
+        with open(tmp_wav_path, 'rb') as f:
+            wav_data = f.read()
+
+        await storage_service.upload_file(
+            bucket=BUCKET_REF_AUDIOS,
+            path=storage_path,
+            file_data=wav_data,
+            content_type="audio/wav"
+        )
+
+        # 上传元数据 JSON
+        metadata = {
+            "ref_text": ref_text.strip(),
+            "original_filename": final_display_name, # 这里的名字如果有重复会自动加(1)
+            "duration_sec": duration,
+            "created_at": timestamp
+        }
+        metadata_path = f"{user_id}/{timestamp}_{safe_name}.json"
+        await storage_service.upload_file(
+            bucket=BUCKET_REF_AUDIOS,
+            path=metadata_path,
+            file_data=json.dumps(metadata, ensure_ascii=False).encode('utf-8'),
+            content_type="application/json"
+        )
+
+        # 获取签名 URL
+        signed_url = await storage_service.get_signed_url(BUCKET_REF_AUDIOS, storage_path)
+
+        # 清理临时文件
+        os.unlink(tmp_input_path)
+        if os.path.exists(tmp_wav_path):
+            os.unlink(tmp_wav_path)
+
+        return RefAudioResponse(
+            id=storage_path,
+            name=final_display_name,
+            path=signed_url,
+            ref_text=ref_text.strip(),
+            duration_sec=duration,
+            created_at=timestamp
+        )
+
+    except HTTPException:
+        raise
+    except Exception as e:
+        logger.error(f"上传参考音频失败: {e}")
+        raise HTTPException(status_code=500, detail=f"上传失败: {str(e)}")
+
+
+@router.get("", response_model=RefAudioListResponse)
+async def list_ref_audios(user: dict = Depends(get_current_user)):
+    """列出当前用户的所有参考音频"""
+    user_id = user["id"]
+
+    try:
+        # 列出用户目录下的文件
+        files = await storage_service.list_files(BUCKET_REF_AUDIOS, user_id)
+
+        # 过滤出 .wav 文件并获取对应的 metadata
+        items = []
+        for f in files:
+            name = f.get("name", "")
+            if not name.endswith(".wav"):
+                continue
+
+            storage_path = f"{user_id}/{name}"
+
+            # Try to read the metadata sidecar (same name with a .json extension)
+            metadata_name = re.sub(r'\.wav$', '.json', name)
+            metadata_path = f"{user_id}/{metadata_name}"
+
+            ref_text = ""
+            duration_sec = 0.0
+            created_at = 0
+            original_filename = ""
+
+            try:
+                # 获取 metadata 内容
+                metadata_url = await storage_service.get_signed_url(BUCKET_REF_AUDIOS, metadata_path)
+                import httpx
+                async with httpx.AsyncClient() as client:
+                    resp = await client.get(metadata_url)
+                    if resp.status_code == 200:
+                        metadata = resp.json()
+                        ref_text = metadata.get("ref_text", "")
+                        duration_sec = metadata.get("duration_sec", 0.0)
+                        created_at = metadata.get("created_at", 0)
+                        original_filename = metadata.get("original_filename", "")
+            except Exception as e:
+                logger.warning(f"读取 metadata 失败: {e}")
+                # 从文件名提取时间戳
+                try:
+                    created_at = int(name.split("_")[0])
+                except:
+                    pass
+
+            # 获取音频签名 URL
+            signed_url = await storage_service.get_signed_url(BUCKET_REF_AUDIOS, storage_path)
+
+            # 优先显示原始文件名 (去掉时间戳前缀)
+            display_name = original_filename if original_filename else name
+            # 如果原始文件名丢失，尝试从现有文件名中通过正则去掉时间戳
+            if not display_name or display_name == name:
+                 # 匹配 "1234567890_filename.wav"
+                 match = re.match(r'^\d+_(.+)$', name)
+                 if match:
+                     display_name = match.group(1)
+
+            items.append(RefAudioResponse(
+                id=storage_path,
+                name=display_name,
+                path=signed_url,
+                ref_text=ref_text,
+                duration_sec=duration_sec,
+                created_at=created_at
+            ))
+
+        # 按创建时间倒序排列
+        items.sort(key=lambda x: x.created_at, reverse=True)
+
+        return RefAudioListResponse(items=items)
+
+    except Exception as e:
+        logger.error(f"列出参考音频失败: {e}")
+        raise HTTPException(status_code=500, detail=f"获取列表失败: {str(e)}")
+
+
+@router.delete("/{audio_id:path}")
+async def delete_ref_audio(audio_id: str, user: dict = Depends(get_current_user)):
+    """删除参考音频"""
+    user_id = user["id"]
+
+    # 安全检查：确保只能删除自己的文件
+    if not audio_id.startswith(f"{user_id}/"):
+        raise HTTPException(status_code=403, detail="无权删除此文件")
+
+    try:
+        # 删除 WAV 文件
+        await storage_service.delete_file(BUCKET_REF_AUDIOS, audio_id)
+
+        # Delete the metadata JSON (swap only the trailing .wav extension)
+        metadata_path = re.sub(r'\.wav$', '.json', audio_id)
+        try:
+            await storage_service.delete_file(BUCKET_REF_AUDIOS, metadata_path)
+        except Exception:
+            pass  # the metadata file may not exist
+
+        return {"success": True, "message": "删除成功"}
+
+    except Exception as e:
+        logger.error(f"删除参考音频失败: {e}")
+        raise HTTPException(status_code=500, detail=f"删除失败: {str(e)}")
+
+
+class RenameRequest(BaseModel):
+    new_name: str
+
+
+@router.put("/{audio_id:path}")
+async def rename_ref_audio(
+    audio_id: str,
+    request: RenameRequest,
+    user: dict = Depends(get_current_user)
+):
+    """重命名参考音频 (修改 metadata 中的 display name)"""
+    user_id = user["id"]
+
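+    # Security check: users may only modify their own files
+    if not audio_id.startswith(f"{user_id}/"):
+        raise HTTPException(status_code=403, detail="无权修改此文件")
+    # Only .wav objects have a JSON sidecar; otherwise the metadata path would equal the audio path
+    if not audio_id.endswith(".wav"):
+        raise HTTPException(status_code=400, detail="无效的音频 ID")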
+    
+    new_name = request.new_name.strip()
+    if not new_name:
+         raise HTTPException(status_code=400, detail="新名称不能为空")
+         
+    # 确保新名称有后缀 (保留原后缀或添加 .wav)
+    if not Path(new_name).suffix:
+        new_name += ".wav"
+
+    try:
+        # 1. Download the existing metadata
+        metadata_path = re.sub(r'\.wav$', '.json', audio_id)
+        try:
+             # 获取已有的 JSON
+             import httpx
+             metadata_url = await storage_service.get_signed_url(BUCKET_REF_AUDIOS, metadata_path)
+             if not metadata_url:
+                  # 如果 json 不存在，则需要新建一个基础的
+                  raise Exception("Metadata not found")
+             
+             async with httpx.AsyncClient() as client:
+                resp = await client.get(metadata_url)
+                if resp.status_code == 200:
+                    metadata = resp.json()
+                else:
+                    raise Exception(f"Failed to fetch metadata: {resp.status_code}")
+                    
+        except Exception as e:
+            logger.warning(f"无法读取元数据: {e}, 将创建新的元数据")
+            # 兜底：如果读取失败，构建最小元数据
+            metadata = {
+                "ref_text": "", # 可能丢失
+                "duration_sec": 0.0,
+                "created_at": int(time.time()),
+                "original_filename": new_name
+            }
+
+        # 2. 更新 original_filename
+        metadata["original_filename"] = new_name
+        
+        # 3. 覆盖上传 metadata
+        await storage_service.upload_file(
+            bucket=BUCKET_REF_AUDIOS,
+            path=metadata_path,
+            file_data=json.dumps(metadata, ensure_ascii=False).encode('utf-8'),
+            content_type="application/json"
+        )
+        
+        return {"success": True, "name": new_name}
+
+    except Exception as e:
+        logger.error(f"重命名失败: {e}")
+        raise HTTPException(status_code=500, detail=f"重命名失败: {str(e)}")
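The naming conventions used by the reference-audio endpoints (a `{timestamp}_{name}.wav` object with a `.json` sidecar, and a `(n)` suffix for repeated display names) can be sketched as three small pure functions. The helper names below are hypothetical and not part of the commit. The sketch also skips `sanitize_filename`, so it assumes the uploaded stem needs no sanitizing.

```python
import re
from pathlib import Path
from typing import Iterable


def display_name_for(stored_name: str) -> str:
    """Strip the leading '{timestamp}_' prefix from a stored object name."""
    match = re.match(r"^\d+_(.+)$", stored_name)
    return match.group(1) if match else stored_name


def metadata_path_for(audio_path: str) -> str:
    """Swap only the trailing .wav extension for .json."""
    return re.sub(r"\.wav$", ".json", audio_path)


def deduplicated_name(original_name: str, stored_names: Iterable[str]) -> str:
    """Append '(n)' when n stored .wav objects already use the same stem."""
    stem, ext = Path(original_name).stem, Path(original_name).suffix
    pattern = re.compile(rf"^\d+_{re.escape(stem)}\.wav$")
    dup_count = sum(1 for name in stored_names if pattern.match(name))
    return f"{stem}({dup_count}){ext}" if dup_count else original_name


stored = ["1700000000_voice.wav", "1700000000_voice.json", "1700000001_my_voice.wav"]
shown = display_name_for("1700000000_voice.wav")
sidecar = metadata_path_for("u1/1700000000_voice.wav")
renamed = deduplicated_name("voice.mp3", stored)
```

Anchoring the pattern at both ends is a deliberate choice: a plain suffix check would also count `1700000001_my_voice.wav` as a duplicate of `voice`.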
--- a/backend/app/api/tools.py
+++ b/backend/app/api/tools.py
@@ -0,0 +1,398 @@
+from fastapi import APIRouter, UploadFile, File, Form, HTTPException
+from typing import Optional
+import asyncio
+import shutil
+import os
+import time
+from pathlib import Path
+from loguru import logger
+import traceback
+import re
+import json
+import requests
+from urllib.parse import unquote
+
+from app.services.whisper_service import whisper_service
+from app.services.glm_service import glm_service
+
+router = APIRouter()
+
+@router.post("/extract-script")
+async def extract_script_tool(
+    file: Optional[UploadFile] = File(None),
+    url: Optional[str] = Form(None),
+    rewrite: bool = Form(True)
+):
+    """
+    Standalone script extraction tool
+    Accepts an uploaded video/audio file OR a video link -> extracts the text -> (optional) AI rewrite
+    """
+    if not file and not url:
+        raise HTTPException(400, "必须提供文件或视频链接")
+
+    temp_path = None
+    try:
+        timestamp = int(time.time())
+        temp_dir = Path("/tmp")
+        if os.name == 'nt':
+            temp_dir = Path("d:/tmp")
+        temp_dir.mkdir(parents=True, exist_ok=True)
+
+        # 1. 获取/保存文件
+        loop = asyncio.get_event_loop()
+        
+        if file:
+            safe_filename = Path(file.filename).name.replace(" ", "_")
+            temp_path = temp_dir / f"tool_extract_{timestamp}_{safe_filename}"
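+            # Run the blocking file copy in the thread pool; the context manager closes
+            # (and flushes) the destination before ffmpeg reads it
+            def _save_upload():
+                with open(temp_path, "wb") as dst:
+                    shutil.copyfileobj(file.file, dst)
+            await loop.run_in_executor(None, _save_upload)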
+            logger.info(f"Tool processing upload file: {temp_path}")
+        else:
+            # URL 下载逻辑
+            # 自动提取文案中的链接 (支持 Douyin/Bilibili 等分享文案)
+            url_match = re.search(r'https?://[^\s]+', url)
+            if url_match:
+                extracted_url = url_match.group(0)
+                logger.info(f"Extracted URL from text: {extracted_url}")
+                url = extracted_url
+            
+            logger.info(f"Tool downloading URL: {url}")
+            
+            # 封装 yt-dlp 下载函数 (Blocking)
+            def _download_yt_dlp():
+                import yt_dlp
+                logger.info("Attempting download with yt-dlp...")
+                
+                ydl_opts = {
+                    'format': 'bestaudio/best',
+                    'outtmpl': str(temp_dir / f"tool_download_{timestamp}_%(id)s.%(ext)s"),
+                    'quiet': True,
+                    'no_warnings': True,
+                    'http_headers': {
+                         'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.124 Safari/537.36',
+                         'Referer': 'https://www.douyin.com/',
+                    }
+                }
+                
+                with yt_dlp.YoutubeDL(ydl_opts) as ydl:
+                    info = ydl.extract_info(url, download=True)
+                    if 'requested_downloads' in info:
+                        downloaded_file = info['requested_downloads'][0]['filepath']
+                    else:
+                        ext = info.get('ext', 'mp4')
+                        video_id = info.get('id')  # avoid shadowing the built-in id()
+                        downloaded_file = str(temp_dir / f"tool_download_{timestamp}_{video_id}.{ext}")
+                    
+                    return Path(downloaded_file)
+
+            # 先尝试 yt-dlp (Run in Executor)
+            try:
+                temp_path = await loop.run_in_executor(None, _download_yt_dlp)
+                logger.info(f"yt-dlp downloaded to: {temp_path}")
+
+            except Exception as e:
+                logger.warning(f"yt-dlp download failed: {e}. Trying manual Douyin fallback...")
+                
+                # 失败则尝试手动解析 (Douyin Fallback)
+                if "douyin" in url:
+                    manual_path = await download_douyin_manual(url, temp_dir, timestamp)
+                    if manual_path:
+                        temp_path = manual_path
+                        logger.info(f"Manual Douyin fallback successful: {temp_path}")
+                    else:
+                         raise HTTPException(400, f"视频下载失败。yt-dlp 报错: {str(e)}")
+                elif "bilibili" in url:
+                    manual_path = await download_bilibili_manual(url, temp_dir, timestamp)
+                    if manual_path:
+                        temp_path = manual_path
+                        logger.info(f"Manual Bilibili fallback successful: {temp_path}")
+                    else:
+                         raise HTTPException(400, f"视频下载失败。yt-dlp 报错: {str(e)}")
+                else:
+                    raise HTTPException(400, f"视频下载失败: {str(e)}")
+
+        if not temp_path or not temp_path.exists():
+            raise HTTPException(400, "文件获取失败")
+        
+        # 1.5 Safe conversion: force-convert to WAV (16 kHz)
+        import subprocess
+        audio_path = temp_dir / f"extract_audio_{timestamp}.wav"
+        
+        def _convert_audio():
+            try:
+                convert_cmd = [
+                    'ffmpeg',
+                    '-i', str(temp_path),
+                    '-vn', # drop the video stream
+                    '-acodec', 'pcm_s16le',
+                    '-ar', '16000', # sample rate recommended for Whisper
+                    '-ac', '1',    # mono
+                    '-y',          # overwrite existing output
+                    str(audio_path)
+                ]
+                # Capture stderr so a failure can be logged
+                subprocess.run(convert_cmd, check=True, stdout=subprocess.PIPE, stderr=subprocess.PIPE)
+                return True
+            except subprocess.CalledProcessError as e:
+                error_log = e.stderr.decode('utf-8', errors='ignore') if e.stderr else str(e)
+                logger.error(f"FFmpeg check/convert failed: {error_log}")
+                # Check whether the downloaded file is actually an HTML page
+                try:
+                    with open(temp_path, 'rb') as f:
+                        head = f.read(100)
+                except OSError:
+                    head = b""
+                if b'<!DOCTYPE html' in head or b'<html' in head:
+                    raise ValueError("HTML_DETECTED")
+                raise ValueError("CONVERT_FAILED")
+
+        # Run the conversion (in an executor)
+        try:
+            await loop.run_in_executor(None, _convert_audio)
+            logger.info(f"Converted to WAV: {audio_path}")
+            target_path = audio_path
+        except ValueError as ve:
+            if str(ve) == "HTML_DETECTED":
+                raise HTTPException(400, "下载的文件是网页而非视频，请重试或手动上传。")
+            else:
+                raise HTTPException(400, "下载的文件已损坏或格式无法识别。")
+            
+        # 2. Transcribe the script (Whisper)
+        script = await whisper_service.transcribe(str(target_path))
+        
+        # 3. AI rewrite (GLM)
+        rewritten = None
+        if rewrite:
+            if script and len(script.strip()) > 0:
+                logger.info("Rewriting script...")
+                rewritten = await glm_service.rewrite_script(script)
+            else:
+                logger.warning("No script extracted, skipping rewrite")
+
+        return {
+            "success": True,
+            "original_script": script,
+            "rewritten_script": rewritten
+        }
+
+    except HTTPException:
+        raise
+    except Exception as e:
+        logger.error(f"Tool extract failed: {e}")
+        logger.error(traceback.format_exc())
+        
+        # Friendly error message
+        msg = str(e)
+        if "Fresh cookies" in msg:
+            msg = "下载失败：目标平台开启了反爬验证，请过段时间重试或直接上传视频文件。"
+            
+        raise HTTPException(500, f"提取失败: {msg}")
+    finally:
+        # Clean up the temporary download (the converted WAV at audio_path is not removed here)
+        if temp_path and temp_path.exists():
+            try:
+                os.remove(temp_path)
+                logger.info(f"Cleaned up temp file: {temp_path}")
+            except Exception as e:
+                logger.warning(f"Failed to cleanup temp file {temp_path}: {e}")
+
+
+async def download_douyin_manual(url: str, temp_dir: Path, timestamp: int) -> Optional[Path]:
+    """
+    Manually download a Douyin video (fallback logic, ported from SuperIPAgent/douyinDownloader).
+    Uses a specific user profile URL and a hardcoded cookie to get past anti-scraping checks.
+    """
+    logger.info(f"[SuperIPAgent] Starting download for: {url}")
+    
+    try:
+        # 1. Extract the modal ID (short-link redirects are followed)
+        headers = {
+            "user-agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/131.0.0.0 Safari/537.36"
+        }
+        
+        # Follow redirects in case this is a short link
+        resp = requests.get(url, headers=headers, allow_redirects=True, timeout=10)
+        final_url = resp.url
+        logger.info(f"[SuperIPAgent] Final URL: {final_url}")
+        
+        modal_id = None
+        match = re.search(r'/video/(\d+)', final_url)
+        if match:
+            modal_id = match.group(1)
+        
+        if not modal_id:
+            logger.error("[SuperIPAgent] Could not extract modal_id")
+            return None
+
+        logger.info(f"[SuperIPAgent] Extracted modal_id: {modal_id}")
+        
+        # 2. Build the specific request URL (copied from SuperIPAgent)
+        # Uses a specific user's profile page plus the modal_id parameter, together with a specific cookie
+        target_url = f"https://www.douyin.com/user/MS4wLjABAAAAN_s_hups7LD0N4qnrM3o2gI0vuG3pozNaEolz2_py3cHTTrpVr1Z4dukFD9SOlwY?from_tab_name=main&modal_id={modal_id}"
+        
+        # 3. Use the hardcoded cookie (copied from SuperIPAgent)
+        headers_with_cookie = {
+            "accept": "text/html,application/xhtml+xml,application/xml;q=0.9,image/avif,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.7",
+            "cookie": "douyin.com; device_web_cpu_core=10; device_web_memory_size=8; __ac_nonce=06760391f00b9b51264ae; __ac_signature=_02B4Z6wo00f019a5ceAAAIDAhEZR-X3jjWfWmXVAAJLXd4; ttwid=1%7C7MTKBSMsP4eOv9h5NAh8p0E-NYIud09ftNmB0mjLpWc%7C1734359327%7C8794abeabbd47447e1f56e5abc726be089f2a0344d6343b5f75f23e7b0f0028f; UIFID_TEMP=0de8750d2b188f4235dbfd208e44abbb976428f0720eb983255afefa45d39c0c6532e1d4768dd8587bf919f866ff1396912bcb2af71efee56a14a2a9f37b74010d0a0413795262f6d4afe02a032ac7ab; s_v_web_id=verify_m4r4ribr_c7krmY1z_WoeI_43po_ATpO_I4o8U1bex2D7; hevc_supported=true; home_can_add_dy_2_desktop=%220%22; dy_swidth=2560; dy_sheight=1440; stream_recommend_feed_params=%22%7B%5C%22cookie_enabled%5C%22%3Atrue%2C%5C%22screen_width%5C%22%3A2560%2C%5C%22screen_height%5C%22%3A1440%2C%5C%22browser_online%5C%22%3Atrue%2C%5C%22cpu_core_num%5C%22%3A10%2C%5C%22device_memory%5C%22%3A8%2C%5C%22downlink%5C%22%3A10%2C%5C%22effective_type%5C%22%3A%5C%224g%5C%22%2C%5C%22round_trip_time%5C%22%3A50%7D%22; strategyABtestKey=%221734359328.577%22; csrf_session_id=2f53aed9aa6974e83aa9a1014180c3a4; fpk1=U2FsdGVkX1/IpBh0qdmlKAVhGyYHgur4/VtL9AReZoeSxadXn4juKvsakahRGqjxOPytHWspYoBogyhS/V6QSw==; fpk2=0845b309c7b9b957afd9ecf775a4c21f; passport_csrf_token=d80e0c5b2fa2328219856be5ba7e671e; passport_csrf_token_default=d80e0c5b2fa2328219856be5ba7e671e; odin_tt=3c891091d2eb0f4718c1d5645bc4a0017032d4d5aa989decb729e9da2ad570918cbe5e9133dc6b145fa8c758de98efe32ff1f81aa0d611e838cc73ab08ef7d3f6adf66ab4d10e8372ddd628f94f16b8e; volume_info=%7B%22isUserMute%22%3Afalse%2C%22isMute%22%3Afalse%2C%22volume%22%3A0.5%7D; bd_ticket_guard_client_web_domain=2; FORCE_LOGIN=%7B%22videoConsumedRemainSeconds%22%3A180%7D; 
UIFID=0de8750d2b188f4235dbfd208e44abbb976428f0720eb983255afefa45d39c0c6532e1d4768dd8587bf919f866ff139655a3c2b735923234f371c699560c657923fd3d6c5b63ab7bb9b83423b6cb4787e2ce66a7fbc4ecb24c8570f520fe6de068bbb95115023c0c6c1b6ee31b49fb7e3996fb8349f43a3fd8b7a61cd9e18e8fe65eb6a7c13de4c0960d84e344b644725db3eb2fa6b7caf821de1b50527979f2; is_dash_user=1; biz_trace_id=b57a241f; bd_ticket_guard_client_data=eyJiZC10aWNrZXQtZ3VhcmQtdmVyc2lvbiI6MiwiYmQtdGlja2V0LWd1YXJkLWl0ZXJhdGlvbi12ZXJzaW9uIjoxLCJiZC10aWNrZXQtZ3VhcmQtcmVlLXB1YmxpYy1rZXkiOiJCTEo2R0lDalVoWW1XcHpGOFdrN0Vrc0dXcCtaUzNKY1g4NGNGY2k0TTl1TEowNjdUb21mbFU5aDdvWVBGamhNRWNRQWtKdnN1MnM3RmpTWnlJQXpHMjA9IiwiYmQtdGlja2V0LWd1YXJkLXdlYi12ZXJzaW9uIjoyfQ%3D%3D; download_guide=%221%2F20241216%2F0%22; sdk_source_info=7e276470716a68645a606960273f276364697660272927676c715a6d6069756077273f276364697660272927666d776a68605a607d71606b766c6a6b5a7666776c7571273f275e58272927666a6b766a69605a696c6061273f27636469766027292762696a6764695a7364776c6467696076273f275e5827292771273f273d33323131333c3036313632342778; bit_env=RiOY4jzzpxZoVCl6zdVSVhVRjdwHRTxqcqWdqMBZLPGjMdB4Tax1kAELHNTVAAh72KuhumewE4Lq6f0-VJ2UpJrkrhSxoPw9LUb3zQrq1OSwbeSPHkRlRgRQvO89sItdGUyq1oFr0XyRCnMYG87KSeWyc4x0czGR0o50hTDoDLG5rJVoRcdQOLvjiAegsqyytKF59sPX_QM9qffK2SqYsg0hCggURc_AI6kguDDE5DvG0bnyz1utw4z1eEnIoLrkGDqzqBZj4dOAr0BVU6ofbsS-pOQ2u2PM1dLP9FlBVBlVaqYVgHJeSLsR5k76BRTddUjTb4zEilVIEwAMJWGN4I1BxVt6fC9B5tBQpuT0lj3n3eKXCKXZsd8FrEs5_pbfDsxV-e_WMiXI2ff4qxiTC0U73sfo9OpicKICtZjdq8qsHxJuu6wVR36zvXeL2Wch5C6MzprNvkivv0l8nbh2mSgy1nabZr3dmU6NcR-Bg3Q3xTWUlR9aAUmpopC-cNuXjgLpT-Lw1AYGilSUnCvosth1Gfypq-b0MpgmdSDgTrQ%3D; gulu_source_res=eyJwX2luIjoiMDhjOGQ3ZTJiODQyNjZkZWI5Y2VkMGJiODNlNmY1ZWY0ZjMyNTE2ZmYyZjAzNDMzZjI0OWU1Y2Q1NTczNTk5NyJ9; passport_auth_mix_state=hp9bc3dgb1tm5wd8p82zawus27g0e3ue; IsDouyinActive=false",
+            "user-agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/131.0.0.0 Safari/537.36",
+        }
+        
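+        logger.info("[SuperIPAgent] Requesting page with Cookie...")
+        # Original note: verify=False is required or some environments raise errors; it is not passed here, so TLS verification stays on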
+        response = requests.get(target_url, headers=headers_with_cookie, timeout=10)
+        
+        # 4. Parse RENDER_DATA
+        content_match = re.findall(r'<script id="RENDER_DATA" type="application/json">(.*?)</script>', response.text)
+        if not content_match:
+            # Decode first and search again? Or the page structure may have changed
+            # Fall back to SSR_HYDRATED_DATA
+            if "SSR_HYDRATED_DATA" in response.text:
+                content_match = re.findall(r'<script id="SSR_HYDRATED_DATA" type="application/json">(.*?)</script>', response.text)
+        
+        if not content_match:
+            logger.error(f"[SuperIPAgent] Could not find RENDER_DATA in page (len={len(response.text)})")
+            return None
+
+        content = unquote(content_match[0])
+        try:
+            data = json.loads(content)
+        except json.JSONDecodeError:
+            logger.error("[SuperIPAgent] JSON decode failed")
+            return None
+            
+        # 5. Extract the video stream URL
+        video_url = None
+        try:
+            # Usual path: app -> videoDetail -> video -> bitRateList -> playAddr -> src
+            if "app" in data and "videoDetail" in data["app"]:
+                info = data["app"]["videoDetail"]["video"]
+                if "bitRateList" in info and info["bitRateList"]:
+                    video_url = info["bitRateList"][0]["playAddr"][0]["src"]
+                elif "playAddr" in info and info["playAddr"]:
+                    video_url = info["playAddr"][0]["src"]
+        except Exception as e:
+            logger.error(f"[SuperIPAgent] Path extraction failed: {e}")
+            
+        if not video_url:
+            logger.error("[SuperIPAgent] No video_url found")
+            return None
+            
+        if video_url.startswith("//"):
+            video_url = "https:" + video_url
+            
+        logger.info(f"[SuperIPAgent] Found video URL: {video_url[:50]}...")
+        
+        # 6. Download (with Referer and User-Agent headers)
+        temp_path = temp_dir / f"douyin_manual_{timestamp}.mp4"
+        download_headers = {
+            'Referer': 'https://www.douyin.com/',
+            'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/131.0.0.0 Safari/537.36',
+        }
+        
+        dl_resp = requests.get(video_url, headers=download_headers, stream=True, timeout=60)
+        if dl_resp.status_code == 200:
+            with open(temp_path, 'wb') as f:
+                for chunk in dl_resp.iter_content(chunk_size=1024):
+                    f.write(chunk)
+
+            logger.info(f"[SuperIPAgent] Downloaded successfully: {temp_path}")
+            return temp_path
+        else:
+            logger.error(f"[SuperIPAgent] Download failed: {dl_resp.status_code}")
+            return None
+
+    except Exception as e:
+        logger.error(f"[SuperIPAgent] Logic failed: {e}")
+        return None
+
+async def download_bilibili_manual(url: str, temp_dir: Path, timestamp: int) -> Optional[Path]:
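+    """
+    Manually download a Bilibili video (fallback logic, Playwright version).
+    Bilibili usually serves audio and video as separate streams; only the audio is fetched here, since only the transcript is needed.
+    """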
+    from playwright.async_api import async_playwright
+    
+    logger.info(f"[Playwright] Starting Bilibili download for: {url}")
+    
+    playwright = None
+    browser = None
+    try:
+        playwright = await async_playwright().start()
+        # Launch browser (ensure chromium is installed: playwright install chromium)
+        browser = await playwright.chromium.launch(headless=True, args=['--no-sandbox', '--disable-setuid-sandbox'])
+        
+        # Mobile User Agent often gives single stream?
+        # But Bilibili mobile web is tricky. Desktop is fine.
+        context = await browser.new_context(
+            user_agent="Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36"
+        )
+        
+        page = await context.new_page()
+        
+        # Intercept audio responses?
+        # Bilibili streams are usually .m4s 
+        # But finding the initial state is easier.
+        
+        logger.info("[Playwright] Navigating to Bilibili...")
+        await page.goto(url, timeout=45000)
+        
+        # Wait for video element (triggers loading)
+        try:
+            await page.wait_for_selector('video', timeout=15000)
+        except Exception:
+            logger.warning("[Playwright] Video selector timeout")
+
+        # 1. Try extracting from __playinfo__
+        # window.__playinfo__ contains dash streams
+        playinfo = await page.evaluate("window.__playinfo__")
+        
+        audio_url = None
+        
+        if playinfo and "data" in playinfo and "dash" in playinfo["data"]:
+            dash = playinfo["data"]["dash"]
+            if "audio" in dash and dash["audio"]:
+                audio_url = dash["audio"][0]["baseUrl"]
+                logger.info(f"[Playwright] Found audio stream in __playinfo__: {audio_url[:50]}...")
+        
+        # 2. If playinfo fails, try extracting video src (sometimes it's a blob, which we can't fetch easily without interception)
+        # But interception is complex. Let's try requests with Referer if we have URL.
+        
+        if not audio_url:
+            logger.warning("[Playwright] Could not find audio in __playinfo__")
+            return None
+            
+        # Download the audio stream
+        temp_path = temp_dir / f"bilibili_audio_{timestamp}.m4s" # usually m4s
+        
+        try:
+            api_request = context.request
+            headers = {
+                "User-Agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
+                "Referer": "https://www.bilibili.com/"
+            }
+            
+            logger.info("[Playwright] Downloading audio stream...")
+            response = await api_request.get(audio_url, headers=headers)
+            
+            if response.status == 200:
+                body = await response.body()
+                with open(temp_path, 'wb') as f:
+                    f.write(body)
+                    
+                logger.info(f"[Playwright] Downloaded successfully: {temp_path}")
+                return temp_path
+            else:
+                logger.error(f"[Playwright] API Request failed: {response.status}")
+                return None
+                
+        except Exception as e:
+            logger.error(f"[Playwright] Download logic error: {e}")
+            return None
+
+    except Exception as e:
+        logger.error(f"[Playwright] Bilibili download failed: {e}")
+        return None
+    finally:
+        if browser:
+            await browser.close()
+        if playwright:
+            await playwright.stop()
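
Review note on `_convert_audio` above: the HTML check only runs after ffmpeg has already failed, and it looks for `<!DOCTYPE html` case-sensitively in the first 100 bytes. The sketch below shows one way the sniffing and the ffmpeg argument list could be factored into small, testable helpers. The names `looks_like_html`, `build_wav_command`, and `HTML_MARKERS` are hypothetical and not part of this change; the ffmpeg arguments mirror the ones in the diff. It is an illustration under those assumptions, not the project's implementation.

```python
from pathlib import Path
from typing import List

# Hypothetical marker list; compared against a lowercased file head.
HTML_MARKERS = (b"<!doctype html", b"<html")


def looks_like_html(path: Path, sniff_bytes: int = 512) -> bool:
    """Return True if the first bytes of the file look like an HTML page."""
    try:
        with open(path, "rb") as f:
            head = f.read(sniff_bytes)
    except OSError:
        # Missing or unreadable file: we cannot claim it is HTML.
        return False
    head = head.lstrip().lower()
    return any(marker in head for marker in HTML_MARKERS)


def build_wav_command(src: Path, dst: Path) -> List[str]:
    """Build the ffmpeg arguments for a 16 kHz mono PCM WAV conversion."""
    return [
        "ffmpeg",
        "-i", str(src),
        "-vn",                    # drop the video stream
        "-acodec", "pcm_s16le",   # 16-bit PCM
        "-ar", "16000",           # 16 kHz sample rate
        "-ac", "1",               # mono
        "-y",                     # overwrite existing output
        str(dst),
    ]
```

With helpers like these, the caller could reject an HTML payload before invoking ffmpeg at all, which would keep the "page instead of video" error separate from real decode failures.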
--- a/backend/app/api/videos.py
+++ b/backend/app/api/videos.py
@@ -1,14 +1,28 @@
-from fastapi import APIRouter, HTTPException, BackgroundTasks
+from fastapi import APIRouter, HTTPException, BackgroundTasks, Depends, Request
 from pydantic import BaseModel
 from typing import Optional
 from pathlib import Path
+from loguru import logger
 import uuid
 import traceback
 import time
-from app.services.tts_service import TTSService
-from app.services.video_service import VideoService
-from app.services.lipsync_service import LipSyncService
+import httpx
+import os
+from app.services.tts_service import TTSService
+from app.services.video_service import VideoService
+from app.services.lipsync_service import LipSyncService
+from app.services.voice_clone_service import voice_clone_service
+from app.services.assets_service import (
+    get_style,
+    get_default_style,
+    resolve_bgm_path,
+    prepare_style_for_remotion,
+)
+from app.services.storage import storage_service
+from app.services.whisper_service import whisper_service
+from app.services.remotion_service import remotion_service
 from app.core.config import settings
+from app.core.deps import get_current_user

 router = APIRouter()

@@ -16,6 +30,19 @@ class GenerateRequest(BaseModel):
    text: str
    voice: str = "zh-CN-YunxiNeural"
    material_path: str
+    # Fields added for voice-clone mode
+    tts_mode: str = "edgetts"  # "edgetts" | "voiceclone"
+    ref_audio_id: Optional[str] = None  # storage path of the reference audio
+    ref_text: Optional[str] = None  # transcript of the reference audio
+    # Subtitle and title options
+    title: Optional[str] = None  # video title (shown in the intro)
+    enable_subtitles: bool = True  # enable per-character highlighted subtitles
+    subtitle_style_id: Optional[str] = None  # subtitle style ID
+    title_style_id: Optional[str] = None  # title style ID
+    subtitle_font_size: Optional[int] = None  # subtitle font size (overrides the style)
+    title_font_size: Optional[int] = None  # title font size (overrides the style)
+    bgm_id: Optional[str] = None  # background music ID
+    bgm_volume: Optional[float] = 0.2  # background music volume (0-1)

 tasks = {} # In-memory task store

@@ -37,52 +64,112 @@ async def _check_lipsync_ready(force: bool = False) -> bool:
    
    now = time.time()
    # 5分钟缓存
-    if not force and _lipsync_ready is not None and (now - _lipsync_last_check) < 300:
-        return _lipsync_ready
+    if not force and _lipsync_ready is not None and (now - _lipsync_last_check) < 300:
+        return bool(_lipsync_ready)
    
    lipsync = _get_lipsync_service()
    health = await lipsync.check_health()
    _lipsync_ready = health.get("ready", False)
    _lipsync_last_check = now
    print(f"[LipSync] Health check: ready={_lipsync_ready}")
-    return _lipsync_ready
+    return bool(_lipsync_ready)

-async def _process_video_generation(task_id: str, req: GenerateRequest):
+async def _download_material(path_or_url: str, temp_path: Path):
+    """Download the material to a temp file (streamed to keep memory use low)"""
+    if path_or_url.startswith("http"):
+        # Download from URL
+        timeout = httpx.Timeout(None) # Disable timeout for large files
+        async with httpx.AsyncClient(timeout=timeout) as client:
+            async with client.stream("GET", path_or_url) as resp:
+                resp.raise_for_status()
+                with open(temp_path, "wb") as f:
+                    async for chunk in resp.aiter_bytes():
+                        f.write(chunk)
+    else:
+        # Local file (legacy or absolute path)
+        src = Path(path_or_url)
+        if not src.is_absolute():
+            src = settings.BASE_DIR.parent / path_or_url
+        
+        if src.exists():
+            import shutil
+            shutil.copy(src, temp_path)
+        else:
+            raise FileNotFoundError(f"Material not found: {path_or_url}")
+
+async def _process_video_generation(task_id: str, req: GenerateRequest, user_id: str):
+    temp_files = [] # Track files to clean up
    try:
        start_time = time.time()
-        
-        # Resolve path if it's relative
-        input_material_path = Path(req.material_path)
-        if not input_material_path.is_absolute():
-            input_material_path = settings.BASE_DIR.parent / req.material_path
-            
+
        tasks[task_id]["status"] = "processing"
        tasks[task_id]["progress"] = 5
-        tasks[task_id]["message"] = "正在初始化..."
-        
+        tasks[task_id]["message"] = "正在下载素材..."
+
+        # Prepare temp dir
+        temp_dir = settings.UPLOAD_DIR / "temp"
+        temp_dir.mkdir(parents=True, exist_ok=True)
+
+        # 0. Download Material
+        input_material_path = temp_dir / f"{task_id}_input.mp4"
+        temp_files.append(input_material_path)
+
+        await _download_material(req.material_path, input_material_path)
+
        # 1. TTS - 进度 5% -> 25%
-        tasks[task_id]["message"] = "正在生成语音 (TTS)..."
+        tasks[task_id]["message"] = "正在生成语音..."
        tasks[task_id]["progress"] = 10
-        
-        tts = TTSService()
-        audio_path = settings.OUTPUT_DIR / f"{task_id}_audio.mp3"
-        await tts.generate_audio(req.text, req.voice, str(audio_path))
-        
+
+        audio_path = temp_dir / f"{task_id}_audio.wav"
+        temp_files.append(audio_path)
+
+        if req.tts_mode == "voiceclone":
+            # Voice-clone mode
+            if not req.ref_audio_id or not req.ref_text:
+                raise ValueError("声音克隆模式需要提供参考音频和参考文字")
+
+            tasks[task_id]["message"] = "正在下载参考音频..."
+
+            # Download the reference audio from Supabase
+            ref_audio_local = temp_dir / f"{task_id}_ref.wav"
+            temp_files.append(ref_audio_local)
+
+            ref_audio_url = await storage_service.get_signed_url(
+                bucket="ref-audios",
+                path=req.ref_audio_id
+            )
+            await _download_material(ref_audio_url, ref_audio_local)
+
+            tasks[task_id]["message"] = "正在克隆声音 (Qwen3-TTS)..."
+            await voice_clone_service.generate_audio(
+                text=req.text,
+                ref_audio_path=str(ref_audio_local),
+                ref_text=req.ref_text,
+                output_path=str(audio_path),
+                language="Chinese"
+            )
+        else:
+            # EdgeTTS mode (default)
+            tasks[task_id]["message"] = "正在生成语音 (EdgeTTS)..."
+            tts = TTSService()
+            await tts.generate_audio(req.text, req.voice, str(audio_path))
+
        tts_time = time.time() - start_time
        print(f"[Pipeline] TTS completed in {tts_time:.1f}s")
        tasks[task_id]["progress"] = 25
-        
+
        # 2. LipSync - 进度 25% -> 85%
        tasks[task_id]["message"] = "正在合成唇形 (LatentSync)..."
        tasks[task_id]["progress"] = 30
-        
+
        lipsync = _get_lipsync_service()
-        lipsync_video_path = settings.OUTPUT_DIR / f"{task_id}_lipsync.mp4"
-        
+        lipsync_video_path = temp_dir / f"{task_id}_lipsync.mp4"
+        temp_files.append(lipsync_video_path)
+
        # 使用缓存的健康检查结果
        lipsync_start = time.time()
        is_ready = await _check_lipsync_ready()
-        
+
        if is_ready:
            print(f"[LipSync] Starting LatentSync inference...")
            tasks[task_id]["progress"] = 35
@@ -97,35 +184,195 @@ async def _process_video_generation(task_id: str, req: GenerateRequest):

        lipsync_time = time.time() - lipsync_start
        print(f"[Pipeline] LipSync completed in {lipsync_time:.1f}s")
-        tasks[task_id]["progress"] = 85
-        
-        # 3. Composition - 进度 85% -> 100%
-        tasks[task_id]["message"] = "正在合成最终视频..."
-        tasks[task_id]["progress"] = 90
-        
-        video = VideoService()
-        final_output = settings.OUTPUT_DIR / f"{task_id}_output.mp4"
-        await video.compose(str(lipsync_video_path), str(audio_path), str(final_output))
-        
+        tasks[task_id]["progress"] = 80
+
+        # 3. WhisperX subtitle alignment - progress 80% -> 85%
+        captions_path = None
+        if req.enable_subtitles:
+            tasks[task_id]["message"] = "正在生成字幕 (Whisper)..."
+            tasks[task_id]["progress"] = 82
+
+            captions_path = temp_dir / f"{task_id}_captions.json"
+            temp_files.append(captions_path)
+
+            try:
+                await whisper_service.align(
+                    audio_path=str(audio_path),
+                    text=req.text,
+                    output_path=str(captions_path)
+                )
+                print(f"[Pipeline] Whisper alignment completed")
+            except Exception as e:
+                logger.warning(f"Whisper alignment failed, skipping subtitles: {e}")
+                captions_path = None
+
+        tasks[task_id]["progress"] = 85
+
+        # 3.5 Mix in background music (does not affect lip sync or subtitle alignment)
+        video = VideoService()
+        final_audio_path = audio_path
+        if req.bgm_id:
+            tasks[task_id]["message"] = "正在合成背景音乐..."
+            tasks[task_id]["progress"] = 86
+
+            bgm_path = resolve_bgm_path(req.bgm_id)
+            if bgm_path:
+                mix_output_path = temp_dir / f"{task_id}_audio_mix.wav"
+                temp_files.append(mix_output_path)
+                volume = req.bgm_volume if req.bgm_volume is not None else 0.2
+                volume = max(0.0, min(float(volume), 1.0))
+                try:
+                    video.mix_audio(
+                        voice_path=str(audio_path),
+                        bgm_path=str(bgm_path),
+                        output_path=str(mix_output_path),
+                        bgm_volume=volume
+                    )
+                    final_audio_path = mix_output_path
+                except Exception as e:
+                    logger.warning(f"BGM mix failed, fallback to voice only: {e}")
+            else:
+                logger.warning(f"BGM not found: {req.bgm_id}")
+
+        # 4. Remotion composition (subtitles + title) - progress 85% -> 95%
+        # Use Remotion only when there are subtitles or a title to render
+        use_remotion = bool((captions_path and captions_path.exists()) or req.title)
+
+        subtitle_style = None
+        title_style = None
+        if req.enable_subtitles:
+            subtitle_style = get_style("subtitle", req.subtitle_style_id) or get_default_style("subtitle")
+        if req.title:
+            title_style = get_style("title", req.title_style_id) or get_default_style("title")
+
+        if req.subtitle_font_size and req.enable_subtitles:
+            if subtitle_style is None:
+                subtitle_style = {}
+            subtitle_style["font_size"] = int(req.subtitle_font_size)
+
+        if req.title_font_size and req.title:
+            if title_style is None:
+                title_style = {}
+            title_style["font_size"] = int(req.title_font_size)
+
+        if use_remotion:
+            subtitle_style = prepare_style_for_remotion(
+                subtitle_style,
+                temp_dir,
+                f"{task_id}_subtitle_font"
+            )
+            title_style = prepare_style_for_remotion(
+                title_style,
+                temp_dir,
+                f"{task_id}_title_font"
+            )
+
+        final_output_local_path = temp_dir / f"{task_id}_output.mp4"
+        temp_files.append(final_output_local_path)
+
+        if use_remotion:
+            tasks[task_id]["message"] = "正在合成视频 (Remotion)..."
+            tasks[task_id]["progress"] = 87
+
+            # Mux audio and video with FFmpeg first (Remotion needs a video that already carries audio)
+            composed_video_path = temp_dir / f"{task_id}_composed.mp4"
+            temp_files.append(composed_video_path)
+
+            await video.compose(str(lipsync_video_path), str(final_audio_path), str(composed_video_path))
+
+            # Check whether Remotion is available
+            remotion_health = await remotion_service.check_health()
+            if remotion_health.get("ready"):
+                try:
+                    def on_remotion_progress(percent):
+                        # Map Remotion progress (0-100) onto the 87-95% range
+                        mapped = 87 + int(percent * 0.08)
+                        tasks[task_id]["progress"] = mapped
+
+                    await remotion_service.render(
+                        video_path=str(composed_video_path),
+                        output_path=str(final_output_local_path),
+                        captions_path=str(captions_path) if captions_path else None,
+                        title=req.title,
+                        title_duration=3.0,
+                        fps=25,
+                        enable_subtitles=req.enable_subtitles,
+                        subtitle_style=subtitle_style,
+                        title_style=title_style,
+                        on_progress=on_remotion_progress
+                    )
+                    print(f"[Pipeline] Remotion render completed")
+                except Exception as e:
+                    logger.warning(f"Remotion render failed, using FFmpeg fallback: {e}")
+                    # Fall back to the FFmpeg-composed video
+                    import shutil
+                    shutil.copy(str(composed_video_path), final_output_local_path)
+            else:
+                logger.warning(f"Remotion not ready: {remotion_health.get('error')}, using FFmpeg")
+                import shutil
+                shutil.copy(str(composed_video_path), final_output_local_path)
+        else:
+            # No subtitles or title needed: compose directly with FFmpeg
+            tasks[task_id]["message"] = "正在合成最终视频..."
+            tasks[task_id]["progress"] = 90
+
+            await video.compose(str(lipsync_video_path), str(final_audio_path), str(final_output_local_path))
+
        total_time = time.time() - start_time
+
+        # 5. Upload to Supabase with user isolation
+        tasks[task_id]["message"] = "正在上传结果..."
+        tasks[task_id]["progress"] = 95
+
+        # Prefix the storage path with user_id to isolate each user's files
+        storage_path = f"{user_id}/{task_id}_output.mp4"
+        with open(final_output_local_path, "rb") as f:
+            file_data = f.read()
+            await storage_service.upload_file(
+                bucket=storage_service.BUCKET_OUTPUTS,
+                path=storage_path,
+                file_data=file_data,
+                content_type="video/mp4"
+            )
+
+        # Get Signed URL
+        signed_url = await storage_service.get_signed_url(
+            bucket=storage_service.BUCKET_OUTPUTS,
+            path=storage_path
+        )
+
        print(f"[Pipeline] Total generation time: {total_time:.1f}s")
-        
+
        tasks[task_id]["status"] = "completed"
        tasks[task_id]["progress"] = 100
        tasks[task_id]["message"] = f"生成完成！耗时 {total_time:.0f} 秒"
-        tasks[task_id]["output"] = str(final_output)
-        tasks[task_id]["download_url"] = f"/outputs/{final_output.name}"
+        tasks[task_id]["output"] = storage_path
+        tasks[task_id]["download_url"] = signed_url
        
    except Exception as e:
        tasks[task_id]["status"] = "failed"
        tasks[task_id]["message"] = f"错误: {str(e)}"
        tasks[task_id]["error"] = traceback.format_exc()
+        logger.error(f"Generate video failed: {e}")
+    finally:
+        # Cleanup temp files
+        for f in temp_files:
+            try:
+                if f.exists():
+                    f.unlink()
+            except Exception as e:
+                logger.warning(f"Error cleaning up {f}: {e}")

@router.post("/generate")
-async def generate_video(req: GenerateRequest, background_tasks: BackgroundTasks):
+async def generate_video(
+    req: GenerateRequest,
+    background_tasks: BackgroundTasks,
+    current_user: dict = Depends(get_current_user)
+):
+    user_id = current_user["id"]
    task_id = str(uuid.uuid4())
-    tasks[task_id] = {"status": "pending", "task_id": task_id, "progress": 0}
-    background_tasks.add_task(_process_video_generation, task_id, req)
+    tasks[task_id] = {"status": "pending", "task_id": task_id, "progress": 0, "user_id": user_id}
+    background_tasks.add_task(_process_video_generation, task_id, req, user_id)
    return {"task_id": task_id}

@router.get("/tasks/{task_id}")
@@ -143,55 +390,88 @@ async def lipsync_health():
    return await lipsync.check_health()


+@router.get("/voiceclone/health")
+async def voiceclone_health():
+    """Return the health status of the voice-clone service"""
+    return await voice_clone_service.check_health()
+
+
@router.get("/generated")
-async def list_generated_videos():
-    """从文件系统读取生成的视频列表（持久化）"""
-    output_dir = settings.OUTPUT_DIR
-    videos = []
-    
-    if output_dir.exists():
-        for f in output_dir.glob("*_output.mp4"):
-            try:
-                stat = f.stat()
-                videos.append({
-                    "id": f.stem,
-                    "name": f.name,
-                    "path": f"/outputs/{f.name}",
-                    "size_mb": stat.st_size / (1024 * 1024),
-                    "created_at": stat.st_ctime
-                })
-            except Exception:
+async def list_generated_videos(current_user: dict = Depends(get_current_user)):
+    """List the current user's generated videos from Storage."""
+    user_id = current_user["id"]
+    try:
+        # List only the files under the current user's directory
+        files_obj = await storage_service.list_files(
+            bucket=storage_service.BUCKET_OUTPUTS,
+            path=user_id
+        )
+
+        videos = []
+        for f in files_obj:
+            name = f.get('name')
+            if not name or name == '.emptyFolderPlaceholder':
                continue
-    
-    # Sort by creation time desc (newest first)
-    videos.sort(key=lambda x: x.get("created_at", 0), reverse=True)
-    return {"videos": videos}
+
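+            # Skip anything that is not an *_output.mp4 file
+            if not name.endswith("_output.mp4"):
+                continue
+
+            # The ID is the file name without its extension
+            video_id = Path(name).stem
+
+            # The full storage path is prefixed with user_id
+            full_path = f"{user_id}/{name}"
+
+            # Get a signed URL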
+            signed_url = await storage_service.get_signed_url(
+                bucket=storage_service.BUCKET_OUTPUTS,
+                path=full_path
+            )
+
+            metadata = f.get('metadata', {})
+            size = metadata.get('size', 0)
+            # created_at is a top-level ISO string; convert it to a Unix timestamp
+            created_at_str = f.get('created_at', '')
+            created_at = 0
+            if created_at_str:
+                from datetime import datetime
+                try:
+                    dt = datetime.fromisoformat(created_at_str.replace('Z', '+00:00'))
+                    created_at = int(dt.timestamp())
+                except ValueError:
+                    pass
+
+            videos.append({
+                "id": video_id,
+                "name": name,
+                "path": signed_url, # Direct playable URL
+                "size_mb": size / (1024 * 1024),
+                "created_at": created_at
+            })
+
+        # Sort by created_at desc (newest first)
+        # created_at was converted to an integer Unix timestamp above, so sort numerically
+        videos.sort(key=lambda x: x.get("created_at", 0), reverse=True)
+        return {"videos": videos}
+
+    except Exception as e:
+        logger.error(f"List generated videos failed: {e}")
+        return {"videos": []}


@router.delete("/generated/{video_id}")
-async def delete_generated_video(video_id: str):
+async def delete_generated_video(video_id: str, current_user: dict = Depends(get_current_user)):
    """删除生成的视频"""
-    output_dir = settings.OUTPUT_DIR
-    
-    # 查找匹配的文件
-    found = None
-    for f in output_dir.glob("*.mp4"):
-        if f.stem == video_id:
-            found = f
-            break
-    
-    if not found:
-        raise HTTPException(404, "Video not found")
-    
+    user_id = current_user["id"]
    try:
-        found.unlink()
-        # 同时删除相关的临时文件（如果存在）
-        task_id = video_id.replace("_output", "")
-        for suffix in ["_audio.mp3", "_lipsync.mp4"]:
-            temp_file = output_dir / f"{task_id}{suffix}"
-            if temp_file.exists():
-                temp_file.unlink()
-        
+        # video_id is normally "<uuid>_output"; the full storage path is prefixed with user_id
+        storage_path = f"{user_id}/{video_id}.mp4"
+
+        await storage_service.delete_file(
+            bucket=storage_service.BUCKET_OUTPUTS,
+            path=storage_path
+        )
        return {"success": True, "message": "视频已删除"}
    except Exception as e:
        raise HTTPException(500, f"删除失败: {str(e)}")
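The listing endpoint in this file converts the ISO 8601 `created_at` string returned by Storage into a Unix timestamp and then sorts newest first. A minimal standalone sketch of that step, assuming the value arrives as a string such as `2026-02-04T10:00:00Z`; `parse_created_at` and `sort_newest_first` are hypothetical helper names and are not part of this commit:

```python
from datetime import datetime
from typing import Any, Dict, List


def parse_created_at(value: str) -> int:
    """Convert an ISO 8601 string (optionally ending in 'Z') to a Unix timestamp.

    Returns 0 when the value is empty or cannot be parsed, mirroring the
    fallback used by the endpoint.
    """
    if not value:
        return 0
    try:
        # fromisoformat() on older Python versions rejects a trailing 'Z',
        # so it is rewritten as an explicit UTC offset first.
        return int(datetime.fromisoformat(value.replace("Z", "+00:00")).timestamp())
    except ValueError:
        return 0


def sort_newest_first(videos: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Return a new list ordered by created_at, newest first."""
    return sorted(videos, key=lambda v: v.get("created_at", 0), reverse=True)
```

Keeping the sort key numeric (default `0`) avoids comparing integers with strings if an entry ever lacks `created_at`.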
--- a/backend/app/core/config.py
+++ b/backend/app/core/config.py
@@ -3,9 +3,10 @@ from pathlib import Path

 class Settings(BaseSettings):
    # 基础路径配置
-    BASE_DIR: Path = Path(__file__).resolve().parent.parent
-    UPLOAD_DIR: Path = BASE_DIR.parent / "uploads"
-    OUTPUT_DIR: Path = BASE_DIR.parent / "outputs"
+    BASE_DIR: Path = Path(__file__).resolve().parent.parent
+    UPLOAD_DIR: Path = BASE_DIR.parent / "uploads"
+    OUTPUT_DIR: Path = BASE_DIR.parent / "outputs"
+    ASSETS_DIR: Path = BASE_DIR.parent / "assets"
    
    # 数据库/缓存
    REDIS_URL: str = "redis://localhost:6379/0"
@@ -22,12 +23,12 @@ class Settings(BaseSettings):
    LATENTSYNC_INFERENCE_STEPS: int = 20            # 推理步数 [20-50]
    LATENTSYNC_GUIDANCE_SCALE: float = 1.5          # 引导系数 [1.0-3.0]
    LATENTSYNC_ENABLE_DEEPCACHE: bool = True        # 启用 DeepCache 加速
-    LATENTSYNC_ENABLE_DEEPCACHE: bool = True        # 启用 DeepCache 加速
    LATENTSYNC_SEED: int = 1247                     # 随机种子 (-1 则随机)
-    LATENTSYNC_USE_SERVER: bool = False             # 使用常驻服务 (Persistent Server) 加速
+    LATENTSYNC_USE_SERVER: bool = True              # 使用常驻服务 (Persistent Server) 加速
    
    # Supabase 配置
    SUPABASE_URL: str = ""
+    SUPABASE_PUBLIC_URL: str = ""  # Public address, used to build URLs the frontend can reach
    SUPABASE_KEY: str = ""
    
    # JWT 配置
@@ -36,9 +37,13 @@ class Settings(BaseSettings):
    JWT_EXPIRE_HOURS: int = 24
    
    # 管理员配置
-    ADMIN_EMAIL: str = ""
+    ADMIN_PHONE: str = ""
    ADMIN_PASSWORD: str = ""
    
+    # GLM AI settings
+    GLM_API_KEY: str = ""
+    GLM_MODEL: str = "glm-4.7-flash"
+    
    @property
    def LATENTSYNC_DIR(self) -> Path:
        """LatentSync 目录路径 (动态计算)"""
--- a/backend/app/main.py
+++ b/backend/app/main.py
@@ -2,7 +2,7 @@ from fastapi import FastAPI
 from fastapi.staticfiles import StaticFiles
 from fastapi.middleware.cors import CORSMiddleware
 from app.core import config
-from app.api import materials, videos, publish, login_helper, auth, admin
+from app.api import materials, videos, publish, login_helper, auth, admin, ref_audios, ai, tools, assets
 from loguru import logger
 import os

@@ -10,6 +10,28 @@ settings = config.settings

 app = FastAPI(title="ViGent TalkingHead Agent")

+from fastapi import Request
+from starlette.middleware.base import BaseHTTPMiddleware
+import time
+import traceback
+
+class LoggingMiddleware(BaseHTTPMiddleware):
+    async def dispatch(self, request: Request, call_next):
+        start_time = time.time()
+        logger.info(f"START Request: {request.method} {request.url}")
+        logger.info(f"HEADERS: { {k: v for k, v in request.headers.items() if k.lower() not in ('authorization', 'cookie')} }")
+        try:
+            response = await call_next(request)
+            process_time = time.time() - start_time
+            logger.info(f"END Request: {request.method} {request.url} - Status: {response.status_code} - Duration: {process_time:.2f}s")
+            return response
+        except Exception as e:
+            process_time = time.time() - start_time
+            logger.error(f"EXCEPTION during request {request.method} {request.url}: {str(e)}\n{traceback.format_exc()}")
+            raise
+
+app.add_middleware(LoggingMiddleware)
+
 app.add_middleware(
    CORSMiddleware,
    allow_origins=["*"],
@@ -19,12 +41,14 @@ app.add_middleware(
 )

 # Create dirs
-settings.UPLOAD_DIR.mkdir(parents=True, exist_ok=True)
-settings.OUTPUT_DIR.mkdir(parents=True, exist_ok=True)
-(settings.UPLOAD_DIR / "materials").mkdir(exist_ok=True)
+settings.UPLOAD_DIR.mkdir(parents=True, exist_ok=True)
+settings.OUTPUT_DIR.mkdir(parents=True, exist_ok=True)
+(settings.UPLOAD_DIR / "materials").mkdir(exist_ok=True)
+settings.ASSETS_DIR.mkdir(parents=True, exist_ok=True)

-app.mount("/outputs", StaticFiles(directory=str(settings.OUTPUT_DIR)), name="outputs")
-app.mount("/uploads", StaticFiles(directory=str(settings.UPLOAD_DIR)), name="uploads")
+app.mount("/outputs", StaticFiles(directory=str(settings.OUTPUT_DIR)), name="outputs")
+app.mount("/uploads", StaticFiles(directory=str(settings.UPLOAD_DIR)), name="uploads")
+app.mount("/assets", StaticFiles(directory=str(settings.ASSETS_DIR)), name="assets")

 # 注册路由
 app.include_router(materials.router, prefix="/api/materials", tags=["Materials"])
@@ -33,6 +57,10 @@ app.include_router(publish.router, prefix="/api/publish", tags=["Publish"])
 app.include_router(login_helper.router, prefix="/api", tags=["LoginHelper"])
 app.include_router(auth.router)  # /api/auth
 app.include_router(admin.router)  # /api/admin
+app.include_router(ref_audios.router, prefix="/api/ref-audios", tags=["RefAudios"])
+app.include_router(ai.router)  # /api/ai
+app.include_router(tools.router, prefix="/api/tools", tags=["Tools"])
+app.include_router(assets.router, prefix="/api/assets", tags=["Assets"])


@app.on_event("startup")
@@ -40,11 +68,11 @@ async def init_admin():
    """
    服务启动时初始化管理员账号
    """
-    admin_email = settings.ADMIN_EMAIL
+    admin_phone = settings.ADMIN_PHONE
    admin_password = settings.ADMIN_PASSWORD
    
-    if not admin_email or not admin_password:
-        logger.warning("未配置 ADMIN_EMAIL 和 ADMIN_PASSWORD，跳过管理员初始化")
+    if not admin_phone or not admin_password:
+        logger.warning("未配置 ADMIN_PHONE 和 ADMIN_PASSWORD，跳过管理员初始化")
        return
    
    try:
@@ -54,15 +82,15 @@ async def init_admin():
        supabase = get_supabase()
        
        # 检查是否已存在
-        existing = supabase.table("users").select("id").eq("email", admin_email).execute()
+        existing = supabase.table("users").select("id").eq("phone", admin_phone).execute()
        
        if existing.data:
-            logger.info(f"管理员账号已存在: {admin_email}")
+            logger.info(f"管理员账号已存在: {admin_phone}")
            return
        
        # 创建管理员
        supabase.table("users").insert({
-            "email": admin_email,
+            "phone": admin_phone,
            "password_hash": get_password_hash(admin_password),
            "username": "Admin",
            "role": "admin",
@@ -70,7 +98,7 @@ async def init_admin():
            "expires_at": None  # 永不过期
        }).execute()
        
-        logger.success(f"管理员账号已创建: {admin_email}")
+        logger.success(f"管理员账号已创建: {admin_phone}")
    except Exception as e:
        logger.error(f"初始化管理员失败: {e}")

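Request headers written to the log by a middleware such as `LoggingMiddleware` can carry credentials, for example the `Authorization` bearer token used by this API. A small helper that masks such values before logging might look like the sketch below; `redact_headers` and `SENSITIVE_HEADERS` are hypothetical names, and the list of sensitive headers is an assumption that should be adapted to the deployment:

```python
from typing import Dict, Mapping

# Header names (lower-case) whose values should never reach the log.
# This set is an illustrative assumption, not taken from the commit.
SENSITIVE_HEADERS = frozenset({"authorization", "cookie", "set-cookie", "x-api-key"})


def redact_headers(headers: Mapping[str, str]) -> Dict[str, str]:
    """Return a copy of the headers with sensitive values replaced by '***'."""
    return {
        name: ("***" if name.lower() in SENSITIVE_HEADERS else value)
        for name, value in headers.items()
    }
```

Inside a middleware this would be used as `logger.info(f"HEADERS: {redact_headers(request.headers)}")`, which keeps the header names visible for debugging without exposing tokens.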
--- a/backend/app/services/assets_service.py
+++ b/backend/app/services/assets_service.py
@@ -0,0 +1,128 @@
+import json
+import shutil
+from pathlib import Path
+from typing import Optional, List, Dict, Any
+
+from loguru import logger
+
+from app.core.config import settings
+
+
+BGM_EXTENSIONS = {".wav", ".mp3", ".m4a", ".aac", ".flac", ".ogg", ".webm"}
+
+
+def _style_file_path(style_type: str) -> Path:
+    return settings.ASSETS_DIR / "styles" / f"{style_type}.json"
+
+
+def _load_style_file(style_type: str) -> List[Dict[str, Any]]:
+    style_path = _style_file_path(style_type)
+    if not style_path.exists():
+        return []
+    try:
+        with open(style_path, "r", encoding="utf-8") as f:
+            data = json.load(f)
+            if isinstance(data, list):
+                return data
+    except Exception as e:
+        logger.error(f"Failed to load style file {style_path}: {e}")
+    return []
+
+
+def list_styles(style_type: str) -> List[Dict[str, Any]]:
+    return _load_style_file(style_type)
+
+
+def get_style(style_type: str, style_id: Optional[str]) -> Optional[Dict[str, Any]]:
+    if not style_id:
+        return None
+    for item in _load_style_file(style_type):
+        if item.get("id") == style_id:
+            return item
+    return None
+
+
+def get_default_style(style_type: str) -> Optional[Dict[str, Any]]:
+    styles = _load_style_file(style_type)
+    if not styles:
+        return None
+    for item in styles:
+        if item.get("is_default"):
+            return item
+    return styles[0]
+
+
+def list_bgm() -> List[Dict[str, Any]]:
+    bgm_root = settings.ASSETS_DIR / "bgm"
+    if not bgm_root.exists():
+        return []
+
+    items: List[Dict[str, Any]] = []
+    for path in bgm_root.rglob("*"):
+        if not path.is_file():
+            continue
+        if path.suffix.lower() not in BGM_EXTENSIONS:
+            continue
+        rel = path.relative_to(bgm_root).as_posix()
+        items.append({
+            "id": rel,
+            "name": path.stem,
+            "ext": path.suffix.lower().lstrip(".")
+        })
+
+    items.sort(key=lambda x: x.get("name", ""))
+    return items
+
+
+def resolve_bgm_path(bgm_id: str) -> Optional[Path]:
+    if not bgm_id:
+        return None
+    bgm_root = settings.ASSETS_DIR / "bgm"
+    candidate = (bgm_root / bgm_id).resolve()
+    try:
+        candidate.relative_to(bgm_root.resolve())
+    except ValueError:
+        return None
+    if candidate.exists() and candidate.is_file():
+        return candidate
+    return None
+
+
+def prepare_style_for_remotion(
+    style: Optional[Dict[str, Any]],
+    temp_dir: Path,
+    prefix: str
+) -> Optional[Dict[str, Any]]:
+    if not style:
+        return None
+
+    prepared = dict(style)
+    font_file = prepared.get("font_file")
+    if not font_file:
+        return prepared
+
+    source_font = (settings.ASSETS_DIR / "fonts" / font_file).resolve()
+    try:
+        source_font.relative_to((settings.ASSETS_DIR / "fonts").resolve())
+    except ValueError:
+        logger.warning(f"Font path outside assets: {font_file}")
+        return prepared
+
+    if not source_font.exists():
+        logger.warning(f"Font file missing: {source_font}")
+        return prepared
+
+    temp_dir.mkdir(parents=True, exist_ok=True)
+    ext = source_font.suffix.lower()
+    target_name = f"{prefix}{ext}"
+    target_path = temp_dir / target_name
+
+    try:
+        shutil.copy(source_font, target_path)
+        prepared["font_file"] = target_name
+        if not prepared.get("font_family"):
+            prepared["font_family"] = prefix
+    except Exception as e:
+        logger.warning(f"Failed to copy font {source_font} -> {target_path}: {e}")
+
+    return prepared
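Both `resolve_bgm_path` and `prepare_style_for_remotion` guard against path traversal by resolving the candidate path and checking that it still lies under the expected root. The same pattern, reduced to a standalone sketch; `resolve_inside` is a hypothetical helper name used only for illustration:

```python
from pathlib import Path
from typing import Optional


def resolve_inside(root: Path, user_supplied: str) -> Optional[Path]:
    """Resolve user_supplied relative to root, rejecting anything that escapes root.

    Returns the resolved path if it is an existing file inside root, else None.
    """
    if not user_supplied:
        return None
    root_resolved = root.resolve()
    candidate = (root_resolved / user_supplied).resolve()
    try:
        # relative_to() raises ValueError when candidate is not under root,
        # which covers "../" segments and absolute paths alike.
        candidate.relative_to(root_resolved)
    except ValueError:
        return None
    return candidate if candidate.is_file() else None
```

Resolving before the check matters: comparing unresolved strings would let `bgm/../../secret` pass a naive prefix test.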
--- a/backend/app/services/glm_service.py
+++ b/backend/app/services/glm_service.py
@@ -0,0 +1,146 @@
+"""
+GLM AI service.
+Uses Zhipu GLM to generate titles and tags.
+"""
+
+import json
+import re
+from loguru import logger
+from zai import ZhipuAiClient
+
+from app.core.config import settings
+
+
+class GLMService:
+    """GLM AI service."""
+
+    def __init__(self):
+        self.client = None
+
+    def _get_client(self):
+        """Return the ZhipuAI client, creating it on first use."""
+        if self.client is None:
+            if not settings.GLM_API_KEY:
+                raise Exception("GLM_API_KEY 未配置")
+            self.client = ZhipuAiClient(api_key=settings.GLM_API_KEY)
+        return self.client
+
+    async def generate_title_tags(self, text: str) -> dict:
+        """
+        Generate a title and tags from a voice-over script.
+
+        Args:
+            text: The voice-over script.
+
+        Returns:
+            {"title": "<title>", "tags": ["<tag 1>", "<tag 2>", ...]}
+        """
+        prompt = f"""根据以下口播文案，生成一个吸引人的短视频标题和3个相关标签。
+
+口播文案：
+{text}
+
+要求：
+1. 标题要简洁有力，能吸引观众点击，不超过10个字
+2. 标签要与内容相关，便于搜索和推荐，只要3个
+
+请严格按以下JSON格式返回（不要包含其他内容）：
+{{"title": "标题", "tags": ["标签1", "标签2", "标签3"]}}"""
+
+        try:
+            client = self._get_client()
+            logger.info(f"Calling GLM API with model: {settings.GLM_MODEL}")
+            
+            response = client.chat.completions.create(
+                model=settings.GLM_MODEL,
+                messages=[{"role": "user", "content": prompt}],
+                thinking={"type": "disabled"},  # Disable thinking mode for faster responses
+                max_tokens=500,
+                temperature=0.7
+            )
+
+            # Extract the generated content
+            content = response.choices[0].message.content
+            logger.info(f"GLM response (model: {settings.GLM_MODEL}): {content}")
+
+            # Parse the JSON
+            result = self._parse_json_response(content)
+            return result
+
+        except Exception as e:
+            logger.error(f"GLM service error: {e}")
+            raise Exception(f"AI 生成失败: {str(e)}")
+
+    async def rewrite_script(self, text: str) -> str:
+        """
+        Rewrite a script with AI (paraphrasing).
+
+        Args:
+            text: The original script.
+
+        Returns:
+            The rewritten script.
+        """
+        prompt = f"""请将以下视频文案进行改写。
+
+原始文案：
+{text}
+
+要求：
+1. 保持原意，但语气更加自然流畅
+2. 适合口播，读起来朗朗上口
+3. 字数与原文相当或略微精简
+4. 不要返回多余的解释，只返回改写后的正文"""
+
+        try:
+            client = self._get_client()
+            logger.info("Using GLM to rewrite script")
+
+            response = client.chat.completions.create(
+                model=settings.GLM_MODEL,
+                messages=[{"role": "user", "content": prompt}],
+                thinking={"type": "disabled"},
+                max_tokens=2000,
+                temperature=0.8
+            )
+
+            content = response.choices[0].message.content
+            logger.info("GLM rewrite completed")
+            return content.strip()
+
+        except Exception as e:
+            logger.error(f"GLM rewrite error: {e}")
+            raise Exception(f"AI 改写失败: {str(e)}")
+
+
+
+    def _parse_json_response(self, content: str) -> dict:
+        """Parse the JSON content returned by GLM."""
+        # First try to parse the whole reply directly
+        try:
+            return json.loads(content)
+        except json.JSONDecodeError:
+            pass
+
+        # Then try to extract an inline JSON object
+        json_match = re.search(r'\{[^{}]*"title"[^{}]*"tags"[^{}]*\}', content, re.DOTALL)
+        if json_match:
+            try:
+                return json.loads(json_match.group())
+            except json.JSONDecodeError:
+                pass
+
+        # Finally try to extract a ```json fenced code block
+        code_match = re.search(r'```(?:json)?\s*(\{.*?\})\s*```', content, re.DOTALL)
+        if code_match:
+            try:
+                return json.loads(code_match.group(1))
+            except json.JSONDecodeError:
+                pass
+
+        logger.error(f"Failed to parse GLM response: {content}")
+        raise Exception("AI 返回格式解析失败")
+
+
+# Module-level service instance
+glm_service = GLMService()
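`_parse_json_response` tries three strategies in order: parse the reply as-is, extract an inline `{...}` object that contains `"title"` followed by `"tags"`, and finally extract a fenced code block. The sketch below restates that fallback chain as a standalone function so it can be exercised without the GLM client. `parse_json_reply` is a hypothetical name, and it raises `ValueError` rather than a bare `Exception`, which is a deliberate variation on the original:

```python
import json
import re
from typing import Any, Dict

# Three backticks, built at runtime so this example can itself sit in a fenced block.
FENCE = "`" * 3

# (pattern, capture group) pairs tried in order after direct parsing fails.
_FALLBACK_PATTERNS = [
    (r'\{[^{}]*"title"[^{}]*"tags"[^{}]*\}', 0),            # inline flat object
    (FENCE + r'(?:json)?\s*(\{.*?\})\s*' + FENCE, 1),        # fenced code block
]


def parse_json_reply(content: str) -> Dict[str, Any]:
    """Parse a model reply that should contain a JSON object."""
    try:
        return json.loads(content)
    except json.JSONDecodeError:
        pass

    for pattern, group in _FALLBACK_PATTERNS:
        match = re.search(pattern, content, re.DOTALL)
        if match:
            try:
                return json.loads(match.group(group))
            except json.JSONDecodeError:
                continue

    raise ValueError("could not parse model reply as JSON")
```

The inline pattern only matches flat objects with `"title"` before `"tags"`, so replies with a different key order rely on the fenced-block fallback.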
--- a/backend/app/services/lipsync_service.py
+++ b/backend/app/services/lipsync_service.py
@@ -73,7 +73,51 @@ class LipSyncService:
            logger.warning(f"⚠️ Conda Python 不存在: {self.conda_python}")
            return False
        return True
-    
+
+    def _get_media_duration(self, media_path: str) -> Optional[float]:
+        """Return the duration of an audio or video file in seconds."""
+        try:
+            cmd = [
+                "ffprobe", "-v", "error",
+                "-show_entries", "format=duration",
+                "-of", "default=noprint_wrappers=1:nokey=1",
+                media_path
+            ]
+            result = subprocess.run(cmd, capture_output=True, text=True, timeout=10)
+            if result.returncode == 0:
+                return float(result.stdout.strip())
+        except Exception as e:
+            logger.warning(f"⚠️ 获取媒体时长失败: {e}")
+        return None
+
+    def _loop_video_to_duration(self, video_path: str, output_path: str, target_duration: float) -> str:
+        """
+        Loop the video until it matches the target duration.
+        Uses FFmpeg's stream_loop option for seamless looping.
+        """
+        try:
+            cmd = [
+                "ffmpeg", "-y",
+                "-stream_loop", "-1",  # loop the input indefinitely
+                "-i", video_path,
+                "-t", str(target_duration),  # trim to the target duration
+                "-c:v", "libx264",
+                "-preset", "fast",
+                "-crf", "18",
+                "-an",  # drop the original audio track
+                output_path
+            ]
+            result = subprocess.run(cmd, capture_output=True, text=True, timeout=300)
+            if result.returncode == 0 and Path(output_path).exists():
+                logger.info(f"✅ 视频循环完成: {target_duration:.1f}s")
+                return output_path
+            else:
+                logger.warning(f"⚠️ 视频循环失败: {result.stderr[:200]}")
+                return video_path
+        except Exception as e:
+            logger.warning(f"⚠️ 视频循环异常: {e}")
+            return video_path
+
    def _preprocess_video(self, video_path: str, output_path: str, target_height: int = 720) -> str:
        """
        视频预处理：压缩视频以加速后续处理
@@ -204,27 +248,34 @@ class LipSyncService:
        
        logger.info("⏳ 等待 GPU 资源 (排队中)...")
        async with self._lock:
-            if self.use_server:
-                # 模式 A: 调用常驻服务 (加速模式)
-                return await self._call_persistent_server(video_path, audio_path, output_path)
-                
-            logger.info("🔄 调用 LatentSync 推理 (subprocess)...")
-            
-            # 使用临时目录存放输出
+            # Keep intermediate files in a temporary directory
            with tempfile.TemporaryDirectory() as tmpdir:
                tmpdir = Path(tmpdir)
+
+                # Get the audio and video durations
+                audio_duration = self._get_media_duration(audio_path)
+                video_duration = self._get_media_duration(video_path)
+
+                # If the audio is longer than the video, loop the video to match the audio length
+                if audio_duration and video_duration and audio_duration > video_duration + 0.5:
+                    logger.info(f"🔄 音频({audio_duration:.1f}s) > 视频({video_duration:.1f}s)，循环视频...")
+                    looped_video = tmpdir / "looped_input.mp4"
+                    actual_video_path = self._loop_video_to_duration(
+                        video_path,
+                        str(looped_video),
+                        audio_duration
+                    )
+                else:
+                    actual_video_path = video_path
+
+                if self.use_server:
+                    # Mode A: call the persistent server (accelerated mode)
+                    return await self._call_persistent_server(actual_video_path, audio_path, output_path)
+
+                logger.info("🔄 调用 LatentSync 推理 (subprocess)...")
+
                temp_output = tmpdir / "output.mp4"
                
-                # 视频预处理：压缩高分辨率视频以加速处理
-                # preprocessed_video = tmpdir / "preprocessed_input.mp4"
-                # actual_video_path = self._preprocess_video(
-                #     video_path, 
-                #     str(preprocessed_video), 
-                #     target_height=720
-                # )
-                # 暂时禁用预处理以保持原始分辨率
-                actual_video_path = video_path
-                
                # 构建命令
                cmd = [
                    str(self.conda_python),
@@ -285,7 +336,7 @@ class LipSyncService:
                        return output_path
                    
                    logger.info(f"LatentSync 输出:\n{stdout_text[-500:] if stdout_text else 'N/A'}")
-                    
+
                    # 检查输出文件
                    if temp_output.exists():
                        shutil.copy(temp_output, output_path)
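The change to `LipSyncService` loops the source video whenever the audio is more than 0.5 s longer than the video, using FFmpeg's `-stream_loop -1` input option together with `-t`. The decision and the command construction can be sketched as two pure functions; `needs_loop`, `build_loop_command` and `LOOP_TOLERANCE_S` are hypothetical names introduced here for illustration:

```python
from typing import List, Optional

# Audio must exceed the video by more than this many seconds before looping
# (the 0.5 s value is the threshold used in the diff).
LOOP_TOLERANCE_S = 0.5


def needs_loop(
    audio_s: Optional[float],
    video_s: Optional[float],
    tolerance: float = LOOP_TOLERANCE_S,
) -> bool:
    """Return True when the video should be looped to cover the audio."""
    if audio_s is None or video_s is None:
        # Unknown duration (for example ffprobe failed): leave the video as-is.
        return False
    return audio_s > video_s + tolerance


def build_loop_command(video_path: str, output_path: str, target_s: float) -> List[str]:
    """Build an ffmpeg command that loops the input and trims it to target_s seconds."""
    return [
        "ffmpeg", "-y",
        "-stream_loop", "-1",      # input option: must precede -i
        "-i", video_path,
        "-t", f"{target_s:.3f}",   # stop writing output at the target duration
        "-c:v", "libx264",
        "-preset", "fast",
        "-crf", "18",
        "-an",                     # no audio in the looped clip
        output_path,
    ]
```

Because `-stream_loop` applies to the input that follows it, its position before `-i` is what makes the loop take effect.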
--- a/backend/app/services/publish_service.py
+++ b/backend/app/services/publish_service.py
@@ -2,12 +2,17 @@
 发布服务 (支持用户隔离)
 """
 import json
+import os
+import re
+import tempfile
+import httpx
 from datetime import datetime
 from pathlib import Path
 from typing import Optional, List, Dict, Any
 from loguru import logger
 from app.core.config import settings
 from app.core.paths import get_user_cookie_dir, get_platform_cookie_path, get_legacy_cookie_dir, get_legacy_cookie_path
+from app.services.storage import storage_service

 # Import platform uploaders
 from .uploader.bilibili_uploader import BilibiliUploader
@@ -17,7 +22,7 @@ from .uploader.xiaohongshu_uploader import XiaohongshuUploader

 class PublishService:
    """Social media publishing service (with user isolation)"""
-    
+
    # 支持的平台配置
    PLATFORMS: Dict[str, Dict[str, Any]] = {
        "bilibili": {"name": "B站", "url": "https://member.bilibili.com/platform/upload/video/frame", "enabled": True},
@@ -113,13 +118,56 @@ class PublishService:
        logger.info(f"[发布] 视频: {video_path}")
        logger.info(f"[发布] 标题: {title}")
        logger.info(f"[发布] 用户: {user_id or 'legacy'}")
-        
+
+        temp_file = None
        try:
+            # Resolve the video path
+            if video_path.startswith('http://') or video_path.startswith('https://'):
+                # Try to parse the bucket and path from the URL and use the local file directly
+                local_video_path = None
+
+                # URL format: .../storage/v1/object/sign/{bucket}/{path}?token=...
+                match = re.search(r'/storage/v1/object/sign/([^/]+)/(.+?)\?', video_path)
+                if match:
+                    bucket = match.group(1)
+                    storage_path = match.group(2)
+                    logger.info(f"[发布] 解析 URL: bucket={bucket}, path={storage_path}")
+
+                    # Try to get the local file path
+                    local_video_path = storage_service.get_local_file_path(bucket, storage_path)
+
+                if local_video_path and os.path.exists(local_video_path):
+                    logger.info(f"[发布] 直接使用本地文件: {local_video_path}")
+                else:
+                    # The local file does not exist; download it over HTTP
+                    logger.info("[发布] 本地文件不存在，通过 HTTP 下载...")
+                    temp_file = tempfile.NamedTemporaryFile(delete=False, suffix='.mp4')
+                    temp_file.close()
+
+                    # Replace the public URL with the internal URL
+                    download_url = video_path
+                    if settings.SUPABASE_PUBLIC_URL and settings.SUPABASE_URL:
+                        public_url = settings.SUPABASE_PUBLIC_URL.rstrip('/')
+                        internal_url = settings.SUPABASE_URL.rstrip('/')
+                        download_url = video_path.replace(public_url, internal_url)
+
+                    async with httpx.AsyncClient(timeout=httpx.Timeout(None)) as client:
+                        async with client.stream("GET", download_url) as resp:
+                            resp.raise_for_status()
+                            with open(temp_file.name, 'wb') as f:
+                                async for chunk in resp.aiter_bytes():
+                                    f.write(chunk)
+                    local_video_path = temp_file.name
+                    logger.info(f"[发布] 视频已下载到: {local_video_path}")
+            else:
+                # Local relative path
+                local_video_path = str(settings.BASE_DIR.parent / video_path)
+
            # Select appropriate uploader
            if platform == "bilibili":
                uploader = BilibiliUploader(
                    title=title,
-                    file_path=str(settings.BASE_DIR.parent / video_path),
+                    file_path=local_video_path,
                    tags=tags,
                    publish_date=publish_time,
                    account_file=str(account_file),
@@ -130,7 +178,7 @@ class PublishService:
            elif platform == "douyin":
                uploader = DouyinUploader(
                    title=title,
-                    file_path=str(settings.BASE_DIR.parent / video_path),
+                    file_path=local_video_path,
                    tags=tags,
                    publish_date=publish_time,
                    account_file=str(account_file),
@@ -139,7 +187,7 @@ class PublishService:
            elif platform == "xiaohongshu":
                uploader = XiaohongshuUploader(
                    title=title,
-                    file_path=str(settings.BASE_DIR.parent / video_path),
+                    file_path=local_video_path,
                    tags=tags,
                    publish_date=publish_time,
                    account_file=str(account_file),
@@ -157,7 +205,7 @@ class PublishService:
            result = await uploader.main()
            result['platform'] = platform
            return result
-            
+
        except Exception as e:
            logger.exception(f"[发布] 上传异常: {e}")
            return {
@@ -165,6 +213,14 @@ class PublishService:
                "message": f"上传异常: {str(e)}",
                "platform": platform
            }
+        finally:
+            # Clean up the temporary file
+            if temp_file and os.path.exists(temp_file.name):
+                try:
+                    os.remove(temp_file.name)
+                    logger.info(f"[发布] 已清理临时文件: {temp_file.name}")
+                except Exception as e:
+                    logger.warning(f"[发布] 清理临时文件失败: {e}")
    
    async def login(self, platform: str, user_id: Optional[str] = None) -> Dict[str, Any]:
        """
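The publish path recovers the bucket and object path from a signed Storage URL of the form `.../storage/v1/object/sign/{bucket}/{path}?token=...`. A standalone sketch of that parsing step follows. It is a variation, not the commit's code: it splits off the query string with `urllib.parse` first, so it also works when no `?token=` is present, and it percent-decodes the path. `parse_signed_url` is a hypothetical name:

```python
import re
from typing import Optional, Tuple
from urllib.parse import unquote, urlsplit

_SIGNED_PATH_RE = re.compile(r"/storage/v1/object/sign/([^/]+)/(.+)$")


def parse_signed_url(url: str) -> Optional[Tuple[str, str]]:
    """Return (bucket, object_path) for a signed Storage URL, or None if it does not match."""
    # urlsplit() separates the query string, so the token never ends up in the path.
    path = urlsplit(url).path
    match = _SIGNED_PATH_RE.search(path)
    if match is None:
        return None
    return match.group(1), unquote(match.group(2))
```

For example, `parse_signed_url("https://example.invalid/storage/v1/object/sign/outputs/u1/abc_output.mp4?token=t")` yields the bucket `outputs` and the path `u1/abc_output.mp4`; the host name here is a placeholder.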
--- a/backend/app/services/remotion_service.py
+++ b/backend/app/services/remotion_service.py
@@ -0,0 +1,159 @@
+"""
+Remotion video rendering service.
+Calls Node.js Remotion to compose the video (subtitles + title).
+"""
+
+import asyncio
+import json
+import subprocess
+from pathlib import Path
+from typing import Optional
+from loguru import logger
+
+
+class RemotionService:
+    """Remotion video rendering service."""
+
+    def __init__(self, remotion_dir: Optional[str] = None):
+        # Remotion project directory
+        if remotion_dir:
+            self.remotion_dir = Path(remotion_dir)
+        else:
+            # Defaults to the ViGent2/remotion directory
+            self.remotion_dir = Path(__file__).parent.parent.parent.parent / "remotion"
+
+    async def render(
+        self,
+        video_path: str,
+        output_path: str,
+        captions_path: Optional[str] = None,
+        title: Optional[str] = None,
+        title_duration: float = 3.0,
+        fps: int = 25,
+        enable_subtitles: bool = True,
+        subtitle_style: Optional[dict] = None,
+        title_style: Optional[dict] = None,
+        on_progress: Optional[callable] = None
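+    ) -> str:
+        """
+        Render a video with Remotion (adds subtitles and a title).
+
+        Args:
+            video_path: Input video path (the lip-synced video)
+            output_path: Output video path
+            captions_path: Path to the captions JSON file (generated by Whisper)
+            title: Video title (optional)
+            title_duration: How long the title is shown, in seconds
+            fps: Frame rate
+            enable_subtitles: Whether to render subtitles
+            on_progress: Progress callback
+
+        Returns:
+            The output video path
+        """
+        # Build the command-line arguments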
+        cmd = [
+            "npx", "ts-node", "render.ts",
+            "--video", str(video_path),
+            "--output", str(output_path),
+            "--fps", str(fps),
+            "--enableSubtitles", str(enable_subtitles).lower()
+        ]
+
+        if captions_path:
+            cmd.extend(["--captions", str(captions_path)])
+
+        if title:
+            cmd.extend(["--title", title])
+            cmd.extend(["--titleDuration", str(title_duration)])
+
+        if subtitle_style:
+            cmd.extend(["--subtitleStyle", json.dumps(subtitle_style, ensure_ascii=False)])
+
+        if title_style:
+            cmd.extend(["--titleStyle", json.dumps(title_style, ensure_ascii=False)])
+
+        logger.info(f"Running Remotion render: {' '.join(cmd)}")
+
+        # Run the subprocess in a thread pool
+        def _run_render():
+            process = subprocess.Popen(
+                cmd,
+                cwd=str(self.remotion_dir),
+                stdout=subprocess.PIPE,
+                stderr=subprocess.STDOUT,
+                text=True,
+                bufsize=1
+            )
+
+            output_lines = []
+            for line in iter(process.stdout.readline, ''):
+                line = line.strip()
+                if line:
+                    output_lines.append(line)
+                    logger.debug(f"[Remotion] {line}")
+
+                    # Parse the progress percentage
+                    if "Rendering:" in line and "%" in line:
+                        try:
+                            percent_str = line.split("Rendering:")[1].strip().replace("%", "")
+                            percent = int(percent_str)
+                            if on_progress:
+                                on_progress(percent)
+                        except (ValueError, IndexError):
+                            pass
+
+            process.wait()
+
+            if process.returncode != 0:
+                error_msg = "\n".join(output_lines[-20:])  # last 20 lines
+                raise RuntimeError(f"Remotion render failed (code {process.returncode}):\n{error_msg}")
+
+            return output_path
+
+        loop = asyncio.get_running_loop()
+        result = await loop.run_in_executor(None, _run_render)
+
+        logger.info(f"Remotion render complete: {result}")
+        return result
+
+    async def check_health(self) -> dict:
+        """Check the health of the Remotion service."""
+        try:
+            # Check that the remotion directory exists
+            if not self.remotion_dir.exists():
+                return {
+                    "ready": False,
+                    "error": f"Remotion directory not found: {self.remotion_dir}"
+                }
+
+            # Check that package.json exists
+            package_json = self.remotion_dir / "package.json"
+            if not package_json.exists():
+                return {
+                    "ready": False,
+                    "error": "package.json not found"
+                }
+
+            # Check that node_modules exists
+            node_modules = self.remotion_dir / "node_modules"
+            if not node_modules.exists():
+                return {
+                    "ready": False,
+                    "error": "node_modules not found, run 'npm install' first"
+                }
+
+            return {
+                "ready": True,
+                "remotion_dir": str(self.remotion_dir)
+            }
+
+        except Exception as e:
+            return {
+                "ready": False,
+                "error": str(e)
+            }
+
+
+# Global service instance
+remotion_service = RemotionService()
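The progress parsing in the render loop above splits each line on `Rendering:` and converts the remainder to an integer, so any trailing text after the percent sign makes the conversion fail and the update is silently skipped. A minimal sketch of a regex-based alternative follows. The `Rendering: N%` prefix comes from the code above; the helper name `parse_progress` and the sample line with trailing text are assumptions made for illustration, not part of this commit.

```python
import re
from typing import Optional

# Matches "Rendering: 42%" and tolerates decimals and trailing text.
_PROGRESS_RE = re.compile(r"Rendering:\s*(\d+(?:\.\d+)?)\s*%")


def parse_progress(line: str) -> Optional[int]:
    """Return the progress percentage found in a log line, or None."""
    match = _PROGRESS_RE.search(line)
    if match is None:
        return None
    # Truncate decimals and cap the value at 100.
    return min(100, int(float(match.group(1))))
```

With this helper, `parse_progress("Rendering: 42% (frame 10)")` still yields a value, whereas the split-based version would raise `ValueError` internally and drop the update.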
--- a/backend/app/services/storage.py
+++ b/backend/app/services/storage.py
@@ -0,0 +1,168 @@
+from supabase import Client
+from app.core.supabase import get_supabase
+from app.core.config import settings
+from loguru import logger
+from typing import Optional, Union, Dict, List, Any
+from pathlib import Path
+import asyncio
+import functools
+import os
+
+# Root directory of Supabase Storage on the local disk
+SUPABASE_STORAGE_LOCAL_PATH = Path("/home/rongye/ProgramFiles/Supabase/volumes/storage/stub/stub")
+
+class StorageService:
+    def __init__(self):
+        self.supabase: Client = get_supabase()
+        self.BUCKET_MATERIALS = "materials"
+        self.BUCKET_OUTPUTS = "outputs"
+        self.BUCKET_REF_AUDIOS = "ref-audios"
+        # Make sure every bucket exists
+        self._ensure_buckets()
+
+    def _ensure_buckets(self):
+        """Make sure all required buckets exist."""
+        buckets = [self.BUCKET_MATERIALS, self.BUCKET_OUTPUTS, self.BUCKET_REF_AUDIOS]
+        try:
+            existing = self.supabase.storage.list_buckets()
+            existing_names = {b.name for b in existing} if existing else set()
+            for bucket_name in buckets:
+                if bucket_name not in existing_names:
+                    try:
+                        self.supabase.storage.create_bucket(bucket_name, options={"public": True})
+                        logger.info(f"Created bucket: {bucket_name}")
+                    except Exception as e:
+                        # The bucket may already exist; ignore the error
+                        logger.debug(f"Bucket {bucket_name} creation skipped: {e}")
+        except Exception as e:
+            logger.warning(f"Failed to ensure buckets: {e}")
+
+    def _convert_to_public_url(self, url: str) -> str:
+        """Convert an internal URL into a publicly reachable URL."""
+        if settings.SUPABASE_PUBLIC_URL and settings.SUPABASE_URL:
+            # Strip trailing slashes before substituting
+            internal_url = settings.SUPABASE_URL.rstrip('/')
+            public_url = settings.SUPABASE_PUBLIC_URL.rstrip('/')
+            return url.replace(internal_url, public_url)
+        return url
+
+    def get_local_file_path(self, bucket: str, path: str) -> Optional[str]:
+        """
+        Get the local disk path of a Storage file
+
+        Supabase Storage lays files out as:
+        {STORAGE_ROOT}/{bucket}/{path}/{internal_uuid}
+
+        Returns:
+            The local file path, or None if it does not exist
+        """
+        try:
+            # Build the directory path
+            dir_path = SUPABASE_STORAGE_LOCAL_PATH / bucket / path
+
+            if not dir_path.exists():
+                logger.warning(f"Storage directory does not exist: {dir_path}")
+                return None
+
+            # The directory holds a single file (internal_uuid)
+            files = list(dir_path.iterdir())
+            if not files:
+                logger.warning(f"Storage directory is empty: {dir_path}")
+                return None
+
+            local_path = str(files[0])
+            logger.info(f"Resolved local file path: {local_path}")
+            return local_path
+
+        except Exception as e:
+            logger.error(f"Failed to resolve local file path: {e}")
+            return None
+
+    async def upload_file(self, bucket: str, path: str, file_data: bytes, content_type: str) -> str:
+        """
+        Upload a file to Supabase Storage asynchronously
+        """
+        try:
+            # Run in the thread pool so the event loop is not blocked
+            loop = asyncio.get_running_loop()
+            await loop.run_in_executor(
+                None, 
+                functools.partial(
+                    self.supabase.storage.from_(bucket).upload,
+                    path=path,
+                    file=file_data,
+                    file_options={"content-type": content_type, "upsert": "true"}
+                )
+            )
+            logger.info(f"Storage upload success: {path}")
+            return path
+        except Exception as e:
+            logger.error(f"Storage upload failed: {e}")
+            raise
+
+    async def get_signed_url(self, bucket: str, path: str, expires_in: int = 3600) -> str:
+        """Get a signed access URL asynchronously."""
+        try:
+            loop = asyncio.get_running_loop()
+            res = await loop.run_in_executor(
+                None,
+                lambda: self.supabase.storage.from_(bucket).create_signed_url(path, expires_in)
+            )
+
+            # Compatibility handling for the different response shapes
+            url = ""
+            if isinstance(res, dict) and "signedURL" in res:
+                url = res["signedURL"]
+            elif isinstance(res, str):
+                url = res
+            else:
+                logger.warning(f"Unexpected signed_url response: {res}")
+                url = res.get("signedURL", "") if isinstance(res, dict) else str(res)
+
+            # Convert to a publicly reachable URL
+            return self._convert_to_public_url(url)
+        except Exception as e:
+            logger.error(f"Get signed URL failed: {e}")
+            return ""
+
+    async def get_public_url(self, bucket: str, path: str) -> str:
+        """Get a public access URL."""
+        try:
+            loop = asyncio.get_running_loop()
+            res = await loop.run_in_executor(
+                None,
+                lambda: self.supabase.storage.from_(bucket).get_public_url(path)
+            )
+            # Convert to a publicly reachable URL
+            return self._convert_to_public_url(res)
+        except Exception as e:
+            logger.error(f"Get public URL failed: {e}")
+            return ""
+
+    async def delete_file(self, bucket: str, path: str):
+        """Delete a file asynchronously."""
+        try:
+            loop = asyncio.get_running_loop()
+            await loop.run_in_executor(
+                None,
+                lambda: self.supabase.storage.from_(bucket).remove([path])
+            )
+            logger.info(f"Deleted file: {bucket}/{path}")
+        except Exception as e:
+            logger.error(f"Delete file failed: {e}")
+            pass
+
+    async def list_files(self, bucket: str, path: str) -> List[Any]:
+        """List files asynchronously."""
+        try:
+            loop = asyncio.get_running_loop()
+            res = await loop.run_in_executor(
+                None,
+                lambda: self.supabase.storage.from_(bucket).list(path)
+            )
+            return res or []
+        except Exception as e:
+            logger.error(f"List files failed: {e}")
+            return []
+
+storage_service = StorageService()
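`_convert_to_public_url` above rewrites the internal Supabase base URL with `str.replace`, which substitutes every occurrence of the internal base wherever it appears in the string. A minimal sketch of a prefix-only variant is shown below. The function name `convert_to_public_url` and the two host names in the usage note are hypothetical; the committed code reads the bases from `settings.SUPABASE_URL` and `settings.SUPABASE_PUBLIC_URL`.

```python
def convert_to_public_url(url: str, internal_base: str, public_base: str) -> str:
    """Rewrite an internal URL so it points at the public host."""
    if not (internal_base and public_base):
        return url
    internal = internal_base.rstrip("/")
    public = public_base.rstrip("/")
    # Only rewrite when the URL actually starts with the internal base.
    if url.startswith(internal):
        return public + url[len(internal):]
    return url
```

For example, with an internal base of `http://kong:8000/` and a public base of `https://files.example.com`, the path portion of a signed URL is carried over unchanged while the host is swapped.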
--- a/backend/app/services/video_service.py
+++ b/backend/app/services/video_service.py
@@ -1,9 +1,10 @@
 """
 视频合成服务
 """
-import os
-import subprocess
-import json
+import os
+import subprocess
+import json
+import shlex
 from pathlib import Path
 from loguru import logger
 from typing import Optional
@@ -12,18 +13,18 @@ class VideoService:
    def __init__(self):
        pass

-    def _run_ffmpeg(self, cmd: list) -> bool:
-        cmd_str = ' '.join(f'"{c}"' if ' ' in c or '\\' in c else c for c in cmd)
-        logger.debug(f"FFmpeg CMD: {cmd_str}")
-        try:
-            # Synchronous call for BackgroundTasks compatibility
-            result = subprocess.run(
-                cmd_str,
-                shell=True,
-                capture_output=True,
-                text=True,
-                encoding='utf-8',
-            )
+    def _run_ffmpeg(self, cmd: list) -> bool:
+        cmd_str = ' '.join(shlex.quote(str(c)) for c in cmd)
+        logger.debug(f"FFmpeg CMD: {cmd_str}")
+        try:
+            # Synchronous call for BackgroundTasks compatibility
+            result = subprocess.run(
+                cmd,
+                shell=False,
+                capture_output=True,
+                text=True,
+                encoding='utf-8',
+            )
            if result.returncode != 0:
                logger.error(f"FFmpeg Error: {result.stderr}")
                return False
@@ -32,9 +33,9 @@ class VideoService:
            logger.error(f"FFmpeg Exception: {e}")
            return False

-    def _get_duration(self, file_path: str) -> float:
-        # Synchronous call for BackgroundTasks compatibility
-        cmd = f'ffprobe -v error -show_entries format=duration -of default=noprint_wrappers=1:nokey=1 "{file_path}"'
+    def _get_duration(self, file_path: str) -> float:
+        # Synchronous call for BackgroundTasks compatibility
+        cmd = f'ffprobe -v error -show_entries format=duration -of default=noprint_wrappers=1:nokey=1 "{file_path}"'
        try:
            result = subprocess.run(
                cmd,
@@ -44,7 +45,39 @@ class VideoService:
            )
            return float(result.stdout.strip())
        except Exception:
-            return 0.0
+            return 0.0
+
+    def mix_audio(
+        self,
+        voice_path: str,
+        bgm_path: str,
+        output_path: str,
+        bgm_volume: float = 0.2
+    ) -> str:
+        """Mix the voice track with background music."""
+        Path(output_path).parent.mkdir(parents=True, exist_ok=True)
+
+        volume = max(0.0, min(float(bgm_volume), 1.0))
+        filter_complex = (
+            f"[0:a]volume=1.0[a0];"
+            f"[1:a]volume={volume}[a1];"
+            f"[a0][a1]amix=inputs=2:duration=first:dropout_transition=2:normalize=0[aout]"
+        )
+
+        cmd = [
+            "ffmpeg", "-y",
+            "-i", voice_path,
+            "-stream_loop", "-1", "-i", bgm_path,
+            "-filter_complex", filter_complex,
+            "-map", "[aout]",
+            "-c:a", "pcm_s16le",
+            "-shortest",
+            output_path,
+        ]
+
+        if self._run_ffmpeg(cmd):
+            return output_path
+        raise RuntimeError("FFmpeg audio mix failed")

    async def compose(
        self,
@@ -82,8 +115,15 @@ class VideoService:
        # Previous state: subtitles disabled due to font issues
        # if subtitle_path: ...
        
-        # Audio map
-        cmd.extend(["-c:v", "libx264", "-c:a", "aac", "-shortest"])
+        # Audio map with high quality encoding
+        cmd.extend([
+            "-c:v", "libx264",
+            "-preset", "slow",      # slow preset for better compression efficiency
+            "-crf", "18",           # high quality (matches LatentSync)
+            "-c:a", "aac",
+            "-b:a", "192k",         # audio bitrate
+            "-shortest"
+        ])
        # Use audio from input 1
        cmd.extend(["-map", "0:v", "-map", "1:a"])
        
--- a/backend/app/services/voice_clone_service.py
+++ b/backend/app/services/voice_clone_service.py
@@ -0,0 +1,115 @@
+"""
+Voice cloning service
+Calls the standalone Qwen3-TTS service over HTTP (port 8009)
+"""
+import httpx
+import asyncio
+from pathlib import Path
+from typing import Optional
+from loguru import logger
+
+from app.core.config import settings
+
+# Qwen3-TTS service address
+QWEN_TTS_URL = "http://localhost:8009"
+
+
+class VoiceCloneService:
+    """Voice cloning service that calls the Qwen3-TTS HTTP API."""
+
+    def __init__(self):
+        self.base_url = QWEN_TTS_URL
+        # Cached health status
+        self._health_cache: Optional[dict] = None
+        self._health_cache_time: float = 0
+        # Lock that serializes GPU access (serial queue)
+        self._lock = asyncio.Lock()
+
+    async def generate_audio(
+        self,
+        text: str,
+        ref_audio_path: str,
+        ref_text: str,
+        output_path: str,
+        language: str = "Chinese"
+    ) -> str:
+        """
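+        Generate speech with voice cloning
+
+        Args:
+            text: Text to synthesize
+            ref_audio_path: Local path of the reference audio
+            ref_text: Transcript of the reference audio
+            output_path: Output wav path
+            language: Language (Chinese/English/Auto)
+
+        Returns:
+            Path of the output file
+        """
+        # Hold the lock so requests run serially and do not exhaust GPU memory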
+        async with self._lock:
+            logger.info(f"🎤 Voice Clone: {text[:30]}...")
+            Path(output_path).parent.mkdir(parents=True, exist_ok=True)
+
+            # Read the reference audio
+            with open(ref_audio_path, "rb") as f:
+                ref_audio_data = f.read()
+
+            # Call the Qwen3-TTS service
+            timeout = httpx.Timeout(300.0)  # 5-minute timeout
+            async with httpx.AsyncClient(timeout=timeout) as client:
+                try:
+                    response = await client.post(
+                        f"{self.base_url}/generate",
+                        files={"ref_audio": ("ref.wav", ref_audio_data, "audio/wav")},
+                        data={
+                            "text": text,
+                            "ref_text": ref_text,
+                            "language": language
+                        }
+                    )
+                    response.raise_for_status()
+
+                    # Save the returned audio
+                    with open(output_path, "wb") as f:
+                        f.write(response.content)
+
+                    logger.info(f"✅ Voice clone saved: {output_path}")
+                    return output_path
+
+                except httpx.HTTPStatusError as e:
+                    logger.error(f"Qwen3-TTS API error: {e.response.status_code} - {e.response.text}")
+                    raise RuntimeError(f"声音克隆服务错误: {e.response.text}")
+                except httpx.RequestError as e:
+                    logger.error(f"Qwen3-TTS connection error: {e}")
+                    raise RuntimeError("无法连接声音克隆服务，请检查服务是否启动")
+
+    async def check_health(self) -> dict:
+        """Health check."""
+        import time
+
+        # Cache the result for 5 minutes
+        now = time.time()
+        if self._health_cache and (now - self._health_cache_time) < 300:
+            return self._health_cache
+
+        try:
+            async with httpx.AsyncClient(timeout=5.0) as client:
+                response = await client.get(f"{self.base_url}/health")
+                response.raise_for_status()
+                self._health_cache = response.json()
+                self._health_cache_time = now
+                return self._health_cache
+        except Exception as e:
+            logger.warning(f"Qwen3-TTS health check failed: {e}")
+            return {
+                "service": "Qwen3-TTS Voice Clone",
+                "model": "0.6B-Base",
+                "ready": False,
+                "gpu_id": 0,
+                "error": str(e)
+            }
+
+
+# Singleton
+voice_clone_service = VoiceCloneService()
--- a/backend/app/services/whisper_service.py
+++ b/backend/app/services/whisper_service.py
@@ -0,0 +1,288 @@
+"""
+Subtitle alignment service
+Uses faster-whisper to produce character-level timestamps
+"""
+
+import json
+import re
+from pathlib import Path
+from typing import Optional, List
+from loguru import logger
+
+# Model cache
+_whisper_model = None
+
+# Punctuation that ends a line
+SENTENCE_PUNCTUATION = set('。！？，、；：,.!?;:')
+# Maximum characters per line
+MAX_CHARS_PER_LINE = 12
+
+
+def split_word_to_chars(word: str, start: float, end: float) -> list:
+    """
+    Split a word into single characters with linearly interpolated timestamps
+
+    Args:
+        word: Word text
+        start: Word start time
+        end: Word end time
+
+    Returns:
+        List of tokens, each holding word/start/end
+    """
+    tokens = []
+    ascii_buffer = ""
+
+    for char in word:
+        if not char.strip():
+            continue
+
+        if char.isascii() and char.isalnum():
+            ascii_buffer += char
+            continue
+
+        if ascii_buffer:
+            tokens.append(ascii_buffer)
+            ascii_buffer = ""
+
+        tokens.append(char)
+
+    if ascii_buffer:
+        tokens.append(ascii_buffer)
+
+    if not tokens:
+        return []
+
+    if len(tokens) == 1:
+        return [{"word": tokens[0], "start": start, "end": end}]
+
+    # Linearly interpolate the timestamps
+    duration = end - start
+    token_duration = duration / len(tokens)
+
+    result = []
+    for i, token in enumerate(tokens):
+        token_start = start + i * token_duration
+        token_end = start + (i + 1) * token_duration
+        result.append({
+            "word": token,
+            "start": round(token_start, 3),
+            "end": round(token_end, 3)
+        })
+
+    return result
+
+
+def split_segment_to_lines(words: List[dict], max_chars: int = MAX_CHARS_PER_LINE) -> List[dict]:
+    """
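+    Split a long segment into lines by punctuation and character count
+
+    Args:
+        words: Character list, each item holding word/start/end
+        max_chars: Maximum characters per line
+
+    Returns:
+        List of split segments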
+    """
+    if not words:
+        return []
+
+    segments = []
+    current_words = []
+    current_text = ""
+
+    for word_info in words:
+        char = word_info["word"]
+        current_words.append(word_info)
+        current_text += char
+
+        # Decide whether to break the line
+        should_break = False
+
+        # 1. Hit a line-breaking punctuation mark
+        if char in SENTENCE_PUNCTUATION:
+            should_break = True
+        # 2. Reached the maximum character count
+        elif len(current_text) >= max_chars:
+            should_break = True
+
+        if should_break and current_words:
+            segments.append({
+                "text": current_text,
+                "start": current_words[0]["start"],
+                "end": current_words[-1]["end"],
+                "words": current_words.copy()
+            })
+            current_words = []
+            current_text = ""
+
+    # Handle the remaining characters
+    if current_words:
+        segments.append({
+            "text": current_text,
+            "start": current_words[0]["start"],
+            "end": current_words[-1]["end"],
+            "words": current_words.copy()
+        })
+
+    return segments
+
+
+class WhisperService:
+    """Subtitle alignment service (based on faster-whisper)."""
+
+    def __init__(
+        self,
+        model_size: str = "large-v3",
+        device: str = "cuda",
+        compute_type: str = "float16",
+    ):
+        self.model_size = model_size
+        self.device = device
+        self.compute_type = compute_type
+
+    def _load_model(self):
+        """Lazily load the faster-whisper model."""
+        global _whisper_model
+
+        if _whisper_model is None:
+            from faster_whisper import WhisperModel
+
+            logger.info(f"Loading faster-whisper model: {self.model_size} on {self.device}")
+            _whisper_model = WhisperModel(
+                self.model_size,
+                device=self.device,
+                compute_type=self.compute_type
+            )
+            logger.info("faster-whisper model loaded")
+
+        return _whisper_model
+
+    async def align(
+        self,
+        audio_path: str,
+        text: str,
+        output_path: Optional[str] = None
+    ) -> dict:
+        """
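+        Transcribe the audio and produce character-level timestamps
+
+        Args:
+            audio_path: Audio file path
+            text: Original text (for reference only; the whisper transcription is what is actually used)
+            output_path: Optional path of the output JSON file
+
+        Returns:
+            Dict containing character-level timestamps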
+        """
+        import asyncio
+
+        def _do_transcribe():
+            model = self._load_model()
+
+            logger.info(f"Transcribing audio: {audio_path}")
+
+            # Transcribe and collect word-level timestamps
+            segments_iter, info = model.transcribe(
+                audio_path,
+                language="zh",
+                word_timestamps=True,  # enable word-level timestamps
+                vad_filter=True,  # enable VAD to filter out silence
+            )
+
+            logger.info(f"Detected language: {info.language} (prob: {info.language_probability:.2f})")
+
+            all_segments = []
+            for segment in segments_iter:
+                # Collect each word's timestamps and split the words into single characters
+                all_words = []
+                if segment.words:
+                    for word_info in segment.words:
+                        word_text = word_info.word.strip()
+                        if word_text:
+                            # Split the word into characters with linearly interpolated timestamps
+                            chars = split_word_to_chars(
+                                word_text,
+                                word_info.start,
+                                word_info.end
+                            )
+                            all_words.extend(chars)
+
+                # Split long segments into lines by punctuation and character count
+                if all_words:
+                    line_segments = split_segment_to_lines(all_words, MAX_CHARS_PER_LINE)
+                    all_segments.extend(line_segments)
+
+            logger.info(f"Generated {len(all_segments)} subtitle segments")
+            return {"segments": all_segments}
+
+        # Run in the thread pool
+        loop = asyncio.get_running_loop()
+        result = await loop.run_in_executor(None, _do_transcribe)
+
+        # Save to file
+        if output_path:
+            output_file = Path(output_path)
+            output_file.parent.mkdir(parents=True, exist_ok=True)
+            with open(output_file, "w", encoding="utf-8") as f:
+                json.dump(result, f, ensure_ascii=False, indent=2)
+            logger.info(f"Captions saved to: {output_path}")
+
+        return result
+
+    async def transcribe(self, audio_path: str) -> str:
+        """
+        Transcribe text only (used to extract the script)
+
+        Args:
+            audio_path: Audio/video file path
+
+        Returns:
+            Plain text content
+        """
+        import asyncio
+
+        def _do_transcribe_text():
+            model = self._load_model()
+            logger.info(f"Extracting script from: {audio_path}")
+
+            # Transcribe (no word-level timestamps needed)
+            segments_iter, _ = model.transcribe(
+                audio_path,
+                language="zh",
+                word_timestamps=False,
+                vad_filter=True,
+            )
+
+            text_parts = []
+            for segment in segments_iter:
+                text_parts.append(segment.text.strip())
+
+            full_text = " ".join(text_parts)
+            logger.info(f"Extracted text length: {len(full_text)}")
+            return full_text
+
+        # Run in the thread pool
+        loop = asyncio.get_running_loop()
+        result = await loop.run_in_executor(None, _do_transcribe_text)
+        return result
+
+    async def check_health(self) -> dict:
+        """Check the health of the service."""
+        try:
+            from faster_whisper import WhisperModel
+            return {
+                "ready": True,
+                "model_size": self.model_size,
+                "device": self.device,
+                "backend": "faster-whisper"
+            }
+        except ImportError:
+            return {
+                "ready": False,
+                "error": "faster-whisper not installed"
+            }
+
+
+# Global service instance
+whisper_service = WhisperService()
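The timestamp handling in `split_word_to_chars` above divides a word's time span evenly among its tokens. The core of that step can be sketched in isolation as follows. The helper name `interpolate_tokens` is an assumption for illustration, and the sketch leaves out the ASCII-run grouping that the committed function performs before interpolating.

```python
from typing import Dict, List, Union

Token = Dict[str, Union[str, float]]


def interpolate_tokens(tokens: List[str], start: float, end: float) -> List[Token]:
    """Spread the interval [start, end] evenly across the given tokens."""
    if not tokens:
        return []
    step = (end - start) / len(tokens)
    return [
        {
            "word": token,
            "start": round(start + i * step, 3),
            "end": round(start + (i + 1) * step, 3),
        }
        for i, token in enumerate(tokens)
    ]


# A two-character word spanning one second gets half a second per character.
timed = interpolate_tokens(["你", "好"], 0.0, 1.0)
```

Even spacing is an approximation: it assumes every character takes the same time to speak, which is usually close enough for subtitle highlighting within a single word.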
--- a/backend/assets/styles/subtitle.json
+++ b/backend/assets/styles/subtitle.json
@@ -0,0 +1,58 @@
+[
+  {
+    "id": "subtitle_classic_yellow",
+    "label": "经典黄字",
+    "font_file": "DingTalk JinBuTi.ttf",
+    "font_family": "DingTalkJinBuTi",
+    "font_size": 60,
+    "highlight_color": "#FFE600",
+    "normal_color": "#FFFFFF",
+    "stroke_color": "#000000",
+    "stroke_size": 3,
+    "letter_spacing": 2,
+    "bottom_margin": 80,
+    "is_default": true
+  },
+  {
+    "id": "subtitle_cyan",
+    "label": "清爽青蓝",
+    "font_file": "DingTalk Sans.ttf",
+    "font_family": "DingTalkSans",
+    "font_size": 48,
+    "highlight_color": "#00E5FF",
+    "normal_color": "#FFFFFF",
+    "stroke_color": "#000000",
+    "stroke_size": 3,
+    "letter_spacing": 1,
+    "bottom_margin": 76,
+    "is_default": false
+  },
+  {
+    "id": "subtitle_orange",
+    "label": "活力橙",
+    "font_file": "simhei.ttf",
+    "font_family": "SimHei",
+    "font_size": 50,
+    "highlight_color": "#FF8A00",
+    "normal_color": "#FFFFFF",
+    "stroke_color": "#000000",
+    "stroke_size": 3,
+    "letter_spacing": 2,
+    "bottom_margin": 80,
+    "is_default": false
+  },
+  {
+    "id": "subtitle_clean_white",
+    "label": "纯白轻描",
+    "font_file": "DingTalk JinBuTi.ttf",
+    "font_family": "DingTalkJinBuTi",
+    "font_size": 46,
+    "highlight_color": "#FFFFFF",
+    "normal_color": "#FFFFFF",
+    "stroke_color": "#111111",
+    "stroke_size": 2,
+    "letter_spacing": 1,
+    "bottom_margin": 72,
+    "is_default": false
+  }
+]
--- a/backend/assets/styles/title.json
+++ b/backend/assets/styles/title.json
@@ -0,0 +1,58 @@
+[
+  {
+    "id": "title_pop",
+    "label": "站酷快乐体",
+    "font_file": "title/站酷快乐体.ttf",
+    "font_family": "ZCoolHappy",
+    "font_size": 90,
+    "color": "#FFFFFF",
+    "stroke_color": "#000000",
+    "stroke_size": 8,
+    "letter_spacing": 5,
+    "top_margin": 62,
+    "font_weight": 900,
+    "is_default": true
+  },
+  {
+    "id": "title_bold_white",
+    "label": "黑体大标题",
+    "font_file": "title/思源黑体/SourceHanSansCN-Heavy思源黑体免费.otf",
+    "font_family": "SourceHanSansCN-Heavy",
+    "font_size": 72,
+    "color": "#FFFFFF",
+    "stroke_color": "#000000",
+    "stroke_size": 8,
+    "letter_spacing": 4,
+    "top_margin": 60,
+    "font_weight": 900,
+    "is_default": false
+  },
+  {
+    "id": "title_serif_gold",
+    "label": "宋体金色",
+    "font_file": "title/思源宋体/SourceHanSerifCN-SemiBold思源宋体免费.otf",
+    "font_family": "SourceHanSerifCN-SemiBold",
+    "font_size": 70,
+    "color": "#FDE68A",
+    "stroke_color": "#2B1B00",
+    "stroke_size": 8,
+    "letter_spacing": 3,
+    "top_margin": 58,
+    "font_weight": 800,
+    "is_default": false
+  },
+  {
+    "id": "title_douyin",
+    "label": "抖音活力",
+    "font_file": "title/抖音美好体开源.otf",
+    "font_family": "DouyinMeiHao",
+    "font_size": 72,
+    "color": "#FFFFFF",
+    "stroke_color": "#1F0A00",
+    "stroke_size": 8,
+    "letter_spacing": 4,
+    "top_margin": 60,
+    "font_weight": 900,
+    "is_default": false
+  }
+]
--- a/backend/database/migrate_to_phone.sql
+++ b/backend/database/migrate_to_phone.sql
@@ -0,0 +1,88 @@
+-- ============================================================
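+-- ViGent phone-number login migration script
+-- Changes the email column to a phone column
+--
+-- How to run (pick one):
+-- 1. Supabase Studio: open https://supabase.hbyrkj.top -> SQL Editor -> paste and run
+-- 2. Docker command: docker exec -i supabase-db psql -U postgres < migrate_to_phone.sql
+-- ============================================================
+
+-- WARNING: this script deletes the existing user data!
+-- Back up first if you need to keep the data
+
+-- 1. Drop the dependent tables (they have foreign key constraints)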
+DROP TABLE IF EXISTS user_sessions CASCADE;
+DROP TABLE IF EXISTS social_accounts CASCADE;
+
+-- 2. Drop the users table
+DROP TABLE IF EXISTS users CASCADE;
+
+-- 3. Recreate the users table (with a phone column)
+CREATE TABLE users (
+    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
+    phone TEXT UNIQUE NOT NULL,
+    password_hash TEXT NOT NULL,
+    username TEXT,
+    role TEXT DEFAULT 'pending' CHECK (role IN ('pending', 'user', 'admin')),
+    is_active BOOLEAN DEFAULT FALSE,
+    expires_at TIMESTAMP WITH TIME ZONE,
+    created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),
+    updated_at TIMESTAMP WITH TIME ZONE DEFAULT NOW()
+);
+
+-- 4. Recreate the user_sessions table
+CREATE TABLE user_sessions (
+    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
+    user_id UUID REFERENCES users(id) ON DELETE CASCADE UNIQUE,
+    session_token TEXT UNIQUE NOT NULL,
+    device_info TEXT,
+    created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW()
+);
+
+-- 5. Recreate the social_accounts table
+CREATE TABLE social_accounts (
+    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
+    user_id UUID REFERENCES users(id) ON DELETE CASCADE,
+    platform TEXT NOT NULL CHECK (platform IN ('bilibili', 'douyin', 'xiaohongshu')),
+    logged_in BOOLEAN DEFAULT FALSE,
+    created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),
+    UNIQUE(user_id, platform)
+);
+
+-- 6. Create indexes
+CREATE INDEX idx_users_phone ON users(phone);
+CREATE INDEX idx_sessions_user_id ON user_sessions(user_id);
+CREATE INDEX idx_social_user_platform ON social_accounts(user_id, platform);
+
+-- 7. Enable RLS
+ALTER TABLE users ENABLE ROW LEVEL SECURITY;
+ALTER TABLE user_sessions ENABLE ROW LEVEL SECURITY;
+ALTER TABLE social_accounts ENABLE ROW LEVEL SECURITY;
+
+-- 8. Create RLS policies
+CREATE POLICY "Users can view own profile" ON users
+    FOR SELECT USING (auth.uid()::text = id::text);
+
+CREATE POLICY "Users can access own sessions" ON user_sessions
+    FOR ALL USING (user_id::text = auth.uid()::text);
+
+CREATE POLICY "Users can access own social accounts" ON social_accounts
+    FOR ALL USING (user_id::text = auth.uid()::text);
+
+-- 9. Trigger that maintains updated_at
+CREATE OR REPLACE FUNCTION update_updated_at()
+RETURNS TRIGGER AS $$
+BEGIN
+    NEW.updated_at = NOW();
+    RETURN NEW;
+END;
+$$ LANGUAGE plpgsql;
+
+DROP TRIGGER IF EXISTS users_updated_at ON users;
+CREATE TRIGGER users_updated_at
+    BEFORE UPDATE ON users
+    FOR EACH ROW
+    EXECUTE FUNCTION update_updated_at();
+
+-- Done!
+-- The admin account is created automatically when the backend service restarts (15549380526)
--- a/backend/database/schema.sql
+++ b/backend/database/schema.sql
@@ -4,7 +4,7 @@
 -- 1. 创建 users 表
 CREATE TABLE IF NOT EXISTS users (
    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
-    email TEXT UNIQUE NOT NULL,
+    phone TEXT UNIQUE NOT NULL,
    password_hash TEXT NOT NULL,
    username TEXT,
    role TEXT DEFAULT 'pending' CHECK (role IN ('pending', 'user', 'admin')),
@@ -34,7 +34,7 @@ CREATE TABLE IF NOT EXISTS social_accounts (
 );

 -- 4. 创建索引
-CREATE INDEX IF NOT EXISTS idx_users_email ON users(email);
+CREATE INDEX IF NOT EXISTS idx_users_phone ON users(phone);
 CREATE INDEX IF NOT EXISTS idx_sessions_user_id ON user_sessions(user_id);
 CREATE INDEX IF NOT EXISTS idx_social_user_platform ON social_accounts(user_id, platform);

--- a/backend/requirements.txt
+++ b/backend/requirements.txt
@@ -28,3 +28,10 @@ supabase>=2.0.0
 python-jose[cryptography]>=3.3.0
 passlib[bcrypt]>=1.7.4
 bcrypt==4.0.1
+
+# Subtitle alignment
+faster-whisper>=1.0.0
+
+# Script extraction and AI generation
+yt-dlp>=2023.0.0
+zai-sdk>=0.2.0
--- a/backend/scripts/watchdog.py
+++ b/backend/scripts/watchdog.py
@@ -0,0 +1,84 @@
+
+import asyncio
+import httpx
+import logging
+import subprocess
+import time
+from datetime import datetime
+
+# Configure logging
+logging.basicConfig(
+    level=logging.INFO,
+    format='%(asctime)s - %(name)s - %(levelname)s - %(message)s',
+    handlers=[
+        logging.FileHandler("watchdog.log"),
+        logging.StreamHandler()
+    ]
+)
+logger = logging.getLogger("Watchdog")
+
+# Service configuration
+SERVICES = [
+    {
+        "name": "vigent2-qwen-tts",
+        "url": "http://localhost:8009/health",
+        "failures": 0,
+        "threshold": 3,
+        "timeout": 10.0,
+        "restart_cmd": ["pm2", "restart", "vigent2-qwen-tts"]
+    }
+]
+
+async def check_service(service):
+    """Check the health of a single service."""
+    try:
+        timeout = service.get("timeout", 10.0)
+        async with httpx.AsyncClient(timeout=timeout) as client:
+            response = await client.get(service["url"])
+            if response.status_code == 200:
+                # Success
+                if service["failures"] > 0:
+                    logger.info(f"✅ Service {service['name']} has recovered")
+                service["failures"] = 0
+                return True
+            else:
+                logger.warning(f"⚠️ Service {service['name']} returned status code {response.status_code}")
+    except Exception as e:
+        logger.warning(f"⚠️ Cannot reach service {service['name']}: {str(e)}")
+    
+    # Failure handling
+    service["failures"] += 1
+    logger.warning(f"❌ Service {service['name']} failed {service['failures']}/{service['threshold']} consecutive times")
+    
+    if service["failures"] >= service['threshold']:
+        logger.error(f"🚨 Service {service['name']} reached the failure threshold, restarting...")
+        try:
+            subprocess.run(service["restart_cmd"], check=True)
+            logger.info(f"♻️ Restart command sent for service {service['name']}")
+            # After a restart, allow a grace period (e.g. 60 seconds) without checks while the service starts
+            service["failures"] = 0 # reset the counter
+            return "restarting" 
+        except Exception as restart_error:
+            logger.error(f"💥 Failed to restart service {service['name']}: {restart_error}")
+            
+    return False
+
+async def main():
+    logger.info("🛡️ ViGent2 service watchdog started")
+    
+    while True:
+        # Check each service in turn
+        for service in SERVICES:
+            result = await check_service(service)
+            if result == "restarting":
+                # A service was restarted: wait an extra 60 seconds so it has time to start
+                await asyncio.sleep(60)
+                
+        # Check every 30 seconds
+        await asyncio.sleep(30)
+
+if __name__ == "__main__":
+    try:
+        asyncio.run(main())
+    except KeyboardInterrupt:
+        logger.info("🛑 看门狗已停止")
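
Reviewer note on the watchdog above: the loop in `main()` awaits each service one after another, and the post-restart grace period is only described in comments. The following is a minimal sketch, not part of this commit, of how both could be done. It assumes a hypothetical `skip_until` key on each service dict and a hypothetical `GRACE_SECONDS` constant, and it takes the probe as a parameter (for example `check_service`) so that it runs with the standard library alone.

```python
import asyncio
import time

# Assumed value, taken from the "e.g. 60 s" comment in check_service.
GRACE_SECONDS = 60.0


async def check_all(services, probe, now=time.monotonic):
    """Probe all services concurrently, skipping those in their grace period.

    `probe` is any coroutine function that takes a service dict and returns
    True, False, or "restarting" (the same contract as check_service).
    `skip_until` is a hypothetical key added by this sketch.
    """
    current = now()
    # Only services whose grace period has expired are probed this round.
    due = [s for s in services if s.get("skip_until", 0.0) <= current]

    # asyncio.gather runs the probes concurrently and preserves input order.
    results = await asyncio.gather(*(probe(s) for s in due))

    for service, result in zip(due, results):
        if result == "restarting":
            # Leave the freshly restarted service alone for a while.
            service["skip_until"] = current + GRACE_SECONDS

    return {s["name"]: r for s, r in zip(due, results)}
```

In `main()`, the `for` loop could then become `await check_all(SERVICES, check_service)`. Note that `subprocess.run` inside `check_service` still blocks the event loop while the restart command runs, so the probes are only concurrent up to that point.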
--- a/frontend/README.md
+++ b/frontend/README.md
@@ -1,72 +0,0 @@
-# ViGent2 Frontend
-
-ViGent2 的前端界面，采用 Next.js 14 + TailwindCSS 构建。
-
-## ✨ 核心功能
-
-### 1. 视频生成 (`/`)
- **素材管理**: 拖拽上传人物视频，实时预览。
- **文案配音**: 集成 EdgeTTS，支持多音色选择 (云溪 / 晓晓)。
- **进度追踪**: 实时显示视频生成进度 (10% -> 100%)。
- **结果预览**: 生成完成后直接播放下载。
-
-### 2. 全自动发布 (`/publish`) [Day 7 新增]
- **多平台管理**: 统一管理 B站、抖音、小红书账号状态。
- **扫码登录**: 
-  - 集成后端 Playwright 生成的 QR Code。
-  - 实时检测扫码状态 (Wait/Success)。
-  - Cookie 自动保存与状态同步。
- **发布配置**: 设置视频标题、标签、简介。
- **定时任务**: 支持 "立即发布" 或 "定时发布"。
-
-## 🛠️ 技术栈
-
- **框架**: Next.js 14 (App Router)
- **样式**: TailwindCSS
- **图标**: Lucide React
- **组件**: 自定义现代化组件 (Glassmorphism 风格)
- **API**: Fetch API (对接后端 FastAPI :8006)
-
-## 🚀 开发指南
-
-### 安装依赖
-
-```bash
-npm install
-```
-
-### 启动开发服务器
-
-默认运行在 **3002** 端口 (通过 `package.json` 配置):
-
-```bash
-npm run dev
-# 访问: http://localhost:3002
-```
-
-### 目录结构
-
-```
-src/
-├── app/
-│   ├── page.tsx           # 视频生成主页
-│   ├── publish/           # 发布管理页
-│   │   └── page.tsx
-│   └── layout.tsx         # 全局布局 (导航栏)
-├── components/            # UI 组件
-│   ├── VideoUploader.tsx  # 视频上传
-│   ├── StatusBadge.tsx    # 状态徽章
-│   └── ...
-└── lib/                   # 工具函数
-```
-
-## 🔌 后端对接
-
- **Base URL**: `http://localhost:8006`
- **代理配置**: Next.js Rewrites (如需) 或直接 CORS。
-
-## 🎨 设计规范
-
- **主色调**: 深紫/黑色系 (Dark Mode)
- **交互**: 悬停微动画 (Hover Effects)
- **响应式**: 适配桌面端大屏操作
--- a/frontend/next.config.ts
+++ b/frontend/next.config.ts
@@ -16,6 +16,10 @@ const nextConfig: NextConfig = {
        source: '/outputs/:path*',
        destination: 'http://localhost:8006/outputs/:path*',  // 转发生成的视频
      },
+      {
+        source: '/assets/:path*',
+        destination: 'http://localhost:8006/assets/:path*',  // Proxy static assets (fonts / music)
+      },
    ];
  },
 };
--- a/frontend/package-lock.json
+++ b/frontend/package-lock.json
@@ -8,6 +8,9 @@
      "name": "frontend",
      "version": "0.1.0",
      "dependencies": {
+        "@supabase/supabase-js": "^2.93.1",
+        "axios": "^1.13.4",
+        "lucide-react": "^0.563.0",
        "next": "16.1.1",
        "react": "19.2.3",
        "react-dom": "19.2.3",
@@ -68,7 +71,6 @@
      "integrity": "sha512-H3mcG6ZDLTlYfaSNi0iOKkigqMFvkTKlGUYlD8GW7nNOYRrevuA46iTypPyv+06V3fEmvvazfntkBU34L0azAw==",
      "dev": true,
      "license": "MIT",
-      "peer": true,
      "dependencies": {
        "@babel/code-frame": "^7.28.6",
        "@babel/generator": "^7.28.6",
@@ -1235,6 +1237,80 @@
      "dev": true,
      "license": "MIT"
    },
+    "node_modules/@supabase/auth-js": {
+      "version": "2.93.1",
+      "resolved": "https://registry.npmjs.org/@supabase/auth-js/-/auth-js-2.93.1.tgz",
+      "integrity": "sha512-pC0Ek4xk4z6q7A/3+UuZ/eYgfFUUQTg3DhapzrAgJnFGDJDFDyGCj6v9nIz8+3jfLqSZ3QKGe6AoEodYjShghg==",
+      "dependencies": {
+        "tslib": "2.8.1"
+      },
+      "engines": {
+        "node": ">=20.0.0"
+      }
+    },
+    "node_modules/@supabase/functions-js": {
+      "version": "2.93.1",
+      "resolved": "https://registry.npmjs.org/@supabase/functions-js/-/functions-js-2.93.1.tgz",
+      "integrity": "sha512-Ott2IcIXHGupaC0nX9WNEiJAX4OdlGRu9upkkURaQHbaLdz9JuCcHxlwTERgtgjMpikbIWHfMM1M9QTQFYABiA==",
+      "dependencies": {
+        "tslib": "2.8.1"
+      },
+      "engines": {
+        "node": ">=20.0.0"
+      }
+    },
+    "node_modules/@supabase/postgrest-js": {
+      "version": "2.93.1",
+      "resolved": "https://registry.npmjs.org/@supabase/postgrest-js/-/postgrest-js-2.93.1.tgz",
+      "integrity": "sha512-uRKKQJBDnfi6XFNFPNMh9+u3HT2PCgp065PcMPmG7e0xGuqvLtN89QxO2/SZcGbw2y1+mNBz0yUs5KmyNqF2fA==",
+      "dependencies": {
+        "tslib": "2.8.1"
+      },
+      "engines": {
+        "node": ">=20.0.0"
+      }
+    },
+    "node_modules/@supabase/realtime-js": {
+      "version": "2.93.1",
+      "resolved": "https://registry.npmjs.org/@supabase/realtime-js/-/realtime-js-2.93.1.tgz",
+      "integrity": "sha512-2WaP/KVHPlQDjWM6qe4wOZz6zSRGaXw1lfXf4thbfvk3C3zPPKqXRyspyYnk3IhphyxSsJ2hQ/cXNOz48008tg==",
+      "dependencies": {
+        "@types/phoenix": "^1.6.6",
+        "@types/ws": "^8.18.1",
+        "tslib": "2.8.1",
+        "ws": "^8.18.2"
+      },
+      "engines": {
+        "node": ">=20.0.0"
+      }
+    },
+    "node_modules/@supabase/storage-js": {
+      "version": "2.93.1",
+      "resolved": "https://registry.npmjs.org/@supabase/storage-js/-/storage-js-2.93.1.tgz",
+      "integrity": "sha512-3KVwd4S1i1BVPL6KIywe5rnruNQXSkLyvrdiJmwnqwbCcDujQumARdGWBPesqCjOPKEU2M9ORWKAsn+2iLzquA==",
+      "dependencies": {
+        "iceberg-js": "^0.8.1",
+        "tslib": "2.8.1"
+      },
+      "engines": {
+        "node": ">=20.0.0"
+      }
+    },
+    "node_modules/@supabase/supabase-js": {
+      "version": "2.93.1",
+      "resolved": "https://registry.npmjs.org/@supabase/supabase-js/-/supabase-js-2.93.1.tgz",
+      "integrity": "sha512-FJTgS5s0xEgRQ3u7gMuzGObwf3jA4O5Ki/DgCDXx94w1pihLM4/WG3XFa4BaCJYfuzLxLcv6zPPA5tDvBUjAUg==",
+      "dependencies": {
+        "@supabase/auth-js": "2.93.1",
+        "@supabase/functions-js": "2.93.1",
+        "@supabase/postgrest-js": "2.93.1",
+        "@supabase/realtime-js": "2.93.1",
+        "@supabase/storage-js": "2.93.1"
+      },
+      "engines": {
+        "node": ">=20.0.0"
+      }
+    },
    "node_modules/@swc/helpers": {
      "version": "0.5.15",
      "resolved": "https://registry.npmjs.org/@swc/helpers/-/helpers-0.5.15.tgz",
@@ -1551,19 +1627,22 @@
      "version": "20.19.28",
      "resolved": "https://registry.npmjs.org/@types/node/-/node-20.19.28.tgz",
      "integrity": "sha512-VyKBr25BuFDzBFCK5sUM6ZXiWfqgCTwTAOK8qzGV/m9FCirXYDlmczJ+d5dXBAQALGCdRRdbteKYfJ84NGEusw==",
-      "dev": true,
      "license": "MIT",
      "dependencies": {
        "undici-types": "~6.21.0"
      }
    },
+    "node_modules/@types/phoenix": {
+      "version": "1.6.7",
+      "resolved": "https://registry.npmjs.org/@types/phoenix/-/phoenix-1.6.7.tgz",
+      "integrity": "sha512-oN9ive//QSBkf19rfDv45M7eZPi0eEXylht2OLEXicu5b4KoQ1OzXIw+xDSGWxSxe1JmepRR/ZH283vsu518/Q=="
+    },
    "node_modules/@types/react": {
      "version": "19.2.8",
      "resolved": "https://registry.npmjs.org/@types/react/-/react-19.2.8.tgz",
      "integrity": "sha512-3MbSL37jEchWZz2p2mjntRZtPt837ij10ApxKfgmXCTuHWagYg7iA5bqPw6C8BMPfwidlvfPI/fxOc42HLhcyg==",
      "dev": true,
      "license": "MIT",
-      "peer": true,
      "dependencies": {
        "csstype": "^3.2.2"
      }
@@ -1578,6 +1657,14 @@
        "@types/react": "^19.2.0"
      }
    },
+    "node_modules/@types/ws": {
+      "version": "8.18.1",
+      "resolved": "https://registry.npmjs.org/@types/ws/-/ws-8.18.1.tgz",
+      "integrity": "sha512-ThVF6DCVhA8kUGy+aazFQ4kXQ7E1Ty7A3ypFOe0IcJV8O/M511G99AW24irKrW56Wt44yG9+ij8FaqoBGkuBXg==",
+      "dependencies": {
+        "@types/node": "*"
+      }
+    },
    "node_modules/@typescript-eslint/eslint-plugin": {
      "version": "8.53.0",
      "resolved": "https://registry.npmjs.org/@typescript-eslint/eslint-plugin/-/eslint-plugin-8.53.0.tgz",
@@ -1623,7 +1710,6 @@
      "integrity": "sha512-npiaib8XzbjtzS2N4HlqPvlpxpmZ14FjSJrteZpPxGUaYPlvhzlzUZ4mZyABo0EFrOWnvyd0Xxroq//hKhtAWg==",
      "dev": true,
      "license": "MIT",
-      "peer": true,
      "dependencies": {
        "@typescript-eslint/scope-manager": "8.53.0",
        "@typescript-eslint/types": "8.53.0",
@@ -2123,7 +2209,6 @@
      "integrity": "sha512-NZyJarBfL7nWwIq+FDL6Zp/yHEhePMNnnJ0y3qfieCrmNvYct8uvtiV41UvlSe6apAfk0fY1FbWx+NwfmpvtTg==",
      "dev": true,
      "license": "MIT",
-      "peer": true,
      "bin": {
        "acorn": "bin/acorn"
      },
@@ -2368,6 +2453,12 @@
        "node": ">= 0.4"
      }
    },
+    "node_modules/asynckit": {
+      "version": "0.4.0",
+      "resolved": "https://registry.npmjs.org/asynckit/-/asynckit-0.4.0.tgz",
+      "integrity": "sha512-Oei9OH4tRh0YqU3GxhX79dM/mwVgvbZJaSNaRk+bshkj0S5cfHcgYakreBjrHwatXKbz+IoIdYLxrKim2MjW0Q==",
+      "license": "MIT"
+    },
    "node_modules/available-typed-arrays": {
      "version": "1.0.7",
      "resolved": "https://registry.npmjs.org/available-typed-arrays/-/available-typed-arrays-1.0.7.tgz",
@@ -2394,6 +2485,17 @@
        "node": ">=4"
      }
    },
+    "node_modules/axios": {
+      "version": "1.13.4",
+      "resolved": "https://registry.npmjs.org/axios/-/axios-1.13.4.tgz",
+      "integrity": "sha512-1wVkUaAO6WyaYtCkcYCOx12ZgpGf9Zif+qXa4n+oYzK558YryKqiL6UWwd5DqiH3VRW0GYhTZQ/vlgJrCoNQlg==",
+      "license": "MIT",
+      "dependencies": {
+        "follow-redirects": "^1.15.6",
+        "form-data": "^4.0.4",
+        "proxy-from-env": "^1.1.0"
+      }
+    },
    "node_modules/axobject-query": {
      "version": "4.1.0",
      "resolved": "https://registry.npmjs.org/axobject-query/-/axobject-query-4.1.0.tgz",
@@ -2464,7 +2566,6 @@
        }
      ],
      "license": "MIT",
-      "peer": true,
      "dependencies": {
        "baseline-browser-mapping": "^2.9.0",
        "caniuse-lite": "^1.0.30001759",
@@ -2502,7 +2603,6 @@
      "version": "1.0.2",
      "resolved": "https://registry.npmjs.org/call-bind-apply-helpers/-/call-bind-apply-helpers-1.0.2.tgz",
      "integrity": "sha512-Sp1ablJ0ivDkSzjcaJdxEunN5/XvksFJ2sMBFfq6x0ryhQV/2b/KwFe21cMpmHtPOSij8K99/wSfoEuTObmuMQ==",
-      "dev": true,
      "license": "MIT",
      "dependencies": {
        "es-errors": "^1.3.0",
@@ -2602,6 +2702,18 @@
      "dev": true,
      "license": "MIT"
    },
+    "node_modules/combined-stream": {
+      "version": "1.0.8",
+      "resolved": "https://registry.npmjs.org/combined-stream/-/combined-stream-1.0.8.tgz",
+      "integrity": "sha512-FQN4MRfuJeHf7cBbBMJFXhKSDq+2kAArBlmRBvcvFE5BB1HZKXtSFASDhdlz9zOYwxh8lDdnvmMOe/+5cdoEdg==",
+      "license": "MIT",
+      "dependencies": {
+        "delayed-stream": "~1.0.0"
+      },
+      "engines": {
+        "node": ">= 0.8"
+      }
+    },
    "node_modules/concat-map": {
      "version": "0.0.1",
      "resolved": "https://registry.npmjs.org/concat-map/-/concat-map-0.0.1.tgz",
@@ -2760,6 +2872,15 @@
        "url": "https://github.com/sponsors/ljharb"
      }
    },
+    "node_modules/delayed-stream": {
+      "version": "1.0.0",
+      "resolved": "https://registry.npmjs.org/delayed-stream/-/delayed-stream-1.0.0.tgz",
+      "integrity": "sha512-ZySD7Nf91aLB0RxL4KGrKHBXl7Eds1DAmEdcoVawXnLD7SDhpNgtuII2aAkg7a7QS41jxPSZ17p4VdGnMHk3MQ==",
+      "license": "MIT",
+      "engines": {
+        "node": ">=0.4.0"
+      }
+    },
    "node_modules/dequal": {
      "version": "2.0.3",
      "resolved": "https://registry.npmjs.org/dequal/-/dequal-2.0.3.tgz",
@@ -2796,7 +2917,6 @@
      "version": "1.0.1",
      "resolved": "https://registry.npmjs.org/dunder-proto/-/dunder-proto-1.0.1.tgz",
      "integrity": "sha512-KIN/nDJBQRcXw0MLVhZE9iQHmG68qAVIBg9CqmUYjmQIhgij9U5MFvrqkUL5FbtyyzZuOeOt0zdeRe4UY7ct+A==",
-      "dev": true,
      "license": "MIT",
      "dependencies": {
        "call-bind-apply-helpers": "^1.0.1",
@@ -2908,7 +3028,6 @@
      "version": "1.0.1",
      "resolved": "https://registry.npmjs.org/es-define-property/-/es-define-property-1.0.1.tgz",
      "integrity": "sha512-e3nRfgfUZ4rNGL232gUgX06QNyyez04KdjFrF+LTRoOXmrOgFKDg4BCdsjW8EnT69eqdYGmRpJwiPVYNrCaW3g==",
-      "dev": true,
      "license": "MIT",
      "engines": {
        "node": ">= 0.4"
@@ -2918,7 +3037,6 @@
      "version": "1.3.0",
      "resolved": "https://registry.npmjs.org/es-errors/-/es-errors-1.3.0.tgz",
      "integrity": "sha512-Zf5H2Kxt2xjTvbJvP2ZWLEICxA6j+hAmMzIlypy4xcBg1vKVnx89Wy0GbS+kf5cwCVFFzdCFh2XSCFNULS6csw==",
-      "dev": true,
      "license": "MIT",
      "engines": {
        "node": ">= 0.4"
@@ -2956,7 +3074,6 @@
      "version": "1.1.1",
      "resolved": "https://registry.npmjs.org/es-object-atoms/-/es-object-atoms-1.1.1.tgz",
      "integrity": "sha512-FGgH2h8zKNim9ljj7dankFPcICIK9Cp5bm+c2gQSYePhpaG5+esrLODihIorn+Pe6FGJzWhXQotPv73jTaldXA==",
-      "dev": true,
      "license": "MIT",
      "dependencies": {
        "es-errors": "^1.3.0"
@@ -2969,7 +3086,6 @@
      "version": "2.1.0",
      "resolved": "https://registry.npmjs.org/es-set-tostringtag/-/es-set-tostringtag-2.1.0.tgz",
      "integrity": "sha512-j6vWzfrGVfyXxge+O0x5sh6cvxAog0a/4Rdd2K36zCMV5eJ+/+tOAngRO8cODMNWbVRdVlmGZQL2YS3yR8bIUA==",
-      "dev": true,
      "license": "MIT",
      "dependencies": {
        "es-errors": "^1.3.0",
@@ -3041,7 +3157,6 @@
      "integrity": "sha512-LEyamqS7W5HB3ujJyvi0HQK/dtVINZvd5mAAp9eT5S/ujByGjiZLCzPcHVzuXbpJDJF/cxwHlfceVUDZ2lnSTw==",
      "dev": true,
      "license": "MIT",
-      "peer": true,
      "dependencies": {
        "@eslint-community/eslint-utils": "^4.8.0",
        "@eslint-community/regexpp": "^4.12.1",
@@ -3227,7 +3342,6 @@
      "integrity": "sha512-whOE1HFo/qJDyX4SnXzP4N6zOWn79WhnCUY/iDR0mPfQZO8wcYE4JClzI2oZrhBnnMUCBCHZhO6VQyoBU95mZA==",
      "dev": true,
      "license": "MIT",
-      "peer": true,
      "dependencies": {
        "@rtsao/scc": "^1.1.0",
        "array-includes": "^3.1.9",
@@ -3586,6 +3700,26 @@
      "dev": true,
      "license": "ISC"
    },
+    "node_modules/follow-redirects": {
+      "version": "1.15.11",
+      "resolved": "https://registry.npmjs.org/follow-redirects/-/follow-redirects-1.15.11.tgz",
+      "integrity": "sha512-deG2P0JfjrTxl50XGCDyfI97ZGVCxIpfKYmfyrQ54n5FO/0gfIES8C/Psl6kWVDolizcaaxZJnTS0QSMxvnsBQ==",
+      "funding": [
+        {
+          "type": "individual",
+          "url": "https://github.com/sponsors/RubenVerborgh"
+        }
+      ],
+      "license": "MIT",
+      "engines": {
+        "node": ">=4.0"
+      },
+      "peerDependenciesMeta": {
+        "debug": {
+          "optional": true
+        }
+      }
+    },
    "node_modules/for-each": {
      "version": "0.3.5",
      "resolved": "https://registry.npmjs.org/for-each/-/for-each-0.3.5.tgz",
@@ -3602,11 +3736,26 @@
        "url": "https://github.com/sponsors/ljharb"
      }
    },
+    "node_modules/form-data": {
+      "version": "4.0.5",
+      "resolved": "https://registry.npmjs.org/form-data/-/form-data-4.0.5.tgz",
+      "integrity": "sha512-8RipRLol37bNs2bhoV67fiTEvdTrbMUYcFTiy3+wuuOnUog2QBHCZWXDRijWQfAkhBj2Uf5UnVaiWwA5vdd82w==",
+      "license": "MIT",
+      "dependencies": {
+        "asynckit": "^0.4.0",
+        "combined-stream": "^1.0.8",
+        "es-set-tostringtag": "^2.1.0",
+        "hasown": "^2.0.2",
+        "mime-types": "^2.1.12"
+      },
+      "engines": {
+        "node": ">= 6"
+      }
+    },
    "node_modules/function-bind": {
      "version": "1.1.2",
      "resolved": "https://registry.npmjs.org/function-bind/-/function-bind-1.1.2.tgz",
      "integrity": "sha512-7XHNxH7qX9xG5mIwxkhumTox/MIRNcOgDrxWsMt2pAr23WHp6MrRlN7FBSFpCpr+oVO0F744iUgR82nJMfG2SA==",
-      "dev": true,
      "license": "MIT",
      "funding": {
        "url": "https://github.com/sponsors/ljharb"
@@ -3667,7 +3816,6 @@
      "version": "1.3.0",
      "resolved": "https://registry.npmjs.org/get-intrinsic/-/get-intrinsic-1.3.0.tgz",
      "integrity": "sha512-9fSjSaos/fRIVIp+xSJlE6lfwhES7LNtKaCBIamHsjr2na1BiABJPo0mOjjz8GJDURarmCPGqaiVg5mfjb98CQ==",
-      "dev": true,
      "license": "MIT",
      "dependencies": {
        "call-bind-apply-helpers": "^1.0.2",
@@ -3692,7 +3840,6 @@
      "version": "1.0.1",
      "resolved": "https://registry.npmjs.org/get-proto/-/get-proto-1.0.1.tgz",
      "integrity": "sha512-sTSfBjoXBp89JvIKIefqw7U2CCebsc74kiY6awiGogKtoSGbgjYE/G/+l9sF3MWFPNc9IcoOC4ODfKHfxFmp0g==",
-      "dev": true,
      "license": "MIT",
      "dependencies": {
        "dunder-proto": "^1.0.1",
@@ -3780,7 +3927,6 @@
      "version": "1.2.0",
      "resolved": "https://registry.npmjs.org/gopd/-/gopd-1.2.0.tgz",
      "integrity": "sha512-ZUKRh6/kUFoAiTAtTYPZJ3hw9wNxx+BIBOijnlG9PnrJsCcSjs1wyyD6vJpaYtgnzDrKYRSqf3OO6Rfa93xsRg==",
-      "dev": true,
      "license": "MIT",
      "engines": {
        "node": ">= 0.4"
@@ -3852,7 +3998,6 @@
      "version": "1.1.0",
      "resolved": "https://registry.npmjs.org/has-symbols/-/has-symbols-1.1.0.tgz",
      "integrity": "sha512-1cDNdwJ2Jaohmb3sg4OmKaMBwuC48sYni5HUw2DvsC8LjGTLK9h+eb1X6RyuOHe4hT0ULCW68iomhjUoKUqlPQ==",
-      "dev": true,
      "license": "MIT",
      "engines": {
        "node": ">= 0.4"
@@ -3865,7 +4010,6 @@
      "version": "1.0.2",
      "resolved": "https://registry.npmjs.org/has-tostringtag/-/has-tostringtag-1.0.2.tgz",
      "integrity": "sha512-NqADB8VjPFLM2V0VvHUewwwsw0ZWBaIdgo+ieHtK3hasLz4qeCRjYcqfB6AQrBggRKppKF8L52/VqdVsO47Dlw==",
-      "dev": true,
      "license": "MIT",
      "dependencies": {
        "has-symbols": "^1.0.3"
@@ -3881,7 +4025,6 @@
      "version": "2.0.2",
      "resolved": "https://registry.npmjs.org/hasown/-/hasown-2.0.2.tgz",
      "integrity": "sha512-0hJU9SCPvmMzIBdZFqNPXWa6dqh7WdH0cII9y+CyS8rG3nL48Bclra9HmKhVVUHyPWNH5Y7xDwAB7bfgSjkUMQ==",
-      "dev": true,
      "license": "MIT",
      "dependencies": {
        "function-bind": "^1.1.2"
@@ -3907,6 +4050,14 @@
        "hermes-estree": "0.25.1"
      }
    },
+    "node_modules/iceberg-js": {
+      "version": "0.8.1",
+      "resolved": "https://registry.npmjs.org/iceberg-js/-/iceberg-js-0.8.1.tgz",
+      "integrity": "sha512-1dhVQZXhcHje7798IVM+xoo/1ZdVfzOMIc8/rgVSijRK38EDqOJoGula9N/8ZI5RD8QTxNQtK/Gozpr+qUqRRA==",
+      "engines": {
+        "node": ">=20.0.0"
+      }
+    },
    "node_modules/ignore": {
      "version": "5.3.2",
      "resolved": "https://registry.npmjs.org/ignore/-/ignore-5.3.2.tgz",
@@ -4850,6 +5001,15 @@
        "yallist": "^3.0.2"
      }
    },
+    "node_modules/lucide-react": {
+      "version": "0.563.0",
+      "resolved": "https://registry.npmjs.org/lucide-react/-/lucide-react-0.563.0.tgz",
+      "integrity": "sha512-8dXPB2GI4dI8jV4MgUDGBeLdGk8ekfqVZ0BdLcrRzocGgG75ltNEmWS+gE7uokKF/0oSUuczNDT+g9hFJ23FkA==",
+      "license": "ISC",
+      "peerDependencies": {
+        "react": "^16.5.1 || ^17.0.0 || ^18.0.0 || ^19.0.0"
+      }
+    },
    "node_modules/magic-string": {
      "version": "0.30.21",
      "resolved": "https://registry.npmjs.org/magic-string/-/magic-string-0.30.21.tgz",
@@ -4864,7 +5024,6 @@
      "version": "1.1.0",
      "resolved": "https://registry.npmjs.org/math-intrinsics/-/math-intrinsics-1.1.0.tgz",
      "integrity": "sha512-/IXtbwEk5HTPyEwyKX6hGkYXxM9nbj64B+ilVJnC/R6B0pH5G4V3b0pVbL7DBj4tkhBAppbQUlf6F6Xl9LHu1g==",
-      "dev": true,
      "license": "MIT",
      "engines": {
        "node": ">= 0.4"
@@ -4894,6 +5053,27 @@
        "node": ">=8.6"
      }
    },
+    "node_modules/mime-db": {
+      "version": "1.52.0",
+      "resolved": "https://registry.npmjs.org/mime-db/-/mime-db-1.52.0.tgz",
+      "integrity": "sha512-sPU4uV7dYlvtWJxwwxHD0PuihVNiE7TyAbQ5SWxDCB9mUYvOgroQOwYQQOKPJ8CIbE+1ETVlOoK1UC2nU3gYvg==",
+      "license": "MIT",
+      "engines": {
+        "node": ">= 0.6"
+      }
+    },
+    "node_modules/mime-types": {
+      "version": "2.1.35",
+      "resolved": "https://registry.npmjs.org/mime-types/-/mime-types-2.1.35.tgz",
+      "integrity": "sha512-ZDY+bPm5zTTF+YpCrAU9nK0UgICYPT0QtT1NZWFv4s++TNkcgVaT0g6+4R2uI4MjQjzysHB1zxuWL50hzaeXiw==",
+      "license": "MIT",
+      "dependencies": {
+        "mime-db": "1.52.0"
+      },
+      "engines": {
+        "node": ">= 0.6"
+      }
+    },
    "node_modules/minimatch": {
      "version": "3.1.2",
      "resolved": "https://registry.npmjs.org/minimatch/-/minimatch-3.1.2.tgz",
@@ -5364,6 +5544,12 @@
        "react-is": "^16.13.1"
      }
    },
+    "node_modules/proxy-from-env": {
+      "version": "1.1.0",
+      "resolved": "https://registry.npmjs.org/proxy-from-env/-/proxy-from-env-1.1.0.tgz",
+      "integrity": "sha512-D+zkORCbA9f1tdWRK0RaCR3GPv50cMxcrz4X8k5LTSUD1Dkw47mKJEZQNunItRTkWwgtaUSo1RVFRIG9ZXiFYg==",
+      "license": "MIT"
+    },
    "node_modules/punycode": {
      "version": "2.3.1",
      "resolved": "https://registry.npmjs.org/punycode/-/punycode-2.3.1.tgz",
@@ -5400,7 +5586,6 @@
      "resolved": "https://registry.npmjs.org/react/-/react-19.2.3.tgz",
      "integrity": "sha512-Ku/hhYbVjOQnXDZFv2+RibmLFGwFdeeKHFcOTlrt7xplBnya5OGn/hIRDsqDiSUcfORsDC7MPxwork8jBwsIWA==",
      "license": "MIT",
-      "peer": true,
      "engines": {
        "node": ">=0.10.0"
      }
@@ -5410,7 +5595,6 @@
      "resolved": "https://registry.npmjs.org/react-dom/-/react-dom-19.2.3.tgz",
      "integrity": "sha512-yELu4WmLPw5Mr/lmeEpox5rw3RETacE++JgHqQzd2dg+YbJuat3jH4ingc+WPZhxaoFzdv9y33G+F7Nl5O0GBg==",
      "license": "MIT",
-      "peer": true,
      "dependencies": {
        "scheduler": "^0.27.0"
      },
@@ -6112,7 +6296,6 @@
      "integrity": "sha512-5gTmgEY/sqK6gFXLIsQNH19lWb4ebPDLA4SdLP7dsWkIXHWlG66oPuVvXSGFPppYZz8ZDZq0dYYrbHfBCVUb1Q==",
      "dev": true,
      "license": "MIT",
-      "peer": true,
      "engines": {
        "node": ">=12"
      },
@@ -6275,7 +6458,6 @@
      "integrity": "sha512-jl1vZzPDinLr9eUt3J/t7V6FgNEw9QjvBPdysz9KfQDD41fQrC2Y4vKQdiaUpFT4bXlb1RHhLpp8wtm6M5TgSw==",
      "dev": true,
      "license": "Apache-2.0",
-      "peer": true,
      "bin": {
        "tsc": "bin/tsc",
        "tsserver": "bin/tsserver"
@@ -6331,7 +6513,6 @@
      "version": "6.21.0",
      "resolved": "https://registry.npmjs.org/undici-types/-/undici-types-6.21.0.tgz",
      "integrity": "sha512-iwDZqg0QAGrg9Rav5H4n0M64c3mkR59cJ6wQp+7C4nI0gsmExaedaYLNO44eT4AtBBwjbTiGPMlt2Md0T9H9JQ==",
-      "dev": true,
      "license": "MIT"
    },
    "node_modules/unrs-resolver": {
@@ -6534,6 +6715,26 @@
        "node": ">=0.10.0"
      }
    },
+    "node_modules/ws": {
+      "version": "8.19.0",
+      "resolved": "https://registry.npmjs.org/ws/-/ws-8.19.0.tgz",
+      "integrity": "sha512-blAT2mjOEIi0ZzruJfIhb3nps74PRWTCz1IjglWEEpQl5XS/UNama6u2/rjFkDDouqr4L67ry+1aGIALViWjDg==",
+      "engines": {
+        "node": ">=10.0.0"
+      },
+      "peerDependencies": {
+        "bufferutil": "^4.0.1",
+        "utf-8-validate": ">=5.0.2"
+      },
+      "peerDependenciesMeta": {
+        "bufferutil": {
+          "optional": true
+        },
+        "utf-8-validate": {
+          "optional": true
+        }
+      }
+    },
    "node_modules/yallist": {
      "version": "3.1.1",
      "resolved": "https://registry.npmjs.org/yallist/-/yallist-3.1.1.tgz",
@@ -6560,7 +6761,6 @@
      "integrity": "sha512-k7Nwx6vuWx1IJ9Bjuf4Zt1PEllcwe7cls3VNzm4CQ1/hgtFUK2bRNG3rvnpPUhFjmqJKAKtjV576KnUkHocg/g==",
      "dev": true,
      "license": "MIT",
-      "peer": true,
      "funding": {
        "url": "https://github.com/sponsors/colinhacks"
      }
--- a/frontend/package.json
+++ b/frontend/package.json
@@ -9,6 +9,9 @@
    "lint": "eslint"
  },
  "dependencies": {
+    "@supabase/supabase-js": "^2.93.1",
+    "axios": "^1.13.4",
+    "lucide-react": "^0.563.0",
    "next": "16.1.1",
    "react": "19.2.3",
    "react-dom": "19.2.3",
--- a/frontend/src/app/admin/page.tsx
+++ b/frontend/src/app/admin/page.tsx
@@ -2,15 +2,12 @@

 import { useState, useEffect } from 'react';
 import { useRouter } from 'next/navigation';
-import { getCurrentUser, User } from '@/lib/auth';
-
-const API_BASE = typeof window === 'undefined'
-    ? (process.env.NEXT_PUBLIC_API_URL || 'http://localhost:8006')
-    : '';
+import { getCurrentUser, User } from "@/shared/lib/auth";
+import api from "@/shared/api/axios";

 interface UserListItem {
    id: string;
-    email: string;
+    phone: string;
    username: string | null;
    role: string;
    is_active: boolean;
@@ -43,11 +40,7 @@ export default function AdminPage() {

    const fetchUsers = async () => {
        try {
-            const res = await fetch(`${API_BASE}/api/admin/users`, {
-                credentials: 'include'
-            });
-            if (!res.ok) throw new Error('获取用户列表失败');
-            const data = await res.json();
+            const { data } = await api.get('/api/admin/users');
            setUsers(data);
        } catch (err) {
            setError('获取用户列表失败');
@@ -59,15 +52,12 @@ export default function AdminPage() {
    const activateUser = async (userId: string) => {
        setActivatingId(userId);
        try {
-            const res = await fetch(`${API_BASE}/api/admin/users/${userId}/activate`, {
-                method: 'POST',
-                headers: { 'Content-Type': 'application/json' },
-                credentials: 'include',
-                body: JSON.stringify({ expires_days: expireDays || null })
+            await api.post(`/api/admin/users/${userId}/activate`, {
+                expires_days: expireDays || null
            });
-            if (res.ok) {
-                fetchUsers();
-            }
+            fetchUsers();
+        } catch (err) {
+            // axios interceptor handles 401/403
        } finally {
            setActivatingId(null);
        }
@@ -77,10 +67,7 @@ export default function AdminPage() {
        if (!confirm('确定要停用该用户吗？')) return;

        try {
-            await fetch(`${API_BASE}/api/admin/users/${userId}/deactivate`, {
-                method: 'POST',
-                credentials: 'include'
-            });
+            await api.post(`/api/admin/users/${userId}/deactivate`);
            fetchUsers();
        } catch (err) {
            alert('操作失败');
@@ -107,14 +94,14 @@ export default function AdminPage() {

    if (loading) {
        return (
-            <div className="min-h-screen flex items-center justify-center bg-slate-900">
+            <div className="min-h-dvh flex items-center justify-center">
                <div className="animate-spin rounded-full h-12 w-12 border-t-2 border-b-2 border-purple-500"></div>
            </div>
        );
    }

    return (
-        <div className="min-h-screen bg-gradient-to-br from-slate-900 via-purple-900/20 to-slate-900 p-8">
+        <div className="min-h-dvh p-8">
            <div className="max-w-6xl mx-auto">
                <div className="flex justify-between items-center mb-8">
                    <h1 className="text-3xl font-bold text-white">用户管理</h1>
@@ -157,8 +144,8 @@ export default function AdminPage() {
                                <tr key={user.id} className="hover:bg-white/5">
                                    <td className="px-6 py-4">
                                        <div>
-                                            <div className="text-white font-medium">{user.username || user.email.split('@')[0]}</div>
-                                            <div className="text-gray-400 text-sm">{user.email}</div>
+                                            <div className="text-white font-medium">{user.username || `用户${user.phone.slice(-4)}`}</div>
+                                            <div className="text-gray-400 text-sm">{user.phone}</div>
                                        </div>
                                    </td>
                                    <td className="px-6 py-4">
--- a/frontend/src/app/globals.css
+++ b/frontend/src/app/globals.css
@@ -19,23 +19,26 @@
  }
 }

-body {
-  background: var(--background);
-  color: var(--foreground);
-  font-family: Arial, Helvetica, sans-serif;
-}
-
-/* 隐藏滚动条但保留滚动功能 */
+/* iOS Safari safe-area support + hidden scrollbar */
 html {
+  background-color: #0f172a !important;
+  min-height: 100%;
  scrollbar-width: none;
-  /* Firefox */
  -ms-overflow-style: none;
-  /* IE 和 Edge */
 }

 html::-webkit-scrollbar {
  display: none;
-  /* Chrome, Safari, Opera */
+}
+
+body {
+  margin: 0 !important;
+  min-height: 100dvh;
+  color: var(--foreground);
+  font-family: Arial, Helvetica, sans-serif;
+  padding-top: env(safe-area-inset-top);
+  padding-bottom: env(safe-area-inset-bottom);
+  background: linear-gradient(to bottom, #0f172a 0%, #0f172a 5%, #581c87 50%, #0f172a 95%, #0f172a 100%);
 }

 /* 自定义滚动条样式 - 深色主题 */
--- a/frontend/src/app/layout.tsx
+++ b/frontend/src/app/layout.tsx
@@ -1,6 +1,9 @@
-import type { Metadata } from "next";
+import type { Metadata, Viewport } from "next";
 import { Geist, Geist_Mono } from "next/font/google";
 import "./globals.css";
+import { AuthProvider } from "@/contexts/AuthContext";
+import { TaskProvider } from "@/contexts/TaskContext";
+import GlobalTaskIndicator from "@/components/GlobalTaskIndicator";

 const geistSans = Geist({
  variable: "--font-geist-sans",
@@ -13,8 +16,15 @@ const geistMono = Geist_Mono({
 });

 export const metadata: Metadata = {
-  title: "ViGent",
-  description: "ViGent Talking Head Agent",
+  title: "IPAgent",
+  description: "IPAgent Talking Head Agent",
+};
+
+export const viewport: Viewport = {
+  width: 'device-width',
+  initialScale: 1,
+  viewportFit: 'cover',
+  themeColor: '#0f172a',
 };

 export default function RootLayout({
@@ -27,7 +37,12 @@ export default function RootLayout({
      <body
        className={`${geistSans.variable} ${geistMono.variable} antialiased`}
      >
-        {children}
+        <AuthProvider>
+          <TaskProvider>
+            <GlobalTaskIndicator />
+            {children}
+          </TaskProvider>
+        </AuthProvider>
      </body>
    </html>
  );
--- a/frontend/src/app/login/page.tsx
+++ b/frontend/src/app/login/page.tsx
@@ -2,11 +2,11 @@

 import { useState } from 'react';
 import { useRouter } from 'next/navigation';
-import { login } from '@/lib/auth';
+import { login } from "@/shared/lib/auth";

 export default function LoginPage() {
    const router = useRouter();
-    const [email, setEmail] = useState('');
+    const [phone, setPhone] = useState('');
    const [password, setPassword] = useState('');
    const [error, setError] = useState('');
    const [loading, setLoading] = useState(false);
@@ -14,10 +14,17 @@ export default function LoginPage() {
    const handleSubmit = async (e: React.FormEvent) => {
        e.preventDefault();
        setError('');
+
+        // Validate the phone number format
+        if (!/^\d{11}$/.test(phone)) {
+            setError('请输入正确的11位手机号');
+            return;
+        }
+
        setLoading(true);

        try {
-            const result = await login(email, password);
+            const result = await login(phone, password);
            if (result.success) {
                router.push('/');
            } else {
@@ -31,25 +38,26 @@ export default function LoginPage() {
    };

    return (
-        <div className="min-h-screen flex items-center justify-center bg-gradient-to-br from-slate-900 via-purple-900 to-slate-900">
+        <div className="min-h-dvh flex items-center justify-center">
            <div className="w-full max-w-md p-8 bg-white/10 backdrop-blur-lg rounded-2xl shadow-2xl border border-white/20">
                <div className="text-center mb-8">
-                    <h1 className="text-3xl font-bold text-white mb-2">ViGent</h1>
+                    <h1 className="text-3xl font-bold text-white mb-2">IPAgent</h1>
                    <p className="text-gray-300">AI 视频生成平台</p>
                </div>

                <form onSubmit={handleSubmit} className="space-y-6">
                    <div>
                        <label className="block text-sm font-medium text-gray-200 mb-2">
-                            邮箱
+                            手机号
                        </label>
                        <input
-                            type="email"
-                            value={email}
-                            onChange={(e) => setEmail(e.target.value)}
+                            type="tel"
+                            value={phone}
+                            onChange={(e) => setPhone(e.target.value.replace(/\D/g, '').slice(0, 11))}
                            required
+                            maxLength={11}
                            className="w-full px-4 py-3 bg-white/5 border border-white/10 rounded-lg text-white placeholder-gray-400 focus:outline-none focus:ring-2 focus:ring-purple-500 focus:border-transparent"
-                            placeholder="your@email.com"
+                            placeholder="请输入11位手机号"
                        />
                    </div>

--- a/frontend/src/app/page.tsx
+++ b/frontend/src/app/page.tsx
@@ -1,630 +1,5 @@
+import { HomePage } from "@/features/home/ui/HomePage";

-"use client";
-
-import { useState, useEffect } from "react";
-import Link from "next/link";
-
-const API_BASE = typeof window === 'undefined'
-  ? 'http://localhost:8006'
-  : '';
-
-// 类型定义
-interface Material {
-  id: string;
-  name: string;
-  scene: string;
-  size_mb: number;
-  path: string;
-}
-
-interface Task {
-  task_id: string;
-  status: string;
-  progress: number;
-  message: string;
-  download_url?: string;
-}
-
-interface GeneratedVideo {
-  id: string;
-  name: string;
-  path: string;
-  size_mb: number;
-  created_at: number;
-}
-
-export default function Home() {
-  const [materials, setMaterials] = useState<Material[]>([]);
-  const [selectedMaterial, setSelectedMaterial] = useState<string>("");
-  const [text, setText] = useState<string>(
-    "大家好，欢迎来到我的频道，今天给大家分享一些有趣的内容。"
-  );
-  const [voice, setVoice] = useState<string>("zh-CN-YunxiNeural");
-  const [isGenerating, setIsGenerating] = useState(false);
-  const [currentTask, setCurrentTask] = useState<Task | null>(null);
-  const [generatedVideo, setGeneratedVideo] = useState<string | null>(null);
-  const [fetchError, setFetchError] = useState<string | null>(null);
-  const [debugData, setDebugData] = useState<string>("");
-  const [isUploading, setIsUploading] = useState(false);
-  const [uploadProgress, setUploadProgress] = useState(0);
-  const [uploadError, setUploadError] = useState<string | null>(null);
-  const [generatedVideos, setGeneratedVideos] = useState<GeneratedVideo[]>([]);
-  const [selectedVideoId, setSelectedVideoId] = useState<string | null>(null);
-
-  // 可选音色
-  const voices = [
-    { id: "zh-CN-YunxiNeural", name: "云溪 (男声-年轻)" },
-    { id: "zh-CN-YunjianNeural", name: "云健 (男声-新闻)" },
-    { id: "zh-CN-YunyangNeural", name: "云扬 (男声-专业)" },
-    { id: "zh-CN-XiaoxiaoNeural", name: "晓晓 (女声-活泼)" },
-    { id: "zh-CN-XiaoyiNeural", name: "晓伊 (女声-温柔)" },
-  ];
-
-  // 加载素材列表和历史视频
-  useEffect(() => {
-    fetchMaterials();
-    fetchGeneratedVideos();
-  }, []);
-
-  const fetchMaterials = async () => {
-    try {
-      setFetchError(null);
-      setDebugData("Loading...");
-
-      // Add timestamp to prevent caching
-      const url = `${API_BASE}/api/materials?t=${new Date().getTime()}`;
-      const res = await fetch(url);
-
-      if (!res.ok) {
-        throw new Error(`HTTP ${res.status} ${res.statusText}`);
-      }
-
-      const text = await res.text(); // Get raw text first
-      setDebugData(text.substring(0, 200) + (text.length > 200 ? "..." : "")); // Show preview
-
-      const data = JSON.parse(text);
-      setMaterials(data.materials || []);
-
-      if (data.materials?.length > 0) {
-        if (!selectedMaterial) {
-          setSelectedMaterial(data.materials[0].id);
-        }
-      }
-    } catch (error) {
-      console.error("获取素材失败:", error);
-      setFetchError(String(error));
-      setDebugData(`Error: ${String(error)}`);
-    }
-  };
-
-  // 获取已生成的视频列表（持久化）
-  const fetchGeneratedVideos = async () => {
-    try {
-      const res = await fetch(`${API_BASE}/api/videos/generated`);
-      if (res.ok) {
-        const data = await res.json();
-        setGeneratedVideos(data.videos || []);
-      }
-    } catch (error) {
-      console.error("获取历史视频失败:", error);
-    }
-  };
-
-  // 删除素材
-  const deleteMaterial = async (materialId: string) => {
-    if (!confirm("确定要删除这个素材吗？")) return;
-    try {
-      const res = await fetch(`${API_BASE}/api/materials/${materialId}`, {
-        method: "DELETE",
-      });
-      if (res.ok) {
-        fetchMaterials();
-        if (selectedMaterial === materialId) {
-          setSelectedMaterial("");
-        }
-      } else {
-        alert("删除失败");
-      }
-    } catch (error) {
-      alert("删除失败: " + error);
-    }
-  };
-
-  // 删除生成的视频
-  const deleteVideo = async (videoId: string) => {
-    if (!confirm("确定要删除这个视频吗？")) return;
-    try {
-      const res = await fetch(`${API_BASE}/api/videos/generated/${videoId}`, {
-        method: "DELETE",
-      });
-      if (res.ok) {
-        fetchGeneratedVideos();
-        if (selectedVideoId === videoId) {
-          setSelectedVideoId(null);
-          setGeneratedVideo(null);
-        }
-      } else {
-        alert("删除失败");
-      }
-    } catch (error) {
-      alert("删除失败: " + error);
-    }
-  };
-
-  // 上传视频
-  const handleUpload = async (e: React.ChangeEvent<HTMLInputElement>) => {
-    const file = e.target.files?.[0];
-    if (!file) return;
-
-    // 验证文件类型
-    const validTypes = ['.mp4', '.mov', '.avi'];
-    const ext = file.name.toLowerCase().slice(file.name.lastIndexOf('.'));
-    if (!validTypes.includes(ext)) {
-      setUploadError('仅支持 MP4、MOV、AVI 格式');
-      return;
-    }
-
-    setIsUploading(true);
-    setUploadProgress(0);
-    setUploadError(null);
-
-    const formData = new FormData();
-    formData.append('file', file);
-
-    // 使用 XMLHttpRequest 以获取上传进度
-    const xhr = new XMLHttpRequest();
-
-    xhr.upload.onprogress = (event) => {
-      if (event.lengthComputable) {
-        const progress = Math.round((event.loaded / event.total) * 100);
-        setUploadProgress(progress);
-      }
-    };
-
-    xhr.onload = () => {
-      setIsUploading(false);
-      if (xhr.status >= 200 && xhr.status < 300) {
-        fetchMaterials(); // 刷新素材列表
-        setUploadProgress(100);
-      } else {
-        setUploadError(`上传失败: ${xhr.statusText}`);
-      }
-    };
-
-    xhr.onerror = () => {
-      setIsUploading(false);
-      setUploadError('网络错误，上传失败');
-    };
-
-    xhr.open('POST', `${API_BASE}/api/materials`);
-    xhr.send(formData);
-
-    // 清空 input 以便可以再次选择同一文件
-    e.target.value = '';
-  };
-
-  // 生成视频
-  const handleGenerate = async () => {
-    if (!selectedMaterial || !text.trim()) {
-      alert("请选择素材并输入文案");
-      return;
-    }
-
-    setIsGenerating(true);
-    setGeneratedVideo(null);
-
-    try {
-      // 查找选中的素材对象以获取路径
-      const materialObj = materials.find(m => m.id === selectedMaterial);
-      if (!materialObj) {
-        alert("素材数据异常");
-        return;
-      }
-
-      // 创建生成任务
-      const res = await fetch(`${API_BASE}/api/videos/generate`, {
-        method: "POST",
-        headers: { "Content-Type": "application/json" },
-        body: JSON.stringify({
-          material_path: materialObj.path,
-          text: text,
-          voice: voice,
-          add_subtitle: true,
-        }),
-      });
-
-      const data = await res.json();
-      const taskId = data.task_id;
-
-      // 轮询任务状态
-      const pollTask = async () => {
-        const taskRes = await fetch(`${API_BASE}/api/videos/tasks/${taskId}`);
-        const taskData: Task = await taskRes.json();
-        setCurrentTask(taskData);
-
-        if (taskData.status === "completed") {
-          setGeneratedVideo(`${API_BASE}${taskData.download_url}`);
-          setIsGenerating(false);
-          fetchGeneratedVideos(); // 刷新历史视频列表
-        } else if (taskData.status === "failed") {
-          alert("视频生成失败: " + taskData.message);
-          setIsGenerating(false);
-        } else {
-          setTimeout(pollTask, 1000);
-        }
-      };
-
-      pollTask();
-    } catch (error) {
-      console.error("生成失败:", error);
-      setIsGenerating(false);
-    }
-  };
-
-  return (
-    <div className="min-h-screen bg-gradient-to-br from-slate-900 via-purple-900 to-slate-900">
-      {/* Header            <header className="border-b border-white/10 bg-black/20 backdrop-blur-sm">
-                <div className="max-w-6xl mx-auto px-6 py-4 flex items-center justify-between">
-                    <h1 className="text-2xl font-bold text-white flex items-center gap-3">
-                        <span className="text-4xl">🎬</span>
-                        ViGent
-                    </h1>
-                    <div className="flex items-center gap-4">
-                        <span className="px-4 py-2 bg-gradient-to-r from-purple-600 to-pink-600 text-white rounded-lg font-semibold">
-                            视频生成
-                        </span>
-                        <Link 
-                            href="/publish" 
-                            className="px-4 py-2 bg-white/10 hover:bg-white/20 text-white rounded-lg transition-colors"
-                        >
-                            发布管理
-                        </Link>
-                    </div>
-                </div>
-            </header> */}
-      <header className="border-b border-white/10 bg-black/20 backdrop-blur-sm">
-        <div className="max-w-6xl mx-auto px-6 py-4 flex items-center justify-between">
-          <Link href="/" className="text-2xl font-bold text-white flex items-center gap-3 hover:opacity-80 transition-opacity">
-            <span className="text-4xl">🎬</span>
-            ViGent
-          </Link>
-          <div className="flex items-center gap-4">
-            <span className="px-4 py-2 bg-gradient-to-r from-purple-600 to-pink-600 text-white rounded-lg font-semibold">
-              视频生成
-            </span>
-            <Link
-              href="/publish"
-              className="px-4 py-2 bg-white/10 hover:bg-white/20 text-white rounded-lg transition-colors"
-            >
-              发布管理
-            </Link>
-            <button
-              onClick={async () => {
-                if (confirm('确定要退出登录吗？')) {
-                  try {
-                    await fetch(`${API_BASE}/api/auth/logout`, { method: 'POST' });
-                    window.location.href = '/login';
-                  } catch (e) {
-                    window.location.href = '/login';
-                  }
-                }
-              }}
-              className="px-4 py-2 bg-red-500/10 hover:bg-red-500/20 text-red-200 rounded-lg transition-colors"
-            >
-              退出
-            </button>
-          </div>
-        </div>
-      </header>
-
-      <main className="max-w-6xl mx-auto px-6 py-8">
-        <div className="grid grid-cols-1 lg:grid-cols-2 gap-8">
-          {/* 左侧: 输入区域 */}
-          <div className="space-y-6">
-            {/* 素材选择 */}
-            <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
-              <div className="flex justify-between items-center mb-4">
-                <h2 className="text-lg font-semibold text-white flex items-center gap-2">
-                  📹 选择素材视频
-                </h2>
-                <div className="flex gap-2">
-                  {/* 隐藏的文件输入 */}
-                  <input
-                    type="file"
-                    id="video-upload"
-                    accept=".mp4,.mov,.avi"
-                    onChange={handleUpload}
-                    className="hidden"
-                  />
-                  <label
-                    htmlFor="video-upload"
-                    className={`px-3 py-1 text-xs rounded cursor-pointer transition-all ${isUploading
-                      ? "bg-gray-600 cursor-not-allowed text-gray-400"
-                      : "bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-700 hover:to-pink-700 text-white"
-                      }`}
-                  >
-                    📤 上传视频
-                  </label>
-                  <button
-                    onClick={fetchMaterials}
-                    className="px-3 py-1 text-xs bg-white/10 hover:bg-white/20 rounded text-gray-300"
-                  >
-                    🔄 刷新
-                  </button>
-                </div>
-              </div>
-
-              {/* 上传进度条 */}
-              {isUploading && (
-                <div className="mb-4 p-4 bg-purple-500/10 rounded-xl border border-purple-500/30">
-                  <div className="flex justify-between text-sm text-purple-300 mb-2">
-                    <span>📤 上传中...</span>
-                    <span>{uploadProgress}%</span>
-                  </div>
-                  <div className="h-2 bg-black/30 rounded-full overflow-hidden">
-                    <div
-                      className="h-full bg-gradient-to-r from-purple-500 to-pink-500 transition-all duration-300"
-                      style={{ width: `${uploadProgress}%` }}
-                    />
-                  </div>
-                </div>
-              )}
-
-              {/* 上传错误提示 */}
-              {uploadError && (
-                <div className="mb-4 p-4 bg-red-500/20 text-red-200 rounded-xl text-sm flex justify-between items-center">
-                  <span>❌ {uploadError}</span>
-                  <button
-                    onClick={() => setUploadError(null)}
-                    className="text-red-300 hover:text-white"
-                  >
-                    ✕
-                  </button>
-                </div>
-              )}
-
-              {fetchError ? (
-                <div className="p-4 bg-red-500/20 text-red-200 rounded-xl text-sm mb-4">
-                  获取素材失败: {fetchError}
-                  <br />
-                  API: {API_BASE}/api/materials/
-                </div>
-              ) : materials.length === 0 ? (
-                <div className="text-center py-8 text-gray-400">
-                  <div className="text-5xl mb-4">📁</div>
-                  <p>暂无素材视频</p>
-                  <p className="text-sm mt-2">
-                    点击上方「📤 上传视频」按钮添加素材
-                  </p>
-                </div>
-              ) : (
-                <div className="grid grid-cols-2 gap-3">
-                  {materials.map((m) => (
-                    <div
-                      key={m.id}
-                      className={`p-4 rounded-xl border-2 transition-all text-left relative group ${selectedMaterial === m.id
-                        ? "border-purple-500 bg-purple-500/20"
-                        : "border-white/10 bg-white/5 hover:border-white/30"
-                        }`}
-                    >
-                      <button
-                        onClick={() => setSelectedMaterial(m.id)}
-                        className="w-full text-left"
-                      >
-                        <div className="text-white font-medium truncate pr-6">
-                          {m.scene || m.name}
-                        </div>
-                        <div className="text-gray-400 text-sm mt-1">
-                          {m.size_mb.toFixed(1)} MB
-                        </div>
-                      </button>
-                      <button
-                        onClick={(e) => {
-                          e.stopPropagation();
-                          deleteMaterial(m.id);
-                        }}
-                        className="absolute top-2 right-2 p-1 text-gray-500 hover:text-red-400 opacity-0 group-hover:opacity-100 transition-opacity"
-                        title="删除素材"
-                      >
-                        🗑️
-                      </button>
-                    </div>
-                  ))}
-                </div>
-              )}
-            </div>
-
-            {/* 文案输入 */}
-            <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
-              <h2 className="text-lg font-semibold text-white mb-4 flex items-center gap-2">
-                ✍️ 输入口播文案
-              </h2>
-              <textarea
-                value={text}
-                onChange={(e) => setText(e.target.value)}
-                placeholder="请输入你想说的话..."
-                className="w-full h-40 bg-black/30 border border-white/10 rounded-xl p-4 text-white placeholder-gray-500 resize-none focus:outline-none focus:border-purple-500 transition-colors"
-              />
-              <div className="flex justify-between mt-2 text-sm text-gray-400">
-                <span>{text.length} 字</span>
-                <span>预计时长: ~{Math.ceil(text.length / 4)} 秒</span>
-              </div>
-            </div>
-
-            {/* 音色选择 */}
-            <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
-              <h2 className="text-lg font-semibold text-white mb-4 flex items-center gap-2">
-                🎙️ 选择配音音色
-              </h2>
-              <div className="grid grid-cols-2 gap-3">
-                {voices.map((v) => (
-                  <button
-                    key={v.id}
-                    onClick={() => setVoice(v.id)}
-                    className={`p-3 rounded-xl border-2 transition-all text-left ${voice === v.id
-                      ? "border-purple-500 bg-purple-500/20"
-                      : "border-white/10 bg-white/5 hover:border-white/30"
-                      }`}
-                  >
-                    <span className="text-white text-sm">{v.name}</span>
-                  </button>
-                ))}
-              </div>
-            </div>
-
-            {/* 生成按钮 */}
-            <button
-              onClick={handleGenerate}
-              disabled={isGenerating || !selectedMaterial}
-              className={`w-full py-4 rounded-xl font-bold text-lg transition-all ${isGenerating || !selectedMaterial
-                ? "bg-gray-600 cursor-not-allowed text-gray-400"
-                : "bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-700 hover:to-pink-700 text-white shadow-lg hover:shadow-purple-500/25"
-                }`}
-            >
-              {isGenerating ? (
-                <span className="flex items-center justify-center gap-3">
-                  <svg className="animate-spin h-5 w-5" viewBox="0 0 24 24">
-                    <circle
-                      className="opacity-25"
-                      cx="12"
-                      cy="12"
-                      r="10"
-                      stroke="currentColor"
-                      strokeWidth="4"
-                      fill="none"
-                    />
-                    <path
-                      className="opacity-75"
-                      fill="currentColor"
-                      d="M4 12a8 8 0 018-8V0C5.373 0 0 5.373 0 12h4z"
-                    />
-                  </svg>
-                  生成中... {currentTask?.progress || 0}%
-                </span>
-              ) : (
-                "🚀 生成视频"
-              )}
-            </button>
-          </div>
-
-          {/* 右侧: 预览区域 */}
-          <div className="space-y-6">
-            {/* 进度显示 */}
-            {currentTask && isGenerating && (
-              <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
-                <h2 className="text-lg font-semibold text-white mb-4">
-                  ⏳ 生成进度
-                </h2>
-                <div className="space-y-3">
-                  <div className="h-3 bg-black/30 rounded-full overflow-hidden">
-                    <div
-                      className="h-full bg-gradient-to-r from-purple-500 to-pink-500 transition-all duration-300"
-                      style={{ width: `${currentTask.progress}%` }}
-                    />
-                  </div>
-                  <p className="text-gray-300">{currentTask.message}</p>
-                </div>
-              </div>
-            )}
-
-            {/* 视频预览 */}
-            <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
-              <h2 className="text-lg font-semibold text-white mb-4 flex items-center gap-2">
-                🎥 视频预览
-              </h2>
-              <div className="aspect-video bg-black/50 rounded-xl overflow-hidden flex items-center justify-center">
-                {generatedVideo ? (
-                  <video
-                    src={generatedVideo}
-                    controls
-                    className="w-full h-full object-contain"
-                  />
-                ) : (
-                  <div className="text-gray-500 text-center">
-                    <div className="text-5xl mb-4">📹</div>
-                    <p>生成的视频将在这里预览</p>
-                  </div>
-                )}
-              </div>
-
-              {generatedVideo && (
-                <>
-                  <a
-                    href={generatedVideo}
-                    download
-                    className="mt-4 w-full py-3 rounded-xl bg-green-600 hover:bg-green-700 text-white font-medium flex items-center justify-center gap-2 transition-colors"
-                  >
-                    ⬇️ 下载视频
-                  </a>
-                  <Link
-                    href="/publish"
-                    className="mt-3 w-full py-3 rounded-xl bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-700 hover:to-pink-700 text-white font-medium flex items-center justify-center gap-2 transition-colors"
-                  >
-                    📤 发布到社交平台
-                  </Link>
-                </>
-              )}
-            </div>
-
-            {/* 历史视频列表 */}
-            <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
-              <div className="flex justify-between items-center mb-4">
-                <h2 className="text-lg font-semibold text-white flex items-center gap-2">
-                  📂 历史视频
-                </h2>
-                <button
-                  onClick={fetchGeneratedVideos}
-                  className="px-3 py-1 text-xs bg-white/10 hover:bg-white/20 rounded text-gray-300"
-                >
-                  🔄 刷新
-                </button>
-              </div>
-              {generatedVideos.length === 0 ? (
-                <div className="text-center py-4 text-gray-500">
-                  <p>暂无生成的视频</p>
-                </div>
-              ) : (
-                <div className="space-y-2 max-h-64 overflow-y-auto hide-scrollbar">
-                  {generatedVideos.map((v) => (
-                    <div
-                      key={v.id}
-                      className={`p-3 rounded-lg border transition-all flex items-center justify-between group ${selectedVideoId === v.id
-                        ? "border-purple-500 bg-purple-500/20"
-                        : "border-white/10 bg-white/5 hover:border-white/30"
-                        }`}
-                    >
-                      <button
-                        onClick={() => {
-                          setSelectedVideoId(v.id);
-                          setGeneratedVideo(`${API_BASE}${v.path}`);
-                        }}
-                        className="flex-1 text-left"
-                      >
-                        <div className="text-white text-sm truncate">
-                          {new Date(v.created_at * 1000).toLocaleString('zh-CN')}
-                        </div>
-                        <div className="text-gray-400 text-xs">
-                          {v.size_mb.toFixed(1)} MB
-                        </div>
-                      </button>
-                      <button
-                        onClick={(e) => {
-                          e.stopPropagation();
-                          deleteVideo(v.id);
-                        }}
-                        className="p-1 text-gray-500 hover:text-red-400 opacity-0 group-hover:opacity-100 transition-opacity"
-                        title="删除视频"
-                      >
-                        🗑️
-                      </button>
-                    </div>
-                  ))}
-                </div>
-              )}
-            </div>
-          </div>
-        </div>
-      </main>
-    </div>
-  );
+export default function Page() {
+  return <HomePage />;
 }
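Review note on the removed `handleGenerate`: `pollTask()` was called without `await`, so a rejected fetch during polling escaped the surrounding `try/catch` and left `isGenerating` set. The early return on a missing material had the same effect, and nothing bounded the number of polls. The replacement `HomePage` is not part of this hunk, so whether it already handles these cases is unknown. A minimal sketch of a bounded polling helper follows. The function name, the option names, and the injected `fetchTask` callback are hypothetical; only the `Task` fields and the `completed` / `failed` statuses come from the removed code.

```typescript
// Hypothetical helper; not taken from the repository.
interface TaskStatus {
  task_id: string;
  status: string;
  progress: number;
  message: string;
  download_url?: string;
}

// Injected so the helper does not depend on a particular fetch wrapper.
type FetchTask = (taskId: string) => Promise<TaskStatus>;

interface PollOptions {
  intervalMs?: number;  // delay between polls
  maxAttempts?: number; // upper bound so polling cannot run forever
  onProgress?: (task: TaskStatus) => void;
}

async function pollTask(
  taskId: string,
  fetchTask: FetchTask,
  { intervalMs = 1000, maxAttempts = 600, onProgress }: PollOptions = {},
): Promise<TaskStatus> {
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    // A rejection here propagates to the caller instead of being lost.
    const task = await fetchTask(taskId);
    onProgress?.(task);
    if (task.status === "completed") return task;
    if (task.status === "failed") throw new Error(task.message);
    await new Promise<void>((resolve) => setTimeout(resolve, intervalMs));
  }
  throw new Error(`Task ${taskId} did not finish after ${maxAttempts} polls`);
}
```

A caller would `await pollTask(...)` inside `try { ... } finally { setIsGenerating(false); }`, so every exit path, including early returns and network errors, resets the loading state.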
--- a/frontend/src/app/publish/page.tsx
+++ b/frontend/src/app/publish/page.tsx
@@ -1,519 +1,5 @@
-"use client";
-
-import { useState, useEffect } from "react";
-import useSWR from 'swr';
-
-const fetcher = (url: string) => fetch(url).then((res) => res.json());
-import Link from "next/link";
-
-// 动态获取 API 地址：服务端使用 localhost，客户端使用当前域名
-const API_BASE = typeof window === 'undefined'
-    ? 'http://localhost:8006'
-    : '';
-
-interface Account {
-    platform: string;
-    name: string;
-    logged_in: boolean;
-    enabled: boolean;
-}
-
-interface Video {
-    name: string;
-    path: string;
-}
-
-export default function PublishPage() {
-    const [accounts, setAccounts] = useState<Account[]>([]);
-    const [videos, setVideos] = useState<Video[]>([]);
-    const [selectedVideo, setSelectedVideo] = useState<string>("");
-    const [selectedPlatforms, setSelectedPlatforms] = useState<string[]>([]);
-    const [title, setTitle] = useState<string>("");
-    const [tags, setTags] = useState<string>("");
-    const [isPublishing, setIsPublishing] = useState(false);
-    const [publishResults, setPublishResults] = useState<any[]>([]);
-    const [scheduleMode, setScheduleMode] = useState<"now" | "scheduled">("now");
-    const [publishTime, setPublishTime] = useState<string>("");
-    const [qrCodeImage, setQrCodeImage] = useState<string | null>(null);
-    const [qrPlatform, setQrPlatform] = useState<string | null>(null);
-    const [isLoadingQR, setIsLoadingQR] = useState(false);
-
-    // 加载账号和视频列表
-    useEffect(() => {
-        fetchAccounts();
-        fetchVideos();
-    }, []);
-
-    const fetchAccounts = async () => {
-        try {
-            const res = await fetch(`${API_BASE}/api/publish/accounts`);
-            const data = await res.json();
-            setAccounts(data.accounts || []);
-        } catch (error) {
-            console.error("获取账号失败:", error);
-        }
-    };
-
-    const fetchVideos = async () => {
-        try {
-            // 使用持久化的视频列表 API（从文件系统读取）
-            const res = await fetch(`${API_BASE}/api/videos/generated`);
-            const data = await res.json();
-
-            const videos = (data.videos || []).map((v: any) => ({
-                name: new Date(v.created_at * 1000).toLocaleString('zh-CN') + ` (${v.size_mb.toFixed(1)}MB)`,
-                path: v.path.startsWith('/') ? v.path.slice(1) : v.path,  // 移除开头的 /
-            }));
-
-            setVideos(videos);
-            if (videos.length > 0) {
-                setSelectedVideo(videos[0].path);
-            }
-        } catch (error) {
-            console.error("获取视频失败:", error);
-        }
-    };
-
-    const togglePlatform = (platform: string) => {
-        if (selectedPlatforms.includes(platform)) {
-            setSelectedPlatforms(selectedPlatforms.filter((p) => p !== platform));
-        } else {
-            setSelectedPlatforms([...selectedPlatforms, platform]);
-        }
-    };
-
-    const handlePublish = async () => {
-        if (!selectedVideo || !title || selectedPlatforms.length === 0) {
-            alert("请选择视频、填写标题并选择至少一个平台");
-            return;
-        }
-
-        setIsPublishing(true);
-        setPublishResults([]);
-
-        const tagList = tags.split(/[,，\s]+/).filter((t) => t.trim());
-
-        for (const platform of selectedPlatforms) {
-            try {
-                const res = await fetch(`${API_BASE}/api/publish`, {
-                    method: "POST",
-                    headers: { "Content-Type": "application/json" },
-                    body: JSON.stringify({
-                        video_path: selectedVideo,
-                        platform,
-                        title,
-                        tags: tagList,
-                        description: "",
-                        publish_time: scheduleMode === "scheduled" && publishTime
-                            ? new Date(publishTime).toISOString()
-                            : null
-                    }),
-                });
-
-                const result = await res.json();
-                setPublishResults((prev) => [...prev, result]);
-                // 发布成功后10秒自动清除结果
-                if (result.success) {
-                    setTimeout(() => {
-                        setPublishResults((prev) => prev.filter((r) => r !== result));
-                    }, 10000);
-                }
-            } catch (error) {
-                setPublishResults((prev) => [
-                    ...prev,
-                    { platform, success: false, message: String(error) },
-                ]);
-            }
-        }
-
-        setIsPublishing(false);
-    };
-
-    // SWR Polling for Login Status
-    const { data: loginStatus } = useSWR(
-        qrPlatform ? `${API_BASE}/api/publish/login/status/${qrPlatform}` : null,
-        fetcher,
-        {
-            refreshInterval: 2000,
-            onSuccess: (data) => {
-                if (data.success) {
-                    setQrCodeImage(null);
-                    setQrPlatform(null);
-                    alert('✅ 登录成功！');
-                    fetchAccounts();
-                }
-            }
-        }
-    );
-
-    // Timeout logic for QR code (business logic: stop after 2 mins)
-    useEffect(() => {
-        let timer: NodeJS.Timeout;
-        if (qrPlatform) {
-            timer = setTimeout(() => {
-                if (qrPlatform) { // Double check active
-                    setQrPlatform(null);
-                    setQrCodeImage(null);
-                    alert('登录超时，请重试');
-                }
-            }, 120000);
-        }
-        return () => clearTimeout(timer);
-    }, [qrPlatform]);
-
-    const handleLogin = async (platform: string) => {
-        setIsLoadingQR(true);
-        setQrPlatform(platform);  // 立即显示加载弹窗
-        setQrCodeImage(null);     // 清空旧二维码
-        try {
-            const res = await fetch(`${API_BASE}/api/publish/login/${platform}`, {
-                method: 'POST'
-            });
-            const result = await res.json();
-
-            if (result.success && result.qr_code) {
-                setQrCodeImage(result.qr_code);
-                // SWR hook will automatically start polling since qrPlatform is set
-            } else {
-                setQrPlatform(null);  // 失败时关闭弹窗
-                alert(result.message || '登录失败');
-            }
-        } catch (error) {
-            setQrPlatform(null);  // 失败时关闭弹窗
-            alert(`登录失败: ${error}`);
-        } finally {
-            setIsLoadingQR(false);
-        }
-    };
-
-    const handleLogout = async (platform: string) => {
-        if (!confirm('确定要注销登录吗？')) return;
-        try {
-            const res = await fetch(`${API_BASE}/api/publish/logout/${platform}`, {
-                method: 'POST'
-            });
-            const result = await res.json();
-            if (result.success) {
-                alert('已注销');
-                fetchAccounts();
-            } else {
-                alert(result.message || '注销失败');
-            }
-        } catch (error) {
-            alert(`注销失败: ${error}`);
-        }
-    };
-
-    const platformIcons: Record<string, string> = {
-        douyin: "🎵",
-        xiaohongshu: "📕",
-        weixin: "💬",
-        kuaishou: "⚡",
-        bilibili: "📺",
-    };
-
-    return (
-        <div className="min-h-screen bg-gradient-to-br from-gray-900 via-purple-900 to-gray-900">
-            {/* QR码弹窗 */}
-            {qrPlatform && (
-                <div className="fixed inset-0 bg-black/80 flex items-center justify-center z-50">
-                    <div className="bg-white rounded-2xl p-8 max-w-md min-w-[320px]">
-                        <h2 className="text-2xl font-bold mb-4 text-center">🔐 扫码登录 {qrPlatform}</h2>
-                        {isLoadingQR ? (
-                            <div className="flex flex-col items-center py-8">
-                                <div className="animate-spin w-16 h-16 border-4 border-purple-500 border-t-transparent rounded-full" />
-                                <p className="text-gray-600 mt-4">正在获取二维码...</p>
-                            </div>
-                        ) : qrCodeImage ? (
-                            <>
-                                <img
-                                    src={`data:image/png;base64,${qrCodeImage}`}
-                                    alt="QR Code"
-                                    className="w-full h-auto"
-                                />
-                                <p className="text-center text-gray-600 mt-4">
-                                    请使用手机扫码登录
-                                </p>
-                            </>
-                        ) : null}
-                        <button
-                            onClick={() => { setQrCodeImage(null); setQrPlatform(null); }}
-                            className="w-full mt-4 px-4 py-2 bg-gray-200 rounded-lg hover:bg-gray-300"
-                        >
-                            取消
-                        </button>
-                    </div>
-                </div>
-            )}
-
-            {/* Header - 统一样式 */}
-            <header className="border-b border-white/10 bg-black/20 backdrop-blur-sm">
-                <div className="max-w-6xl mx-auto px-6 py-4 flex items-center justify-between">
-                    <Link href="/" className="text-2xl font-bold text-white flex items-center gap-3 hover:opacity-80 transition-opacity">
-                        <span className="text-4xl">🎬</span>
-                        ViGent
-                    </Link>
-                    <div className="flex items-center gap-4">
-                        <Link
-                            href="/"
-                            className="px-4 py-2 bg-white/10 hover:bg-white/20 text-white rounded-lg transition-colors"
-                        >
-                            返回创作
-                        </Link>
-                        <span className="px-4 py-2 bg-gradient-to-r from-purple-600 to-pink-600 text-white rounded-lg font-semibold">
-                            发布管理
-                        </span>
-                        <button
-                            onClick={async () => {
-                                if (confirm('确定要退出登录吗？')) {
-                                    try {
-                                        await fetch(`${API_BASE}/api/auth/logout`, { method: 'POST' });
-                                        window.location.href = '/login';
-                                    } catch (e) {
-                                        window.location.href = '/login';
-                                    }
-                                }
-                            }}
-                            className="px-4 py-2 bg-red-500/10 hover:bg-red-500/20 text-red-200 rounded-lg transition-colors"
-                        >
-                            退出
-                        </button>
-                    </div>
-                </div>
-            </header>
-
-            <main className="max-w-6xl mx-auto px-6 py-8">
-                <div className="grid grid-cols-1 lg:grid-cols-2 gap-8">
-                    {/* 左侧: 账号管理 */}
-                    <div className="space-y-6">
-                        <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
-                            <h2 className="text-lg font-semibold text-white mb-4 flex items-center gap-2">
-                                👤 平台账号
-                            </h2>
-
-                            <div className="space-y-3">
-                                {accounts.map((account) => (
-                                    <div
-                                        key={account.platform}
-                                        className="flex items-center justify-between p-4 bg-black/30 rounded-xl"
-                                    >
-                                        <div className="flex items-center gap-3">
-                                            <span className="text-2xl">
-                                                {platformIcons[account.platform]}
-                                            </span>
-                                            <div>
-                                                <div className="text-white font-medium">
-                                                    {account.name}
-                                                </div>
-                                                <div
-                                                    className={`text-sm ${account.logged_in
-                                                        ? "text-green-400"
-                                                        : "text-gray-500"
-                                                        }`}
-                                                >
-                                                    {account.logged_in ? "✓ 已登录" : "未登录"}
-                                                </div>
-                                            </div>
-                                        </div>
-                                        <div className="flex gap-2">
-                                            {account.logged_in ? (
-                                                <>
-                                                    <button
-                                                        onClick={() => handleLogin(account.platform)}
-                                                        className="px-3 py-1 bg-white/10 hover:bg-white/20 text-white text-sm rounded-lg transition-colors"
-                                                    >
-                                                        ↻ 重新登录
-                                                    </button>
-                                                    <button
-                                                        onClick={() => handleLogout(account.platform)}
-                                                        className="px-3 py-1 bg-red-500/80 hover:bg-red-600 text-white text-sm rounded-lg transition-colors"
-                                                    >
-                                                        注销
-                                                    </button>
-                                                </>
-                                            ) : (
-                                                <button
-                                                    onClick={() => handleLogin(account.platform)}
-                                                    className="px-3 py-1 bg-purple-600 hover:bg-purple-700 text-white text-sm rounded-lg transition-colors"
-                                                >
-                                                    🔐 扫码登录
-                                                </button>
-                                            )}
-                                        </div>
-                                    </div>
-                                ))}
-                            </div>
-                        </div>
-                    </div>
-
-                    {/* 右侧: 发布表单 */}
-                    <div className="space-y-6">
-                        {/* 选择视频 */}
-                        <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
-                            <h2 className="text-lg font-semibold text-white mb-4">
-                                🎥 选择要发布的视频
-                            </h2>
-
-                            {videos.length === 0 ? (
-                                <p className="text-gray-400">
-                                    暂无已生成的视频，请先
-                                    <Link href="/" className="text-purple-400 hover:underline">
-                                        生成视频
-                                    </Link>
-                                </p>
-                            ) : (
-                                <select
-                                    value={selectedVideo}
-                                    onChange={(e) => setSelectedVideo(e.target.value)}
-                                    className="w-full p-3 bg-black/30 border border-white/10 rounded-xl text-white custom-select cursor-pointer hover:border-purple-500/50 transition-colors"
-                                >
-                                    {videos.map((v) => (
-                                        <option key={v.path} value={v.path}>
-                                            {v.name}
-                                        </option>
-                                    ))}
-                                </select>
-                            )}
-                        </div>
-
-                        {/* 填写信息 */}
-                        <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
-                            <h2 className="text-lg font-semibold text-white mb-4">✍️ 发布信息</h2>
-
-                            <div className="space-y-4">
-                                <div>
-                                    <label className="block text-gray-400 text-sm mb-2">
-                                        标题
-                                    </label>
-                                    <input
-                                        type="text"
-                                        value={title}
-                                        onChange={(e) => setTitle(e.target.value)}
-                                        placeholder="输入视频标题..."
-                                        className="w-full p-3 bg-black/30 border border-white/10 rounded-xl text-white placeholder-gray-500"
-                                    />
-                                </div>
-                                <div>
-                                    <label className="block text-gray-400 text-sm mb-2">
-                                        标签 (用逗号分隔)
-                                    </label>
-                                    <input
-                                        type="text"
-                                        value={tags}
-                                        onChange={(e) => setTags(e.target.value)}
-                                        placeholder="AI, 数字人, 口播..."
-                                        className="w-full p-3 bg-black/30 border border-white/10 rounded-xl text-white placeholder-gray-500"
-                                    />
-                                </div>
-                                <div>
-                                    <label className="block text-gray-400 text-sm mb-2">
-                                        发布时间
-                                    </label>
-                                    <div className="flex gap-3 mb-3">
-                                        <button
-                                            onClick={() => setScheduleMode("now")}
-                                            className={`flex-1 px-4 py-2 rounded-lg font-medium transition-colors ${scheduleMode === "now"
-                                                ? "bg-purple-600 text-white"
-                                                : "bg-black/30 text-gray-400 hover:bg-black/50"
-                                                }`}
-                                        >
-                                            ⚡ 立即发布
-                                        </button>
-                                        <button
-                                            onClick={() => setScheduleMode("scheduled")}
-                                            className={`flex-1 px-4 py-2 rounded-lg font-medium transition-colors ${scheduleMode === "scheduled"
-                                                ? "bg-purple-600 text-white"
-                                                : "bg-black/30 text-gray-400 hover:bg-black/50"
-                                                }`}
-                                        >
-                                            ⏰ 定时发布
-                                        </button>
-                                    </div>
-                                    {scheduleMode === "scheduled" && (
-                                        <input
-                                            type="datetime-local"
-                                            value={publishTime}
-                                            onChange={(e) => setPublishTime(e.target.value)}
-                                            min={new Date().toISOString().slice(0, 16)}
-                                            className="w-full p-3 bg-black/30 border border-white/10 rounded-xl text-white"
-                                        />
-                                    )}
-                                </div>
-                            </div>
-                        </div>
-
-                        {/* 选择平台 */}
-                        <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
-                            <h2 className="text-lg font-semibold text-white mb-4">📱 选择发布平台</h2>
-
-                            <div className="grid grid-cols-3 gap-3">
-                                {accounts
-                                    .filter((a) => a.logged_in)
-                                    .map((account) => (
-                                        <button
-                                            key={account.platform}
-                                            onClick={() => togglePlatform(account.platform)}
-                                            className={`p-3 rounded-xl border-2 transition-all ${selectedPlatforms.includes(account.platform)
-                                                ? "border-purple-500 bg-purple-500/20"
-                                                : "border-white/10 bg-white/5 hover:border-white/30"
-                                                }`}
-                                        >
-                                            <span className="text-2xl block mb-1">
-                                                {platformIcons[account.platform]}
-                                            </span>
-                                            <span className="text-white text-sm">{account.name}</span>
-                                        </button>
-                                    ))}
-                            </div>
-
-                            {accounts.filter((a) => a.logged_in).length === 0 && (
-                                <p className="text-gray-400 text-center py-4">
-                                    请先登录至少一个平台账号
-                                </p>
-                            )}
-                        </div>
-
-                        {/* 发布按钮 */}
-                        <button
-                            onClick={handlePublish}
-                            disabled={isPublishing || selectedPlatforms.length === 0}
-                            className={`w-full py-4 rounded-xl font-bold text-lg transition-all ${isPublishing || selectedPlatforms.length === 0
-                                ? "bg-gray-600 cursor-not-allowed text-gray-400"
-                                : "bg-gradient-to-r from-green-600 to-teal-600 hover:from-green-700 hover:to-teal-700 text-white"
-                                }`}
-                        >
-                            {isPublishing ? "发布中..." : "🚀 一键发布"}
-                        </button>
-
-                        {/* 发布结果 */}
-                        {publishResults.length > 0 && (
-                            <div className="bg-white/5 rounded-2xl p-6 border border-white/10">
-                                <h2 className="text-lg font-semibold text-white mb-4">
-                                    发布结果
-                                </h2>
-                                <div className="space-y-2">
-                                    {publishResults.map((result, i) => (
-                                        <div
-                                            key={i}
-                                            className={`p-3 rounded-lg ${result.success ? "bg-green-500/20" : "bg-red-500/20"
-                                                }`}
-                                        >
-                                            <span className="text-white">
-                                                {platformIcons[result.platform]} {result.message}
-                                            </span>
-                                            {result.success && (
-                                                <p className="text-green-400/80 text-sm mt-1">
-                                                    ⏳ 审核一般需要几分钟，请耐心等待
-                                                </p>
-                                            )}
-                                        </div>
-                                    ))}
-                                </div>
-                            </div>
-                        )}
-                    </div>
-                </div>
-            </main>
-        </div>
-    );
-}
+import { PublishPage } from "@/features/publish/ui/PublishPage";
+
+export default function Page() {
+  return <PublishPage />;
+}
--- a/frontend/src/app/register/page.tsx
+++ b/frontend/src/app/register/page.tsx
@@ -2,11 +2,11 @@

 import { useState } from 'react';
 import { useRouter } from 'next/navigation';
-import { register } from '@/lib/auth';
+import { register } from "@/shared/lib/auth";

 export default function RegisterPage() {
    const router = useRouter();
-    const [email, setEmail] = useState('');
+    const [phone, setPhone] = useState('');
    const [password, setPassword] = useState('');
    const [confirmPassword, setConfirmPassword] = useState('');
    const [username, setUsername] = useState('');
@@ -18,6 +18,12 @@ export default function RegisterPage() {
        e.preventDefault();
        setError('');

+        // Validate the phone number format (exactly 11 digits)
+        if (!/^\d{11}$/.test(phone)) {
+            setError('请输入正确的11位手机号');
+            return;
+        }
+
        if (password !== confirmPassword) {
            setError('两次输入的密码不一致');
            return;
@@ -31,7 +37,7 @@ export default function RegisterPage() {
        setLoading(true);

        try {
-            const result = await register(email, password, username || undefined);
+            const result = await register(phone, password, username || undefined);
            if (result.success) {
                setSuccess(true);
            } else {
@@ -46,7 +52,7 @@ export default function RegisterPage() {

    if (success) {
        return (
-            <div className="min-h-screen flex items-center justify-center bg-gradient-to-br from-slate-900 via-purple-900 to-slate-900">
+            <div className="min-h-dvh flex items-center justify-center">
                <div className="w-full max-w-md p-8 bg-white/10 backdrop-blur-lg rounded-2xl shadow-2xl border border-white/20 text-center">
                    <div className="mb-6">
                        <svg className="w-16 h-16 mx-auto text-green-400" fill="none" stroke="currentColor" viewBox="0 0 24 24">
@@ -69,26 +75,28 @@ export default function RegisterPage() {
    }

    return (
-        <div className="min-h-screen flex items-center justify-center bg-gradient-to-br from-slate-900 via-purple-900 to-slate-900">
+        <div className="min-h-dvh flex items-center justify-center">
            <div className="w-full max-w-md p-8 bg-white/10 backdrop-blur-lg rounded-2xl shadow-2xl border border-white/20">
                <div className="text-center mb-8">
                    <h1 className="text-3xl font-bold text-white mb-2">注册账号</h1>
-                    <p className="text-gray-300">创建您的 ViGent 账号</p>
+                    <p className="text-gray-300">创建您的 IPAgent 账号</p>
                </div>

                <form onSubmit={handleSubmit} className="space-y-5">
                    <div>
                        <label className="block text-sm font-medium text-gray-200 mb-2">
-                            邮箱 <span className="text-red-400">*</span>
+                            手机号 <span className="text-red-400">*</span>
                        </label>
                        <input
-                            type="email"
-                            value={email}
-                            onChange={(e) => setEmail(e.target.value)}
+                            type="tel"
+                            value={phone}
+                            onChange={(e) => setPhone(e.target.value.replace(/\D/g, '').slice(0, 11))}
                            required
+                            maxLength={11}
                            className="w-full px-4 py-3 bg-white/5 border border-white/10 rounded-lg text-white placeholder-gray-400 focus:outline-none focus:ring-2 focus:ring-purple-500"
-                            placeholder="your@email.com"
+                            placeholder="请输入11位手机号"
                        />
+                        <p className="mt-1 text-xs text-gray-500">必须是11位数字</p>
                    </div>

                    <div>
--- a/frontend/src/components/AccountSettingsDropdown.tsx
+++ b/frontend/src/components/AccountSettingsDropdown.tsx
@@ -0,0 +1,211 @@
+"use client";
+
+import { useState, useEffect, useRef } from "react";
+import { useAuth } from "@/contexts/AuthContext";
+import api from "@/shared/api/axios";
+
+// Account settings dropdown component
+export default function AccountSettingsDropdown() {
+    const { user } = useAuth();
+    const [isOpen, setIsOpen] = useState(false);
+    const [showPasswordModal, setShowPasswordModal] = useState(false);
+    const [oldPassword, setOldPassword] = useState('');
+    const [newPassword, setNewPassword] = useState('');
+    const [confirmPassword, setConfirmPassword] = useState('');
+    const [error, setError] = useState('');
+    const [success, setSuccess] = useState('');
+    const [loading, setLoading] = useState(false);
+    const dropdownRef = useRef<HTMLDivElement>(null);
+
+    // Close the menu when the user clicks outside of it
+    useEffect(() => {
+        const handleClickOutside = (event: MouseEvent) => {
+            if (dropdownRef.current && !dropdownRef.current.contains(event.target as Node)) {
+                setIsOpen(false);
+            }
+        };
+        if (isOpen) {
+            document.addEventListener('mousedown', handleClickOutside);
+        }
+        return () => {
+            document.removeEventListener('mousedown', handleClickOutside);
+        };
+    }, [isOpen]);
+
+    // Format the expiry date as YYYY-MM-DD (null means the account never expires)
+    const formatExpiry = (expiresAt: string | null) => {
+        if (!expiresAt) return '永久有效';
+        const date = new Date(expiresAt);
+        return `${date.getFullYear()}-${String(date.getMonth() + 1).padStart(2, '0')}-${String(date.getDate()).padStart(2, '0')}`;
+    };
+
+    const handleLogout = async () => {
+        if (confirm('确定要退出登录吗？')) {
+            try {
+                await api.post('/api/auth/logout');
+            } catch (e) { /* ignore logout errors; redirect to /login regardless */ }
+            window.location.href = '/login';
+        }
+    };
+
+    const handleChangePassword = async (e: React.FormEvent) => {
+        e.preventDefault();
+        setError('');
+        setSuccess('');
+
+        if (newPassword !== confirmPassword) {
+            setError('两次输入的新密码不一致');
+            return;
+        }
+
+        if (newPassword.length < 6) {
+            setError('新密码长度至少6位');
+            return;
+        }
+
+        setLoading(true);
+        try {
+            const res = await api.post('/api/auth/change-password', {
+                old_password: oldPassword,
+                new_password: newPassword
+            });
+            if (res.data.success) {
+                setSuccess('密码修改成功，正在跳转登录页...');
+                // Clear the login state, then redirect to the login page
+                setTimeout(async () => {
+                    try {
+                        await api.post('/api/auth/logout');
+                    } catch (e) { /* ignore logout errors; redirect to /login regardless */ }
+                    window.location.href = '/login';
+                }, 1500);
+            } else {
+                setError(res.data.message || '修改失败');
+            }
+        } catch (err: any) {
+            setError(err.response?.data?.detail || '修改失败，请重试');
+        } finally {
+            setLoading(false);
+        }
+    };
+
+    return (
+        <div className="relative" ref={dropdownRef}>
+            <button
+                onClick={() => setIsOpen(!isOpen)}
+                className="px-2 sm:px-4 py-1 sm:py-2 text-sm sm:text-base bg-white/10 hover:bg-white/20 text-white rounded-lg transition-colors flex items-center gap-1"
+            >
+                <span>⚙️</span>
+                <span className="hidden sm:inline">账户</span>
+                <svg className={`w-4 h-4 transition-transform ${isOpen ? 'rotate-180' : ''}`} fill="none" stroke="currentColor" viewBox="0 0 24 24">
+                    <path strokeLinecap="round" strokeLinejoin="round" strokeWidth={2} d="M19 9l-7 7-7-7" />
+                </svg>
+            </button>
+
+            {/* Dropdown menu */}
+            {isOpen && (
+                <div className="absolute right-0 mt-2 bg-gray-800 border border-white/10 rounded-lg shadow-xl z-[160] overflow-hidden whitespace-nowrap">
+                    {/* Account expiry display */}
+                    <div className="px-3 py-2 border-b border-white/10 text-center">
+                        <div className="text-xs text-gray-400">账户有效期</div>
+                        <div className="text-sm text-white font-medium">
+                            {user?.expires_at ? formatExpiry(user.expires_at) : '永久有效'}
+                        </div>
+                    </div>
+                    <button
+                        onClick={() => {
+                            setIsOpen(false);
+                            setShowPasswordModal(true);
+                        }}
+                        className="w-full px-3 py-2 text-left text-sm text-white hover:bg-white/10 flex items-center gap-2"
+                    >
+                        🔐 修改密码
+                    </button>
+                    <button
+                        onClick={handleLogout}
+                        className="w-full px-3 py-2 text-left text-sm text-red-300 hover:bg-red-500/20 flex items-center gap-2"
+                    >
+                        🚪 退出登录
+                    </button>
+                </div>
+            )}
+
+            {/* Change-password modal */}
+            {showPasswordModal && (
+                <div className="fixed inset-0 z-[200] flex items-start justify-center pt-20 bg-black/60 backdrop-blur-sm p-4">
+                    <div className="w-full max-w-md p-6 bg-gray-900 border border-white/10 rounded-2xl shadow-2xl mx-4">
+                        <h3 className="text-xl font-bold text-white mb-4">修改密码</h3>
+                        <form onSubmit={handleChangePassword} className="space-y-4">
+                            <div>
+                                <label className="block text-sm text-gray-300 mb-1">当前密码</label>
+                                <input
+                                    type="password"
+                                    value={oldPassword}
+                                    onChange={(e) => setOldPassword(e.target.value)}
+                                    required
+                                    className="w-full px-3 py-2 bg-white/5 border border-white/10 rounded-lg text-white placeholder-gray-500 focus:outline-none focus:ring-2 focus:ring-purple-500"
+                                    placeholder="输入当前密码"
+                                />
+                            </div>
+                            <div>
+                                <label className="block text-sm text-gray-300 mb-1">新密码</label>
+                                <input
+                                    type="password"
+                                    value={newPassword}
+                                    onChange={(e) => setNewPassword(e.target.value)}
+                                    required
+                                    className="w-full px-3 py-2 bg-white/5 border border-white/10 rounded-lg text-white placeholder-gray-500 focus:outline-none focus:ring-2 focus:ring-purple-500"
+                                    placeholder="至少6位"
+                                />
+                            </div>
+                            <div>
+                                <label className="block text-sm text-gray-300 mb-1">确认新密码</label>
+                                <input
+                                    type="password"
+                                    value={confirmPassword}
+                                    onChange={(e) => setConfirmPassword(e.target.value)}
+                                    required
+                                    className="w-full px-3 py-2 bg-white/5 border border-white/10 rounded-lg text-white placeholder-gray-500 focus:outline-none focus:ring-2 focus:ring-purple-500"
+                                    placeholder="再次输入新密码"
+                                />
+                            </div>
+
+                            {error && (
+                                <div className="p-2 bg-red-500/20 border border-red-500/50 rounded text-red-200 text-sm">
+                                    {error}
+                                </div>
+                            )}
+                            {success && (
+                                <div className="p-2 bg-green-500/20 border border-green-500/50 rounded text-green-200 text-sm">
+                                    {success}
+                                </div>
+                            )}
+
+                            <div className="flex gap-3 pt-2">
+                                <button
+                                    type="button"
+                                    onClick={() => {
+                                        setShowPasswordModal(false);
+                                        setError('');
+                                        setOldPassword('');
+                                        setNewPassword('');
+                                        setConfirmPassword('');
+                                    }}
+                                    className="flex-1 py-2 bg-white/10 hover:bg-white/20 text-white rounded-lg transition-colors"
+                                >
+                                    取消
+                                </button>
+                                <button
+                                    type="submit"
+                                    disabled={loading}
+                                    className="flex-1 py-2 bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-700 hover:to-pink-700 text-white rounded-lg transition-colors disabled:opacity-50"
+                                >
+                                    {loading ? '修改中...' : '确认修改'}
+                                </button>
+                            </div>
+                        </form>
+                    </div>
+                </div>
+            )}
+        </div>
+    );
+}
--- a/frontend/src/components/GlobalTaskIndicator.tsx
+++ b/frontend/src/components/GlobalTaskIndicator.tsx
@@ -0,0 +1,42 @@
+"use client";
+
+import { useTask } from "@/contexts/TaskContext";
+import Link from "next/link";
+
+export default function GlobalTaskIndicator() {
+  const { currentTask, isGenerating } = useTask();
+
+  if (!isGenerating) return null;
+
+  return (
+    <div className="fixed top-0 left-0 right-0 z-50 bg-gradient-to-r from-purple-600 to-pink-600 text-white shadow-lg">
+      <div className="max-w-6xl mx-auto px-6 py-3">
+        <div className="flex items-center justify-between">
+          <div className="flex items-center gap-3">
+            <div className="animate-spin rounded-full h-5 w-5 border-2 border-white border-t-transparent"></div>
+            <span className="font-medium">
+              视频生成中... {currentTask?.progress || 0}%
+            </span>
+            {currentTask?.message && (
+              <span className="text-white/80 text-sm">
+                {currentTask.message}
+              </span>
+            )}
+          </div>
+          <Link
+            href="/"
+            className="px-3 py-1 bg-white/20 hover:bg-white/30 rounded transition-colors text-sm"
+          >
+            查看详情
+          </Link>
+        </div>
+        <div className="mt-2 w-full bg-white/20 rounded-full h-1.5 overflow-hidden">
+          <div
+            className="bg-white h-full transition-all duration-300 ease-out"
+            style={{ width: `${currentTask?.progress || 0}%` }}
+          ></div>
+        </div>
+      </div>
+    </div>
+  );
+}
--- a/frontend/src/components/ScriptExtractionModal.tsx
+++ b/frontend/src/components/ScriptExtractionModal.tsx
@@ -0,0 +1,424 @@
+"use client";
+
+import { useState, useEffect } from "react";
+import api from "@/shared/api/axios";
+
+interface ScriptExtractionModalProps {
+    isOpen: boolean;
+    onClose: () => void;
+    onApply?: (text: string) => void;
+}
+
+export default function ScriptExtractionModal({
+    isOpen,
+    onClose,
+    onApply
+}: ScriptExtractionModalProps) {
+    const [isLoading, setIsLoading] = useState(false);
+    const [script, setScript] = useState("");
+    const [rewrittenScript, setRewrittenScript] = useState("");
+    const [error, setError] = useState<string | null>(null);
+    const [doRewrite, setDoRewrite] = useState(true);
+    const [step, setStep] = useState<'config' | 'processing' | 'result'>('config');
+    const [dragActive, setDragActive] = useState(false);
+    const [selectedFile, setSelectedFile] = useState<File | null>(null);
+
+    // New state for URL mode
+    const [activeTab, setActiveTab] = useState<'file' | 'url'>('url');
+    const [inputUrl, setInputUrl] = useState("");
+
+    // Reset state when modal opens
+    useEffect(() => {
+        if (isOpen) {
+            setStep('config');
+            setScript("");
+            setRewrittenScript("");
+            setError(null);
+            setIsLoading(false);
+            setSelectedFile(null);
+            setInputUrl("");
+            setActiveTab('url');
+        }
+    }, [isOpen]);
+
+    const handleDrag = (e: React.DragEvent) => {
+        e.preventDefault();
+        e.stopPropagation();
+        if (e.type === "dragenter" || e.type === "dragover") {
+            setDragActive(true);
+        } else if (e.type === "dragleave") {
+            setDragActive(false);
+        }
+    };
+
+    const handleDrop = (e: React.DragEvent) => {
+        e.preventDefault();
+        e.stopPropagation();
+        setDragActive(false);
+        if (e.dataTransfer.files && e.dataTransfer.files[0]) {
+            handleFile(e.dataTransfer.files[0]);
+        }
+    };
+
+    const handleFileChange = (e: React.ChangeEvent<HTMLInputElement>) => {
+        if (e.target.files && e.target.files[0]) {
+            handleFile(e.target.files[0]);
+        }
+    };
+
+    const handleFile = (file: File) => {
+        const validTypes = ['.mp4', '.mov', '.avi', '.mp3', '.wav', '.m4a'];
+        const ext = file.name.includes('.') ? file.name.toLowerCase().slice(file.name.lastIndexOf('.')) : '';
+        if (!validTypes.includes(ext)) {
+            setError(`不支持的文件格式 ${ext}，请上传视频或音频文件`);
+            return;
+        }
+        setSelectedFile(file);
+        setError(null);
+    };
+
+    const handleExtract = async () => {
+        if (activeTab === 'file' && !selectedFile) {
+            setError("请先上传文件");
+            return;
+        }
+        if (activeTab === 'url' && !inputUrl.trim()) {
+            setError("请先输入视频链接");
+            return;
+        }
+
+        setIsLoading(true);
+        setStep('processing');
+        setError(null);
+
+        try {
+            const formData = new FormData();
+            if (activeTab === 'file' && selectedFile) {
+                formData.append('file', selectedFile);
+            } else if (activeTab === 'url') {
+                formData.append('url', inputUrl.trim());
+            }
+            formData.append('rewrite', doRewrite ? 'true' : 'false');
+
+            const { data } = await api.post('/api/tools/extract-script', formData, {
+                headers: { 'Content-Type': 'multipart/form-data' },
+                timeout: 180000 // 3 minutes timeout
+            });
+
+            if (data.success) {
+                setScript(data.original_script);
+                setRewrittenScript(data.rewritten_script || "");
+                setStep('result');
+            } else {
+                setError("提取失败：未知错误");
+                setStep('config');
+            }
+        } catch (err: any) {
+            console.error(err);
+            const msg = err.response?.data?.detail || err.message || "请求失败";
+            setError(msg);
+            setStep('config');
+        } finally {
+            setIsLoading(false);
+        }
+    };
+
+    const copyToClipboard = (text: string) => {
+        if (navigator.clipboard && window.isSecureContext) {
+            navigator.clipboard.writeText(text).then(() => {
+                alert("已复制到剪贴板");
+            }).catch(err => {
+                console.error('Async: Could not copy text: ', err);
+                fallbackCopyTextToClipboard(text);
+            });
+        } else {
+            fallbackCopyTextToClipboard(text);
+        }
+    };
+
+    const fallbackCopyTextToClipboard = (text: string) => {
+        const textArea = document.createElement("textarea");
+        textArea.value = text;
+
+        // Avoid scrolling to bottom
+        textArea.style.top = "0";
+        textArea.style.left = "0";
+        textArea.style.position = "fixed";
+        textArea.style.opacity = "0";
+
+        document.body.appendChild(textArea);
+        textArea.focus();
+        textArea.select();
+
+        try {
+            // document.execCommand is deprecated; it is kept only as a fallback for non-secure contexts
+            const successful = document.execCommand('copy');
+            if (successful) {
+                alert("已复制到剪贴板");
+            } else {
+                alert("复制失败，请手动复制");
+            }
+        } catch (err) {
+            console.error('Fallback: Oops, unable to copy', err);
+            alert("复制失败，请手动复制");
+        }
+
+        document.body.removeChild(textArea);
+    };
+
+    // Close when clicking outside - DISABLED as per user request
+    // const modalRef = useRef<HTMLDivElement>(null);
+    // const handleBackdropClick = (e: React.MouseEvent) => {
+    //     if (modalRef.current && !modalRef.current.contains(e.target as Node)) {
+    //         onClose();
+    //     }
+    // };
+
+    if (!isOpen) return null;
+
+    return (
+        <div
+            className="fixed inset-0 z-50 flex items-center justify-center bg-black/80 backdrop-blur-sm p-4 animate-in fade-in duration-200"
+        >
+            <div
+                // ref={modalRef}
+                className="bg-[#1a1a1a] border border-white/10 rounded-2xl w-full max-w-2xl max-h-[90vh] overflow-hidden flex flex-col shadow-2xl"
+            >
+                {/* Header */}
+                <div className="flex items-center justify-between p-4 border-b border-white/10 bg-white/5">
+                    <h3 className="text-lg font-semibold text-white flex items-center gap-2">
+                        📜 文案提取助手
+                    </h3>
+                    <button
+                        onClick={onClose}
+                        className="text-gray-400 hover:text-white transition-colors text-2xl leading-none"
+                    >
+                        &times;
+                    </button>
+                </div>
+
+                {/* Content */}
+                <div className="flex-1 overflow-y-auto p-6">
+                    {step === 'config' && (
+                        <div className="space-y-6">
+
+                            {/* Tabs */}
+                            <div className="flex p-1 bg-white/5 rounded-xl border border-white/10">
+                                <button
+                                    onClick={() => setActiveTab('url')}
+                                    className={`flex-1 py-2 rounded-lg text-sm font-medium transition-all ${activeTab === 'url'
+                                        ? 'bg-purple-600 text-white shadow-lg'
+                                        : 'text-gray-400 hover:text-white hover:bg-white/5'
+                                        }`}
+                                >
+                                    🔗 粘贴链接
+                                </button>
+                                <button
+                                    onClick={() => setActiveTab('file')}
+                                    className={`flex-1 py-2 rounded-lg text-sm font-medium transition-all ${activeTab === 'file'
+                                        ? 'bg-purple-600 text-white shadow-lg'
+                                        : 'text-gray-400 hover:text-white hover:bg-white/5'
+                                        }`}
+                                >
+                                    📂 上传文件
+                                </button>
+                            </div>
+
+                            {/* URL Input Area */}
+                            {activeTab === 'url' && (
+                                <div className="space-y-2 py-4">
+                                    <div className="relative">
+                                        <input
+                                            type="text"
+                                            value={inputUrl}
+                                            onChange={(e) => setInputUrl(e.target.value)}
+                                            placeholder="请粘贴抖音、B站等主流平台视频链接..."
+                                            className="w-full bg-black/20 border border-white/10 rounded-xl px-4 py-4 text-white placeholder-gray-500 focus:outline-none focus:border-purple-500 transition-colors"
+                                        />
+                                        {inputUrl && (
+                                            <button
+                                                onClick={() => setInputUrl("")}
+                                                className="absolute right-3 top-1/2 -translate-y-1/2 text-gray-500 hover:text-white p-1"
+                                            >
+                                                ✕
+                                            </button>
+                                        )}
+                                    </div>
+                                    <p className="text-xs text-gray-400 px-1">
+                                        支持抖音、B站等主流平台分享链接，自动解析下载并提取文案。
+                                    </p>
+                                </div>
+                            )}
+
+                            {/* File Upload Area */}
+                            {activeTab === 'file' && (
+                                <div
+                                    className={`
+                                        relative border-2 border-dashed rounded-xl p-8 text-center transition-all cursor-pointer
+                                        ${dragActive ? 'border-purple-500 bg-purple-500/10' : 'border-white/20 hover:border-white/40 hover:bg-white/5'}
+                                        ${selectedFile ? 'bg-purple-900/10 border-purple-500/50' : ''}
+                                    `}
+                                    onDragEnter={handleDrag}
+                                    onDragLeave={handleDrag}
+                                    onDragOver={handleDrag}
+                                    onDrop={handleDrop}
+                                >
+                                    <input
+                                        type="file"
+                                        className="absolute inset-0 w-full h-full opacity-0 cursor-pointer"
+                                        onChange={handleFileChange}
+                                        accept=".mp4,.mov,.avi,.mp3,.wav,.m4a"
+                                    />
+
+                                    {selectedFile ? (
+                                        <div className="flex flex-col items-center">
+                                            <div className="text-4xl mb-2">📄</div>
+                                            <div className="font-medium text-white break-all max-w-xs">{selectedFile.name}</div>
+                                            <div className="text-sm text-gray-400 mt-1">{(selectedFile.size / (1024 * 1024)).toFixed(1)} MB</div>
+                                            <div className="mt-4 text-xs text-purple-400">点击更换文件</div>
+                                        </div>
+                                    ) : (
+                                        <div className="flex flex-col items-center">
+                                            <div className="text-4xl mb-2">📤</div>
+                                            <div className="font-medium text-white">点击上传或拖拽文件到此处</div>
+                                            <div className="text-sm text-gray-400 mt-2">支持 MP4, MOV, MP3, WAV 等音视频格式</div>
+                                        </div>
+                                    )}
+                                </div>
+                            )}
+
+                            {/* Options */}
+                            <div className="bg-white/5 rounded-xl p-4 border border-white/10">
+                                <label className="flex items-center gap-3 cursor-pointer">
+                                    <input
+                                        type="checkbox"
+                                        checked={doRewrite}
+                                        onChange={e => setDoRewrite(e.target.checked)}
+                                        className="w-5 h-5 accent-purple-600 rounded"
+                                    />
+                                    <div>
+                                        <div className="text-white font-medium">启用 AI 洗稿</div>
+                                        <div className="text-xs text-gray-400">自动将提取的文案重写为更自然流畅的口播稿</div>
+                                    </div>
+                                </label>
+                            </div>
+
+                            {error && (
+                                <div className="p-3 bg-red-500/20 text-red-200 rounded-lg text-sm text-center">
+                                    ❌ {error}
+                                </div>
+                            )}
+
+                            <div className="flex justify-center pt-2">
+                                <button
+                                    onClick={handleExtract}
+                                    className="w-full sm:w-auto px-10 py-3 bg-gradient-to-r from-purple-600 to-pink-600 text-white rounded-xl font-bold hover:shadow-lg hover:from-purple-500 hover:to-pink-500 transition-all transform hover:-translate-y-0.5 disabled:opacity-50 disabled:cursor-not-allowed"
+                                    disabled={activeTab === 'file' ? !selectedFile : !inputUrl.trim()}
+                                >
+                                    {activeTab === 'url' ? '🔗 解析并提取' : '🚀 开始提取'}
+                                </button>
+                            </div>
+                        </div>
+                    )}
+
+                    {step === 'processing' && (
+                        <div className="flex flex-col items-center justify-center py-20">
+                            <div className="relative w-20 h-20 mb-6">
+                                <div className="absolute inset-0 border-4 border-purple-500/30 rounded-full"></div>
+                                <div className="absolute inset-0 border-4 border-t-purple-500 rounded-full animate-spin"></div>
+                            </div>
+                            <h4 className="text-xl font-medium text-white mb-2">正在处理中...</h4>
+                            <p className="text-sm text-gray-400 text-center max-w-sm px-4">
+                                {activeTab === 'url' && "正在下载视频..."}<br />
+                                {doRewrite ? "正在进行语音识别和 AI 智能改写..." : "正在进行语音识别..."}<br />
+                                <span className="opacity-75">大文件可能需要几分钟，请不要关闭窗口</span>
+                            </p>
+                        </div>
+                    )}
+
+                    {step === 'result' && (
+                        <div className="space-y-6">
+                            {rewrittenScript && (
+                                <div className="space-y-2">
+                                    <div className="flex justify-between items-center">
+                                        <h4 className="font-semibold text-purple-300 flex items-center gap-2">
+                                            ✨ AI 洗稿结果 <span className="text-xs font-normal text-purple-400/70">(推荐)</span>
+                                        </h4>
+                                        {onApply && (
+                                            <button
+                                                onClick={() => {
+                                                    onApply(rewrittenScript);
+                                                    onClose();
+                                                }}
+                                                className="text-xs bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-500 hover:to-pink-500 text-white px-3 py-1.5 rounded-lg transition-colors flex items-center gap-1 shadow-sm"
+                                            >
+                                                📥 填入
+                                            </button>
+                                        )}
+                                        <button
+                                            onClick={() => copyToClipboard(rewrittenScript)}
+                                            className="text-xs bg-purple-600 hover:bg-purple-500 text-white px-3 py-1.5 rounded-lg transition-colors flex items-center gap-1"
+                                        >
+                                            📋 复制内容
+                                        </button>
+                                    </div>
+                                    <div className="bg-purple-900/10 border border-purple-500/20 rounded-xl p-4 max-h-60 overflow-y-auto custom-scrollbar">
+                                        <p className="text-gray-200 text-sm leading-relaxed whitespace-pre-wrap">
+                                            {rewrittenScript}
+                                        </p>
+                                    </div>
+                                </div>
+                            )}
+
+                            <div className="space-y-2">
+                                <div className="flex justify-between items-center">
+                                    <h4 className="font-semibold text-gray-400 flex items-center gap-2">
+                                        🎙️ 原始识别结果
+                                    </h4>
+                                    {onApply && (
+                                        <button
+                                            onClick={() => {
+                                                onApply(script);
+                                                onClose();
+                                            }}
+                                            className="text-xs bg-white/10 hover:bg-white/20 text-white px-3 py-1.5 rounded-lg transition-colors flex items-center gap-1"
+                                        >
+                                            📥 填入
+                                        </button>
+                                    )}
+                                    <button
+                                        onClick={() => copyToClipboard(script)}
+                                        className="text-xs bg-white/10 hover:bg-white/20 text-white px-3 py-1.5 rounded-lg transition-colors"
+                                    >
+                                        复制
+                                    </button>
+                                </div>
+                                <div className="bg-white/5 border border-white/10 rounded-xl p-4 max-h-40 overflow-y-auto custom-scrollbar">
+                                    <p className="text-gray-400 text-sm leading-relaxed whitespace-pre-wrap">
+                                        {script}
+                                    </p>
+                                </div>
+                            </div>
+
+                            <div className="flex justify-center pt-4">
+                                <button
+                                    onClick={() => {
+                                        setStep('config');
+                                        setScript("");
+                                        setRewrittenScript("");
+                                        setSelectedFile(null);
+                                        setInputUrl("");
+                                        // Keep current tab active
+                                    }}
+                                    className="px-6 py-2 bg-white/10 hover:bg-white/20 text-white rounded-lg transition-colors"
+                                >
+                                    提取下一个
+                                </button>
+                            </div>
+                        </div>
+                    )}
+                </div>
+            </div>
+        </div>
+    );
+}
--- a/frontend/src/components/VideoPreviewModal.tsx
+++ b/frontend/src/components/VideoPreviewModal.tsx
@@ -0,0 +1,80 @@
+"use client";
+
+import { useEffect } from "react";
+import { X, Video } from "lucide-react";
+
+interface VideoPreviewModalProps {
+    videoUrl: string | null;
+    onClose: () => void;
+    title?: string;
+    subtitle?: string;
+}
+
+export default function VideoPreviewModal({
+    videoUrl,
+    onClose,
+    title = "视频预览",
+    subtitle = "ESC 关闭 · 点击空白关闭",
+}: VideoPreviewModalProps) {
+    useEffect(() => {
+        if (!videoUrl) return;
+        // Close on ESC
+        const handleEsc = (e: KeyboardEvent) => {
+            if (e.key === 'Escape') onClose();
+        };
+        const prevOverflow = document.body.style.overflow;
+        document.addEventListener('keydown', handleEsc);
+        // Prevent background scrolling while the modal is open
+        document.body.style.overflow = 'hidden';
+
+        return () => {
+            document.removeEventListener('keydown', handleEsc);
+            document.body.style.overflow = prevOverflow;
+        };
+    }, [videoUrl, onClose]);
+
+    if (!videoUrl) return null;
+
+    return (
+        <div
+            className="fixed inset-0 z-[200] flex items-center justify-center bg-black/80 backdrop-blur-sm p-4 animate-in fade-in duration-200"
+            onClick={onClose}
+        >
+            <div
+                className="relative w-full max-w-4xl bg-gray-900 border border-white/10 rounded-2xl shadow-2xl overflow-hidden flex flex-col"
+                onClick={(e) => e.stopPropagation()}
+            >
+                <div className="flex items-center justify-between px-6 py-3 border-b border-white/10 bg-gradient-to-r from-white/5 via-white/0 to-white/5">
+                    <div className="flex items-center gap-3">
+                        <div className="h-9 w-9 rounded-lg bg-white/10 flex items-center justify-center text-white">
+                            <Video className="h-5 w-5" />
+                        </div>
+                        <div>
+                            <h3 className="text-lg font-semibold text-white">
+                                {title}
+                            </h3>
+                            <p className="text-xs text-gray-400">
+                                {subtitle}
+                            </p>
+                        </div>
+                    </div>
+                    <button
+                        onClick={onClose}
+                        className="p-2 text-gray-400 hover:text-white hover:bg-white/10 rounded-lg transition-colors"
+                    >
+                        <X className="h-5 w-5" />
+                    </button>
+                </div>
+
+                <div className="bg-black flex items-center justify-center min-h-[50vh] max-h-[80vh]">
+                    <video
+                        src={videoUrl}
+                        controls
+                        autoPlay
+                        className="w-full h-full max-h-[80vh] object-contain"
+                    />
+                </div>
+            </div>
+        </div>
+    );
+}
--- a/frontend/src/contexts/AuthContext.tsx
+++ b/frontend/src/contexts/AuthContext.tsx
@@ -0,0 +1,80 @@
+"use client";
+
+import { createContext, useContext, useState, useEffect, ReactNode } from "react";
+import api from "@/shared/api/axios";
+
+interface User {
+  id: string;
+  phone: string;
+  username: string | null;
+  role: string;
+  is_active: boolean;
+  expires_at: string | null;
+}
+
+interface AuthContextType {
+  userId: string | null;
+  user: User | null;
+  isLoading: boolean;
+  isAuthenticated: boolean;
+}
+
+const AuthContext = createContext<AuthContextType>({
+  userId: null,
+  user: null,
+  isLoading: true,
+  isAuthenticated: false,
+});
+
+export function AuthProvider({ children }: { children: ReactNode }) {
+  const [user, setUser] = useState<User | null>(null);
+  const [isLoading, setIsLoading] = useState(true);
+
+  useEffect(() => {
+    let retryCount = 0;
+    const maxRetries = 2;
+
+    const fetchUser = async () => {
+      console.log("[AuthContext] 开始获取用户信息...");
+      try {
+        const { data } = await api.get('/api/auth/me');
+        console.log("[AuthContext] 获取用户信息成功:", data);
+        if (data && data.id) {
+          setUser(data);
+          console.log("[AuthContext] 设置 user:", data);
+        } else {
+          console.warn("[AuthContext] 响应中没有用户数据");
+        }
+        setIsLoading(false);
+      } catch (error) {
+        console.error("[AuthContext] 获取用户信息失败:", error);
+        // Retry logic
+        if (retryCount < maxRetries) {
+          retryCount++;
+          console.log(`[AuthContext] 重试 ${retryCount}/${maxRetries}...`);
+          setTimeout(fetchUser, 1000);
+        } else {
+          console.error("[AuthContext] 重试次数用尽，放弃获取用户信息");
+          setIsLoading(false);
+        }
+      }
+    };
+
+    fetchUser();
+  }, []);
+
+  return (
+    <AuthContext.Provider value={{
+      userId: user?.id || null,
+      user,
+      isLoading,
+      isAuthenticated: !!user
+    }}>
+      {children}
+    </AuthContext.Provider>
+  );
+}
+
+export function useAuth() {
+  return useContext(AuthContext);
+}
--- a/frontend/src/contexts/TaskContext.tsx
+++ b/frontend/src/contexts/TaskContext.tsx
@@ -0,0 +1,119 @@
+"use client";
+
+import { createContext, useContext, useState, useEffect, ReactNode } from "react";
+import api from "@/shared/api/axios";
+
+interface Task {
+  task_id: string;
+  status: string;
+  progress: number;
+  message: string;
+  download_url?: string;
+}
+
+interface TaskContextType {
+  currentTask: Task | null;
+  isGenerating: boolean;
+  startTask: (taskId: string) => void;
+  clearTask: () => void;
+}
+
+const TaskContext = createContext<TaskContextType | undefined>(undefined);
+
+export function TaskProvider({ children }: { children: ReactNode }) {
+  const [currentTask, setCurrentTask] = useState<Task | null>(null);
+  const [isGenerating, setIsGenerating] = useState(false);
+  const [taskId, setTaskId] = useState<string | null>(null);
+
+  // Poll the task status
+  useEffect(() => {
+    if (!taskId) return;
+
+    const pollTask = async () => {
+      try {
+        const { data } = await api.get(`/api/videos/tasks/${taskId}`);
+        setCurrentTask(data);
+
+        // Handle the completed, failed, and not-found cases
+        if (data.status === "completed" || data.status === "failed" || data.status === "not_found") {
+          setIsGenerating(false);
+          setTaskId(null);
+          // Clear the persisted task keys from localStorage
+          if (typeof window !== 'undefined') {
+            const keys = Object.keys(localStorage);
+            keys.forEach(key => {
+              if (key.includes('_current_task')) {
+                localStorage.removeItem(key);
+              }
+            });
+          }
+        }
+      } catch (error) {
+        console.error("轮询任务失败:", error);
+        setIsGenerating(false);
+        setTaskId(null);
+        // Clear the persisted task keys from localStorage
+        if (typeof window !== 'undefined') {
+          const keys = Object.keys(localStorage);
+          keys.forEach(key => {
+            if (key.includes('_current_task')) {
+              localStorage.removeItem(key);
+            }
+          });
+        }
+      }
+    };
+
+    // Run once immediately
+    pollTask();
+
+    // Poll every second
+    const interval = setInterval(pollTask, 1000);
+
+    return () => clearInterval(interval);
+  }, [taskId]);
+
+  // Restore the task on page load
+  useEffect(() => {
+    if (typeof window === 'undefined') return;
+
+    // Find the first localStorage key that holds a saved task ID
+    const keys = Object.keys(localStorage);
+    const taskKey = keys.find(key => key.includes('_current_task'));
+
+    if (taskKey) {
+      const savedTaskId = localStorage.getItem(taskKey);
+      if (savedTaskId) {
+        console.log("[TaskContext] 恢复任务:", savedTaskId);
+        setTaskId(savedTaskId);
+        setIsGenerating(true);
+      }
+    }
+  }, []);
+
+  const startTask = (newTaskId: string) => {
+    setTaskId(newTaskId);
+    setIsGenerating(true);
+    setCurrentTask(null);
+  };
+
+  const clearTask = () => {
+    setTaskId(null);
+    setIsGenerating(false);
+    setCurrentTask(null);
+  };
+
+  return (
+    <TaskContext.Provider value={{ currentTask, isGenerating, startTask, clearTask }}>
+      {children}
+    </TaskContext.Provider>
+  );
+}
+
+export function useTask() {
+  const context = useContext(TaskContext);
+  if (context === undefined) {
+    throw new Error("useTask must be used within a TaskProvider");
+  }
+  return context;
+}
--- a/frontend/src/features/home/model/useBgm.ts
+++ b/frontend/src/features/home/model/useBgm.ts
@@ -0,0 +1,55 @@
+import { useCallback, useState } from "react";
+import api from "@/shared/api/axios";
+
+export interface BgmItem {
+  id: string;
+  name: string;
+  ext?: string;
+}
+
+interface UseBgmOptions {
+  storageKey: string;
+  selectedBgmId: string;
+  setSelectedBgmId: React.Dispatch<React.SetStateAction<string>>;
+}
+
+export const useBgm = ({
+  storageKey,
+  selectedBgmId,
+  setSelectedBgmId,
+}: UseBgmOptions) => {
+  const [bgmList, setBgmList] = useState<BgmItem[]>([]);
+  const [bgmLoading, setBgmLoading] = useState(false);
+  const [bgmError, setBgmError] = useState<string>("");
+
+  const fetchBgmList = useCallback(async () => {
+    setBgmLoading(true);
+    setBgmError("");
+    try {
+      const { data } = await api.get('/api/assets/bgm');
+      const items: BgmItem[] = Array.isArray(data.bgm) ? data.bgm : [];
+      setBgmList(items);
+
+      const savedBgmId = localStorage.getItem(`vigent_${storageKey}_bgmId`);
+      setSelectedBgmId((prev) => {
+        if (prev && items.some((item) => item.id === prev)) return prev;
+        if (savedBgmId && items.some((item) => item.id === savedBgmId)) return savedBgmId;
+        return items[0]?.id || "";
+      });
+    } catch (error: any) {
+      const message = error?.response?.data?.detail || error?.message || '加载失败';
+      setBgmError(message);
+      setBgmList([]);
+      console.error("获取背景音乐失败:", error);
+    } finally {
+      setBgmLoading(false);
+    }
+  }, [setSelectedBgmId, storageKey]);
+
+  return {
+    bgmList,
+    bgmLoading,
+    bgmError,
+    fetchBgmList,
+  };
+};
--- a/frontend/src/features/home/model/useGeneratedVideos.ts
+++ b/frontend/src/features/home/model/useGeneratedVideos.ts
@@ -0,0 +1,81 @@
+import { useCallback, useState } from "react";
+import api from "@/shared/api/axios";
+
+interface GeneratedVideo {
+  id: string;
+  name: string;
+  path: string;
+  size_mb: number;
+  created_at: number;
+}
+
+interface UseGeneratedVideosOptions {
+  storageKey: string;
+  selectedVideoId: string | null;
+  setSelectedVideoId: React.Dispatch<React.SetStateAction<string | null>>;
+  setGeneratedVideo: React.Dispatch<React.SetStateAction<string | null>>;
+  resolveMediaUrl: (url?: string | null) => string | null;
+}
+
+export const useGeneratedVideos = ({
+  storageKey,
+  selectedVideoId,
+  setSelectedVideoId,
+  setGeneratedVideo,
+  resolveMediaUrl,
+}: UseGeneratedVideosOptions) => {
+  const [generatedVideos, setGeneratedVideos] = useState<GeneratedVideo[]>([]);
+
+  const fetchGeneratedVideos = useCallback(async (preferVideoId?: string) => {
+    try {
+      const { data } = await api.get('/api/videos/generated');
+      const videos: GeneratedVideo[] = data.videos || [];
+      setGeneratedVideos(videos);
+
+      const savedSelectedVideoId = localStorage.getItem(`vigent_${storageKey}_selectedVideoId`);
+      const currentId = preferVideoId || selectedVideoId || savedSelectedVideoId || null;
+      let nextId: string | null = null;
+      let nextUrl: string | null = null;
+
+      if (currentId) {
+        const found = videos.find(v => v.id === currentId);
+        if (found) {
+          nextId = found.id;
+          nextUrl = resolveMediaUrl(found.path);
+        }
+      }
+
+      if (!nextId && videos.length > 0) {
+        nextId = videos[0].id;
+        nextUrl = resolveMediaUrl(videos[0].path);
+      }
+
+      if (nextId) {
+        setSelectedVideoId(nextId);
+        setGeneratedVideo(nextUrl);
+      }
+    } catch (error) {
+      console.error("Failed to fetch generated videos:", error);
+    }
+  }, [resolveMediaUrl, selectedVideoId, setGeneratedVideo, setSelectedVideoId, storageKey]);
+
+  const deleteVideo = useCallback(async (videoId: string) => {
+    if (!confirm("确定要删除这个视频吗？")) return;
+    try {
+      await api.delete(`/api/videos/generated/${videoId}`);
+      if (selectedVideoId === videoId) {
+        setSelectedVideoId(null);
+        setGeneratedVideo(null);
+      }
+      fetchGeneratedVideos();
+    } catch (error) {
+      alert("删除失败: " + error);
+    }
+  }, [fetchGeneratedVideos, selectedVideoId, setGeneratedVideo, setSelectedVideoId]);
+
+  return {
+    generatedVideos,
+    fetchGeneratedVideos,
+    deleteVideo,
+  };
+};
--- a/frontend/src/features/home/model/useHomeController.ts
+++ b/frontend/src/features/home/model/useHomeController.ts
@@ -0,0 +1,743 @@
+import { useEffect, useRef, useState } from "react";
+import api from "@/shared/api/axios";
+import {
+  buildTextShadow,
+  formatDate,
+  getApiBaseUrl,
+  getFontFormat,
+  resolveAssetUrl,
+  resolveBgmUrl,
+  resolveMediaUrl,
+} from "@/shared/lib/media";
+import { clampTitle } from "@/shared/lib/title";
+import { useTitleInput } from "@/shared/hooks/useTitleInput";
+import { useAuth } from "@/contexts/AuthContext";
+import { useTask } from "@/contexts/TaskContext";
+import { useBgm } from "@/features/home/model/useBgm";
+import { useGeneratedVideos } from "@/features/home/model/useGeneratedVideos";
+import { useHomePersistence } from "@/features/home/model/useHomePersistence";
+import { useMaterials } from "@/features/home/model/useMaterials";
+import { useMediaPlayers } from "@/features/home/model/useMediaPlayers";
+import { useRefAudios } from "@/features/home/model/useRefAudios";
+import { useTitleSubtitleStyles } from "@/features/home/model/useTitleSubtitleStyles";
+
+const VOICES = [
+  { id: "zh-CN-YunxiNeural", name: "云溪 (男声-年轻)" },
+  { id: "zh-CN-YunjianNeural", name: "云健 (男声-新闻)" },
+  { id: "zh-CN-YunyangNeural", name: "云扬 (男声-专业)" },
+  { id: "zh-CN-XiaoxiaoNeural", name: "晓晓 (女声-活泼)" },
+  { id: "zh-CN-XiaoyiNeural", name: "晓伊 (女声-温柔)" },
+];
+
+const FIXED_REF_TEXT =
+  "其实生活中有许多美好的瞬间，比如清晨的阳光，或者一杯温热的清茶。希望这次生成的音色能够自然、流畅，完美还原出我最真实的声音状态。";
+
+const scrollContainerToItem = (container: HTMLDivElement, item: HTMLDivElement) => {
+  const containerRect = container.getBoundingClientRect();
+  const itemRect = item.getBoundingClientRect();
+  const itemTop = itemRect.top - containerRect.top + container.scrollTop;
+  const itemBottom = itemTop + itemRect.height;
+  const viewTop = container.scrollTop;
+  const viewBottom = viewTop + container.clientHeight;
+
+  if (itemTop < viewTop) {
+    container.scrollTo({ top: Math.max(itemTop - 8, 0), behavior: "smooth" });
+  } else if (itemBottom > viewBottom) {
+    container.scrollTo({ top: itemBottom - container.clientHeight + 8, behavior: "smooth" });
+  }
+};
+
+interface GeneratedVideo {
+  id: string;
+  name: string;
+  path: string;
+  size_mb: number;
+  created_at: number;
+}
+
+interface RefAudio {
+  id: string;
+  name: string;
+  path: string;
+  ref_text: string;
+  duration_sec: number;
+  created_at: number;
+}
+
+export const useHomeController = () => {
+  const apiBase = getApiBaseUrl();
+
+  const [selectedMaterial, setSelectedMaterial] = useState<string>("");
+  const [previewMaterial, setPreviewMaterial] = useState<string | null>(null);
+
+  const [text, setText] = useState<string>("");
+  const [voice, setVoice] = useState<string>("zh-CN-YunxiNeural");
+
+  // Use the global task state
+  const { currentTask, isGenerating, startTask } = useTask();
+
+  const [generatedVideo, setGeneratedVideo] = useState<string | null>(null);
+  const [selectedVideoId, setSelectedVideoId] = useState<string | null>(null);
+
+  // Subtitle and title state
+  const [videoTitle, setVideoTitle] = useState<string>("");
+  const [enableSubtitles, setEnableSubtitles] = useState<boolean>(true);
+  const [selectedSubtitleStyleId, setSelectedSubtitleStyleId] = useState<string>("");
+  const [selectedTitleStyleId, setSelectedTitleStyleId] = useState<string>("");
+  const [subtitleFontSize, setSubtitleFontSize] = useState<number>(60);
+  const [titleFontSize, setTitleFontSize] = useState<number>(90);
+  const [subtitleSizeLocked, setSubtitleSizeLocked] = useState<boolean>(false);
+  const [titleSizeLocked, setTitleSizeLocked] = useState<boolean>(false);
+  const [showStylePreview, setShowStylePreview] = useState<boolean>(false);
+  const [materialDimensions, setMaterialDimensions] = useState<{ width: number; height: number } | null>(null);
+  const [previewContainerWidth, setPreviewContainerWidth] = useState<number>(0);
+
+  // Background music (BGM) state
+  const [selectedBgmId, setSelectedBgmId] = useState<string>("");
+  const [enableBgm, setEnableBgm] = useState<boolean>(false);
+  const [bgmVolume, setBgmVolume] = useState<number>(0.2);
+
+  // Voice-cloning state
+  const [ttsMode, setTtsMode] = useState<"edgetts" | "voiceclone">("edgetts");
+  const [selectedRefAudio, setSelectedRefAudio] = useState<RefAudio | null>(null);
+  const [refText, setRefText] = useState(FIXED_REF_TEXT);
+
+  // Audio preview and rename state
+  const [editingAudioId, setEditingAudioId] = useState<string | null>(null);
+  const [editName, setEditName] = useState("");
+  const bgmItemRefs = useRef<Record<string, HTMLDivElement | null>>({});
+  const bgmListContainerRef = useRef<HTMLDivElement | null>(null);
+  const titlePreviewContainerRef = useRef<HTMLDivElement | null>(null);
+  const materialItemRefs = useRef<Record<string, HTMLDivElement | null>>({});
+  const videoItemRefs = useRef<Record<string, HTMLDivElement | null>>({});
+
+  // Rename a reference audio
+  const startEditing = (audio: RefAudio, e: React.MouseEvent) => {
+    e.stopPropagation();
+    setEditingAudioId(audio.id);
+    // Strip the file extension before editing (better UX)
+    const nameWithoutExt = audio.name.substring(0, audio.name.lastIndexOf("."));
+    setEditName(nameWithoutExt || audio.name);
+  };
+
+  const cancelEditing = (e: React.MouseEvent) => {
+    e.stopPropagation();
+    setEditingAudioId(null);
+    setEditName("");
+  };
+
+  const saveEditing = async (audioId: string, e: React.MouseEvent) => {
+    e.stopPropagation();
+    if (!editName.trim()) return;
+
+    try {
+      await api.put(`/api/ref-audios/${encodeURIComponent(audioId)}`, { new_name: editName });
+      setEditingAudioId(null);
+      fetchRefAudios(); // Refresh the list
+    } catch (err: any) {
+      alert("重命名失败: " + err);
+    }
+  };
+
+  // AI-generated title and tags
+  const [isGeneratingMeta, setIsGeneratingMeta] = useState(false);
+
+  // In-browser recording state
+  const [isRecording, setIsRecording] = useState(false);
+  const [recordedBlob, setRecordedBlob] = useState<Blob | null>(null);
+  const [recordingTime, setRecordingTime] = useState(0);
+  const mediaRecorderRef = useRef<MediaRecorder | null>(null);
+  const recordingIntervalRef = useRef<ReturnType<typeof setInterval> | null>(null);
+
+  // Use the global auth state
+  const { userId, isLoading: isAuthLoading } = useAuth();
+
+  // Script-extraction modal
+  const [extractModalOpen, setExtractModalOpen] = useState(false);
+
+  // Storage key prefix: userId for signed-in users, "guest" otherwise
+  const storageKey = userId || "guest";
+
+  const {
+    materials,
+    fetchError,
+    isUploading,
+    uploadProgress,
+    uploadError,
+    setUploadError,
+    fetchMaterials,
+    deleteMaterial,
+    handleUpload,
+  } = useMaterials({
+    selectedMaterial,
+    setSelectedMaterial,
+  });
+
+  const {
+    subtitleStyles,
+    titleStyles,
+    refreshSubtitleStyles,
+    refreshTitleStyles,
+  } = useTitleSubtitleStyles({
+    isAuthLoading,
+    storageKey,
+    setSelectedSubtitleStyleId,
+    setSelectedTitleStyleId,
+  });
+
+  const {
+    refAudios,
+    isUploadingRef,
+    uploadRefError,
+    setUploadRefError,
+    fetchRefAudios,
+    uploadRefAudio,
+    deleteRefAudio,
+  } = useRefAudios({
+    fixedRefText: FIXED_REF_TEXT,
+    selectedRefAudio,
+    setSelectedRefAudio,
+    setRefText,
+  });
+
+  const {
+    bgmList,
+    bgmLoading,
+    bgmError,
+    fetchBgmList,
+  } = useBgm({
+    storageKey,
+    selectedBgmId,
+    setSelectedBgmId,
+  });
+
+  const {
+    playingAudioId,
+    playingBgmId,
+    togglePlayPreview,
+    toggleBgmPreview,
+  } = useMediaPlayers({
+    bgmVolume,
+    resolveBgmUrl,
+    resolveMediaUrl,
+    setSelectedBgmId,
+    setEnableBgm,
+  });
+
+  const {
+    generatedVideos,
+    fetchGeneratedVideos,
+    deleteVideo,
+  } = useGeneratedVideos({
+    storageKey,
+    selectedVideoId,
+    setSelectedVideoId,
+    setGeneratedVideo,
+    resolveMediaUrl,
+  });
+
+  const { isRestored } = useHomePersistence({
+    isAuthLoading,
+    storageKey,
+    text,
+    setText,
+    videoTitle,
+    setVideoTitle,
+    enableSubtitles,
+    setEnableSubtitles,
+    ttsMode,
+    setTtsMode,
+    voice,
+    setVoice,
+    selectedMaterial,
+    setSelectedMaterial,
+    selectedSubtitleStyleId,
+    setSelectedSubtitleStyleId,
+    selectedTitleStyleId,
+    setSelectedTitleStyleId,
+    subtitleFontSize,
+    setSubtitleFontSize,
+    titleFontSize,
+    setTitleFontSize,
+    setSubtitleSizeLocked,
+    setTitleSizeLocked,
+    selectedBgmId,
+    setSelectedBgmId,
+    bgmVolume,
+    setBgmVolume,
+    enableBgm,
+    setEnableBgm,
+    selectedVideoId,
+    setSelectedVideoId,
+    selectedRefAudio,
+  });
+
+  const syncTitleToPublish = (value: string) => {
+    if (typeof window !== "undefined") {
+      localStorage.setItem(`vigent_${storageKey}_publish_title`, value);
+    }
+  };
+
+  const titleInput = useTitleInput({
+    value: videoTitle,
+    onChange: setVideoTitle,
+    onCommit: syncTitleToPublish,
+  });
+
+  // Load materials, generated videos, reference audios, styles, and BGM once auth has resolved
+  useEffect(() => {
+    if (isAuthLoading) return;
+    void Promise.allSettled([
+      fetchMaterials(),
+      fetchGeneratedVideos(),
+      fetchRefAudios(),
+      refreshSubtitleStyles(),
+      refreshTitleStyles(),
+      fetchBgmList(),
+    ]);
+  }, [isAuthLoading]);
+
+  useEffect(() => {
+    const material = materials.find((item) => item.id === selectedMaterial);
+    if (!material?.path) {
+      setMaterialDimensions(null);
+      return;
+    }
+    const url = resolveMediaUrl(material.path);
+    if (!url) {
+      setMaterialDimensions(null);
+      return;
+    }
+
+    let isActive = true;
+    const video = document.createElement("video");
+    video.crossOrigin = "anonymous";
+    video.preload = "metadata";
+    video.src = url;
+    video.load();
+
+    const handleLoaded = () => {
+      if (!isActive) return;
+      if (video.videoWidth && video.videoHeight) {
+        setMaterialDimensions({ width: video.videoWidth, height: video.videoHeight });
+      } else {
+        setMaterialDimensions(null);
+      }
+    };
+
+    const handleError = () => {
+      if (!isActive) return;
+      setMaterialDimensions(null);
+    };
+
+    video.addEventListener("loadedmetadata", handleLoaded);
+    video.addEventListener("error", handleError);
+
+    return () => {
+      isActive = false;
+      video.removeEventListener("loadedmetadata", handleLoaded);
+      video.removeEventListener("error", handleError);
+    };
+  }, [materials, selectedMaterial]);
+
+  useEffect(() => {
+    if (!titlePreviewContainerRef.current) return;
+    const container = titlePreviewContainerRef.current;
+
+    const resizeObserver = new ResizeObserver((entries) => {
+      for (const entry of entries) {
+        setPreviewContainerWidth(entry.contentRect.width);
+      }
+    });
+
+    resizeObserver.observe(container);
+
+    return () => {
+      resizeObserver.disconnect();
+    };
+  }, []);
+
+  useEffect(() => {
+    if (subtitleSizeLocked || subtitleStyles.length === 0) return;
+    const active = subtitleStyles.find((s) => s.id === selectedSubtitleStyleId)
+      || subtitleStyles.find((s) => s.is_default)
+      || subtitleStyles[0];
+    if (active?.font_size) {
+      setSubtitleFontSize(active.font_size);
+    }
+  }, [subtitleStyles, selectedSubtitleStyleId, subtitleSizeLocked]);
+
+  useEffect(() => {
+    if (titleSizeLocked || titleStyles.length === 0) return;
+    const active = titleStyles.find((s) => s.id === selectedTitleStyleId)
+      || titleStyles.find((s) => s.is_default)
+      || titleStyles[0];
+    if (active?.font_size) {
+      setTitleFontSize(active.font_size);
+    }
+  }, [titleStyles, selectedTitleStyleId, titleSizeLocked]);
+
+  useEffect(() => {
+    if (!enableBgm || selectedBgmId || bgmList.length === 0) return;
+    const savedBgmId = localStorage.getItem(`vigent_${storageKey}_bgmId`);
+    const savedItem = savedBgmId && bgmList.find((item) => item.id === savedBgmId);
+    if (savedItem) {
+      setSelectedBgmId(savedBgmId);
+      return;
+    }
+    setSelectedBgmId(bgmList[0].id);
+  }, [enableBgm, selectedBgmId, bgmList, storageKey, setSelectedBgmId]);
+
+  useEffect(() => {
+    if (!selectedBgmId) return;
+    const container = bgmListContainerRef.current;
+    const target = bgmItemRefs.current[selectedBgmId];
+    if (container && target) {
+      scrollContainerToItem(container, target);
+    }
+  }, [selectedBgmId, bgmList]);
+
+  useEffect(() => {
+    if (!selectedMaterial) return;
+    const target = materialItemRefs.current[selectedMaterial];
+    if (target) {
+      target.scrollIntoView({ block: "nearest", behavior: "smooth" });
+    }
+  }, [selectedMaterial, materials]);
+
+  useEffect(() => {
+    if (!selectedVideoId) return;
+    const target = videoItemRefs.current[selectedVideoId];
+    if (target) {
+      target.scrollIntoView({ block: "nearest", behavior: "smooth" });
+    }
+  }, [selectedVideoId, generatedVideos]);
+
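+  // Auto-select a reference audio (restore the last selection, or default to the newest)
+  useEffect(() => {
+    // Run only once the data has loaded and nothing is selected yet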
+    if (refAudios.length > 0 && !selectedRefAudio && isRestored) {
+      const savedId = localStorage.getItem(`vigent_${storageKey}_refAudioId`);
+      let targetAudio = null;
+
+      if (savedId) {
+        targetAudio = refAudios.find((a) => a.id === savedId);
+      }
+
+      // If there is no saved selection, or it is no longer in the list, default to the first (newest) item
+      if (!targetAudio) {
+        targetAudio = refAudios[0];
+      }
+
+      setSelectedRefAudio(targetAudio);
+      setRefText(targetAudio.ref_text);
+    }
+  }, [refAudios, selectedRefAudio, isRestored, storageKey, setSelectedRefAudio, setRefText]);
+
+  useEffect(() => {
+    if (!selectedRefAudio || !isRestored) return;
+    localStorage.setItem(`vigent_${storageKey}_refAudioId`, selectedRefAudio.id);
+  }, [selectedRefAudio, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (!selectedRefAudio) return;
+    setRefText(selectedRefAudio.ref_text);
+  }, [selectedRefAudio]);
+
+  // Start recording
+  const startRecording = async () => {
+    try {
+      const stream = await navigator.mediaDevices.getUserMedia({ audio: true });
+      const mediaRecorder = new MediaRecorder(stream, { mimeType: "audio/webm" });
+      const chunks: BlobPart[] = [];
+
+      mediaRecorder.ondataavailable = (e) => chunks.push(e.data);
+      mediaRecorder.onstop = () => {
+        const blob = new Blob(chunks, { type: "audio/webm" });
+        setRecordedBlob(blob);
+        stream.getTracks().forEach((track) => track.stop());
+      };
+
+      mediaRecorder.start();
+      setIsRecording(true);
+      setRecordingTime(0);
+      mediaRecorderRef.current = mediaRecorder;
+
+      // Elapsed-time counter (ticks once per second)
+      recordingIntervalRef.current = setInterval(() => {
+        setRecordingTime((prev) => prev + 1);
+      }, 1000);
+    } catch (err) {
+      alert("无法访问麦克风，请检查权限设置");
+      console.error(err);
+    }
+  };
+
+  // Stop recording
+  const stopRecording = () => {
+    mediaRecorderRef.current?.stop();
+    setIsRecording(false);
+    if (recordingIntervalRef.current) {
+      clearInterval(recordingIntervalRef.current);
+      recordingIntervalRef.current = null;
+    }
+  };
+
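+  // Use the recording (upload it to the backend with the fixed reference text)
+  const useRecording = async () => {
+    if (!recordedBlob) return;
+
+    // Reverted: use a fixed filename and rely on the backend to auto-rename duplicates (e.g. recording(1).webm)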
+    const filename = "recording.webm";
+
+    const file = new File([recordedBlob], filename, { type: "audio/webm" });
+    await uploadRefAudio(file);
+    setRecordedBlob(null);
+    setRecordingTime(0);
+  };
+
+  // Format the recording duration as m:ss
+  const formatRecordingTime = (seconds: number) => {
+    const mins = Math.floor(seconds / 60);
+    const secs = seconds % 60;
+    return `${mins}:${secs.toString().padStart(2, "0")}`;
+  };
+
+  // Generate the title and tags with AI
+  const handleGenerateMeta = async () => {
+    if (!text.trim()) {
+      alert("请先输入口播文案");
+      return;
+    }
+
+    setIsGeneratingMeta(true);
+    try {
+      const { data } = await api.post("/api/ai/generate-meta", { text: text.trim() });
+
+      // Update the title on the home page
+      const nextTitle = clampTitle(data.title || "");
+      titleInput.commitValue(nextTitle);
+
+      // Sync the tags to the publish page via localStorage
+      localStorage.setItem(`vigent_${storageKey}_publish_tags`, JSON.stringify(data.tags || []));
+    } catch (err: any) {
+      console.error("AI generate meta failed:", err);
+      const errorMsg = err.response?.data?.detail || err.message || String(err);
+      alert(`AI 生成失败: ${errorMsg}`);
+    } finally {
+      setIsGeneratingMeta(false);
+    }
+  };
+
+  // Generate the video
+  const handleGenerate = async () => {
+    if (!selectedMaterial || !text.trim()) {
+      alert("请先选择素材并填写文案");
+      return;
+    }
+
+    // Validate voice-clone mode
+    if (ttsMode === "voiceclone") {
+      if (!selectedRefAudio) {
+        alert("请选择或上传参考音频");
+        return;
+      }
+    }
+
+    if (enableBgm && !selectedBgmId) {
+      alert("请选择背景音乐");
+      return;
+    }
+
+    setGeneratedVideo(null);
+
+    try {
+      // Look up the selected material object to get its path
+      const materialObj = materials.find((m) => m.id === selectedMaterial);
+      if (!materialObj) {
+        alert("素材数据异常");
+        return;
+      }
+
+      // Build the request payload
+      const payload: Record<string, any> = {
+        material_path: materialObj.path,
+        text: text,
+        tts_mode: ttsMode,
+        title: videoTitle.trim() || undefined,
+        enable_subtitles: enableSubtitles,
+      };
+
+      if (enableSubtitles && selectedSubtitleStyleId) {
+        payload.subtitle_style_id = selectedSubtitleStyleId;
+      }
+
+      if (enableSubtitles && subtitleFontSize) {
+        payload.subtitle_font_size = Math.round(subtitleFontSize);
+      }
+
+      if (videoTitle.trim() && selectedTitleStyleId) {
+        payload.title_style_id = selectedTitleStyleId;
+      }
+
+      if (videoTitle.trim() && titleFontSize) {
+        payload.title_font_size = Math.round(titleFontSize);
+      }
+
+      if (enableBgm && selectedBgmId) {
+        payload.bgm_id = selectedBgmId;
+        payload.bgm_volume = bgmVolume;
+      }
+
+      if (ttsMode === "edgetts") {
+        payload.voice = voice;
+      } else {
+        payload.ref_audio_id = selectedRefAudio!.id;
+        payload.ref_text = refText;
+      }
+
+      // Create the generation task
+      const { data } = await api.post("/api/videos/generate", payload);
+
+      const taskId = data.task_id;
+
+      // Persist the task ID to localStorage so it can be restored after navigating away
+      localStorage.setItem(`vigent_${storageKey}_current_task`, taskId);
+
+      // Start the task through the global TaskContext
+      startTask(taskId);
+    } catch (error) {
+      console.error("Video generation failed:", error);
+    }
+  };
+
+  const handleSelectRefAudio = (audio: RefAudio) => {
+    setSelectedRefAudio(audio);
+    setRefText(audio.ref_text);
+  };
+
+  const handlePreviewMaterial = (path: string) => {
+    setPreviewMaterial(resolveMediaUrl(path));
+  };
+
+  const handleSelectVideo = (video: GeneratedVideo) => {
+    setSelectedVideoId(video.id);
+    setGeneratedVideo(resolveMediaUrl(video.path));
+  };
+
+  const registerMaterialRef = (id: string, el: HTMLDivElement | null) => {
+    materialItemRefs.current[id] = el;
+  };
+
+  const registerBgmItemRef = (id: string, el: HTMLDivElement | null) => {
+    bgmItemRefs.current[id] = el;
+  };
+
+  const registerVideoRef = (id: string, el: HTMLDivElement | null) => {
+    videoItemRefs.current[id] = el;
+  };
+
+  return {
+    apiBase,
+    registerMaterialRef,
+    previewMaterial,
+    setPreviewMaterial,
+    materials,
+    fetchError,
+    isUploading,
+    uploadProgress,
+    uploadError,
+    setUploadError,
+    fetchMaterials,
+    deleteMaterial,
+    handleUpload,
+    selectedMaterial,
+    setSelectedMaterial,
+    handlePreviewMaterial,
+    text,
+    setText,
+    extractModalOpen,
+    setExtractModalOpen,
+    handleGenerateMeta,
+    isGeneratingMeta,
+    showStylePreview,
+    setShowStylePreview,
+    videoTitle,
+    titleInput,
+    titleStyles,
+    selectedTitleStyleId,
+    setSelectedTitleStyleId,
+    titleFontSize,
+    setTitleFontSize,
+    setTitleSizeLocked,
+    subtitleStyles,
+    selectedSubtitleStyleId,
+    setSelectedSubtitleStyleId,
+    subtitleFontSize,
+    setSubtitleFontSize,
+    setSubtitleSizeLocked,
+    enableSubtitles,
+    setEnableSubtitles,
+    resolveAssetUrl,
+    getFontFormat,
+    buildTextShadow,
+    previewContainerWidth,
+    materialDimensions,
+    titlePreviewContainerRef,
+    ttsMode,
+    setTtsMode,
+    voices: VOICES,
+    voice,
+    setVoice,
+    refAudios,
+    selectedRefAudio,
+    handleSelectRefAudio,
+    isUploadingRef,
+    uploadRefError,
+    setUploadRefError,
+    uploadRefAudio,
+    fetchRefAudios,
+    playingAudioId,
+    togglePlayPreview,
+    editingAudioId,
+    editName,
+    setEditName,
+    startEditing,
+    saveEditing,
+    cancelEditing,
+    deleteRefAudio,
+    recordedBlob,
+    isRecording,
+    recordingTime,
+    startRecording,
+    stopRecording,
+    useRecording,
+    formatRecordingTime,
+    fixedRefText: FIXED_REF_TEXT,
+    bgmList,
+    bgmLoading,
+    bgmError,
+    enableBgm,
+    setEnableBgm,
+    fetchBgmList,
+    selectedBgmId,
+    setSelectedBgmId,
+    playingBgmId,
+    toggleBgmPreview,
+    bgmVolume,
+    setBgmVolume,
+    bgmListContainerRef,
+    registerBgmItemRef,
+    currentTask,
+    isGenerating,
+    handleGenerate,
+    generatedVideo,
+    generatedVideos,
+    selectedVideoId,
+    handleSelectVideo,
+    deleteVideo,
+    fetchGeneratedVideos,
+    registerVideoRef,
+    formatDate,
+  };
+};
--- a/frontend/src/features/home/model/useHomePersistence.ts
+++ b/frontend/src/features/home/model/useHomePersistence.ts
@@ -0,0 +1,251 @@
+import { useEffect, useState } from "react";
+import { clampTitle } from "@/shared/lib/title";
+
+interface RefAudio {
+  id: string;
+  name: string;
+  path: string;
+  ref_text: string;
+  duration_sec: number;
+  created_at: number;
+}
+
+interface UseHomePersistenceOptions {
+  isAuthLoading: boolean;
+  storageKey: string;
+  text: string;
+  setText: React.Dispatch<React.SetStateAction<string>>;
+  videoTitle: string;
+  setVideoTitle: React.Dispatch<React.SetStateAction<string>>;
+  enableSubtitles: boolean;
+  setEnableSubtitles: React.Dispatch<React.SetStateAction<boolean>>;
+  ttsMode: 'edgetts' | 'voiceclone';
+  setTtsMode: React.Dispatch<React.SetStateAction<'edgetts' | 'voiceclone'>>;
+  voice: string;
+  setVoice: React.Dispatch<React.SetStateAction<string>>;
+  selectedMaterial: string;
+  setSelectedMaterial: React.Dispatch<React.SetStateAction<string>>;
+  selectedSubtitleStyleId: string;
+  setSelectedSubtitleStyleId: React.Dispatch<React.SetStateAction<string>>;
+  selectedTitleStyleId: string;
+  setSelectedTitleStyleId: React.Dispatch<React.SetStateAction<string>>;
+  subtitleFontSize: number;
+  setSubtitleFontSize: React.Dispatch<React.SetStateAction<number>>;
+  titleFontSize: number;
+  setTitleFontSize: React.Dispatch<React.SetStateAction<number>>;
+  setSubtitleSizeLocked: React.Dispatch<React.SetStateAction<boolean>>;
+  setTitleSizeLocked: React.Dispatch<React.SetStateAction<boolean>>;
+  selectedBgmId: string;
+  setSelectedBgmId: React.Dispatch<React.SetStateAction<string>>;
+  bgmVolume: number;
+  setBgmVolume: React.Dispatch<React.SetStateAction<number>>;
+  enableBgm: boolean;
+  setEnableBgm: React.Dispatch<React.SetStateAction<boolean>>;
+  selectedVideoId: string | null;
+  setSelectedVideoId: React.Dispatch<React.SetStateAction<string | null>>;
+  selectedRefAudio: RefAudio | null;
+}
+
+export const useHomePersistence = ({
+  isAuthLoading,
+  storageKey,
+  text,
+  setText,
+  videoTitle,
+  setVideoTitle,
+  enableSubtitles,
+  setEnableSubtitles,
+  ttsMode,
+  setTtsMode,
+  voice,
+  setVoice,
+  selectedMaterial,
+  setSelectedMaterial,
+  selectedSubtitleStyleId,
+  setSelectedSubtitleStyleId,
+  selectedTitleStyleId,
+  setSelectedTitleStyleId,
+  subtitleFontSize,
+  setSubtitleFontSize,
+  titleFontSize,
+  setTitleFontSize,
+  setSubtitleSizeLocked,
+  setTitleSizeLocked,
+  selectedBgmId,
+  setSelectedBgmId,
+  bgmVolume,
+  setBgmVolume,
+  enableBgm,
+  setEnableBgm,
+  selectedVideoId,
+  setSelectedVideoId,
+  selectedRefAudio,
+}: UseHomePersistenceOptions) => {
+  const [isRestored, setIsRestored] = useState(false);
+
+  useEffect(() => {
+    if (isAuthLoading) return;
+
+    const savedText = localStorage.getItem(`vigent_${storageKey}_text`);
+    const savedTitle = localStorage.getItem(`vigent_${storageKey}_title`);
+    const savedSubtitles = localStorage.getItem(`vigent_${storageKey}_subtitles`);
+    const savedTtsMode = localStorage.getItem(`vigent_${storageKey}_ttsMode`);
+    const savedVoice = localStorage.getItem(`vigent_${storageKey}_voice`);
+    const savedMaterial = localStorage.getItem(`vigent_${storageKey}_material`);
+    const savedSubtitleStyle = localStorage.getItem(`vigent_${storageKey}_subtitleStyle`);
+    const savedTitleStyle = localStorage.getItem(`vigent_${storageKey}_titleStyle`);
+    const savedSubtitleFontSize = localStorage.getItem(`vigent_${storageKey}_subtitleFontSize`);
+    const savedTitleFontSize = localStorage.getItem(`vigent_${storageKey}_titleFontSize`);
+    const savedBgmId = localStorage.getItem(`vigent_${storageKey}_bgmId`);
+    const savedSelectedVideoId = localStorage.getItem(`vigent_${storageKey}_selectedVideoId`);
+    const savedBgmVolume = localStorage.getItem(`vigent_${storageKey}_bgmVolume`);
+    const savedEnableBgm = localStorage.getItem(`vigent_${storageKey}_enableBgm`);
+
+    setText(savedText || "大家好，欢迎来到我的频道，今天给大家分享一些有趣的内容。");
+    setVideoTitle(savedTitle ? clampTitle(savedTitle) : "");
+    setEnableSubtitles(savedSubtitles !== null ? savedSubtitles === 'true' : true);
+    setTtsMode(savedTtsMode === 'voiceclone' ? 'voiceclone' : 'edgetts');
+    setVoice(savedVoice || "zh-CN-YunxiNeural");
+
+    if (savedMaterial) setSelectedMaterial(savedMaterial);
+    if (savedSubtitleStyle) setSelectedSubtitleStyleId(savedSubtitleStyle);
+    if (savedTitleStyle) setSelectedTitleStyleId(savedTitleStyle);
+
+    if (savedSubtitleFontSize) {
+      const parsed = parseInt(savedSubtitleFontSize, 10);
+      if (!Number.isNaN(parsed)) {
+        setSubtitleFontSize(parsed);
+        setSubtitleSizeLocked(true);
+      }
+    }
+
+    if (savedTitleFontSize) {
+      const parsed = parseInt(savedTitleFontSize, 10);
+      if (!Number.isNaN(parsed)) {
+        setTitleFontSize(parsed);
+        setTitleSizeLocked(true);
+      }
+    }
+
+    if (savedBgmId) setSelectedBgmId(savedBgmId);
+    if (savedBgmVolume && !Number.isNaN(parseFloat(savedBgmVolume))) setBgmVolume(parseFloat(savedBgmVolume));
+    if (savedEnableBgm !== null) setEnableBgm(savedEnableBgm === 'true');
+    if (savedSelectedVideoId) setSelectedVideoId(savedSelectedVideoId);
+
+    setIsRestored(true);
+  }, [
+    isAuthLoading,
+    setBgmVolume,
+    setEnableBgm,
+    setEnableSubtitles,
+    setSelectedBgmId,
+    setSelectedMaterial,
+    setSelectedSubtitleStyleId,
+    setSelectedTitleStyleId,
+    setSelectedVideoId,
+    setSubtitleFontSize,
+    setSubtitleSizeLocked,
+    setText,
+    setTitleFontSize,
+    setTitleSizeLocked,
+    setTtsMode,
+    setVideoTitle,
+    setVoice,
+    storageKey,
+  ]);
+
+  useEffect(() => {
+    if (!isRestored) return;
+    const timeout = setTimeout(() => {
+      localStorage.setItem(`vigent_${storageKey}_text`, text);
+    }, 300);
+    return () => clearTimeout(timeout);
+  }, [text, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (!isRestored) return;
+    const timeout = setTimeout(() => {
+      localStorage.setItem(`vigent_${storageKey}_title`, videoTitle);
+    }, 300);
+    return () => clearTimeout(timeout);
+  }, [videoTitle, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (isRestored) localStorage.setItem(`vigent_${storageKey}_subtitles`, String(enableSubtitles));
+  }, [enableSubtitles, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (isRestored) localStorage.setItem(`vigent_${storageKey}_ttsMode`, ttsMode);
+  }, [ttsMode, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (isRestored) localStorage.setItem(`vigent_${storageKey}_voice`, voice);
+  }, [voice, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (isRestored && selectedMaterial) {
+      localStorage.setItem(`vigent_${storageKey}_material`, selectedMaterial);
+    }
+  }, [selectedMaterial, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (isRestored && selectedSubtitleStyleId) {
+      localStorage.setItem(`vigent_${storageKey}_subtitleStyle`, selectedSubtitleStyleId);
+    }
+  }, [selectedSubtitleStyleId, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (isRestored && selectedTitleStyleId) {
+      localStorage.setItem(`vigent_${storageKey}_titleStyle`, selectedTitleStyleId);
+    }
+  }, [selectedTitleStyleId, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (isRestored) {
+      localStorage.setItem(`vigent_${storageKey}_subtitleFontSize`, String(subtitleFontSize));
+    }
+  }, [subtitleFontSize, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (isRestored) {
+      localStorage.setItem(`vigent_${storageKey}_titleFontSize`, String(titleFontSize));
+    }
+  }, [titleFontSize, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (isRestored) {
+      localStorage.setItem(`vigent_${storageKey}_bgmId`, selectedBgmId);
+    }
+  }, [selectedBgmId, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (!isRestored) return;
+    const timeout = setTimeout(() => {
+      localStorage.setItem(`vigent_${storageKey}_bgmVolume`, String(bgmVolume));
+    }, 300);
+    return () => clearTimeout(timeout);
+  }, [bgmVolume, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (isRestored) {
+      localStorage.setItem(`vigent_${storageKey}_enableBgm`, String(enableBgm));
+    }
+  }, [enableBgm, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (!isRestored) return;
+    if (selectedVideoId) {
+      localStorage.setItem(`vigent_${storageKey}_selectedVideoId`, selectedVideoId);
+    } else {
+      localStorage.removeItem(`vigent_${storageKey}_selectedVideoId`);
+    }
+  }, [selectedVideoId, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (isRestored && selectedRefAudio) {
+      localStorage.setItem(`vigent_${storageKey}_refAudioId`, selectedRefAudio.id);
+    }
+  }, [selectedRefAudio, storageKey, isRestored]);
+
+  return { isRestored };
+};
--- a/frontend/src/features/home/model/useMaterials.ts
+++ b/frontend/src/features/home/model/useMaterials.ts
@@ -0,0 +1,113 @@
+import { useCallback, useState } from "react";
+import api from "@/shared/api/axios";
+
+interface Material {
+  id: string;
+  name: string;
+  scene: string;
+  size_mb: number;
+  path: string;
+}
+
+interface UseMaterialsOptions {
+  selectedMaterial: string;
+  setSelectedMaterial: React.Dispatch<React.SetStateAction<string>>;
+}
+
+export const useMaterials = ({
+  selectedMaterial,
+  setSelectedMaterial,
+}: UseMaterialsOptions) => {
+  const [materials, setMaterials] = useState<Material[]>([]);
+  const [fetchError, setFetchError] = useState<string | null>(null);
+  const [isUploading, setIsUploading] = useState(false);
+  const [uploadProgress, setUploadProgress] = useState(0);
+  const [uploadError, setUploadError] = useState<string | null>(null);
+
+  const fetchMaterials = useCallback(async () => {
+    try {
+      setFetchError(null);
+
+      const { data } = await api.get(`/api/materials?t=${Date.now()}`);
+      const nextMaterials = data.materials || [];
+      setMaterials(nextMaterials);
+
+      const nextSelected = nextMaterials.find((item: Material) => item.id === selectedMaterial)?.id
+        || nextMaterials[0]?.id
+        || "";
+      if (nextSelected !== selectedMaterial) {
+        setSelectedMaterial(nextSelected);
+      }
+    } catch (error) {
+      console.error("Failed to fetch materials:", error);
+      setFetchError(String(error));
+    }
+  }, [selectedMaterial, setSelectedMaterial]);
+
+  const deleteMaterial = useCallback(async (materialId: string) => {
+    if (!confirm("确定要删除这个素材吗？")) return;
+    try {
+      await api.delete(`/api/materials/${materialId}`);
+      fetchMaterials();
+      if (selectedMaterial === materialId) {
+        setSelectedMaterial("");
+      }
+    } catch (error) {
+      alert("删除失败: " + error);
+    }
+  }, [fetchMaterials, selectedMaterial, setSelectedMaterial]);
+
+  const handleUpload = useCallback(async (e: React.ChangeEvent<HTMLInputElement>) => {
+    const file = e.target.files?.[0];
+    if (!file) return;
+
+    const validTypes = ['.mp4', '.mov', '.avi'];
+    const ext = file.name.toLowerCase().slice(file.name.lastIndexOf('.'));
+    if (!validTypes.includes(ext)) {
+      setUploadError('仅支持 MP4、MOV、AVI 格式');
+      return;
+    }
+
+    setIsUploading(true);
+    setUploadProgress(0);
+    setUploadError(null);
+
+    try {
+      const formData = new FormData();
+      formData.append('file', file);
+
+      await api.post('/api/materials', formData, {
+        headers: { 'Content-Type': 'multipart/form-data' },
+        onUploadProgress: (progressEvent) => {
+          if (progressEvent.total) {
+            const progress = Math.round((progressEvent.loaded / progressEvent.total) * 100);
+            setUploadProgress(progress);
+          }
+        },
+      });
+
+      setUploadProgress(100);
+      setIsUploading(false);
+      fetchMaterials();
+    } catch (err: any) {
+      console.error("Upload failed:", err);
+      setIsUploading(false);
+      const errorMsg = err.response?.data?.detail || err.message || String(err);
+      setUploadError(`上传失败: ${errorMsg}`);
+    }
+
+    e.target.value = '';
+  }, [fetchMaterials]);
+
+  return {
+    materials,
+    fetchError,
+    isUploading,
+    uploadProgress,
+    uploadError,
+    setUploadError,
+    fetchMaterials,
+    deleteMaterial,
+    handleUpload,
+  };
+};
--- a/frontend/src/features/home/model/useMediaPlayers.ts
+++ b/frontend/src/features/home/model/useMediaPlayers.ts
@@ -0,0 +1,116 @@
+import { useCallback, useEffect, useRef, useState } from "react";
+import type { BgmItem } from "@/features/home/model/useBgm";
+
+interface RefAudio {
+  id: string;
+  name: string;
+  path: string;
+  ref_text: string;
+  duration_sec: number;
+  created_at: number;
+}
+
+interface UseMediaPlayersOptions {
+  bgmVolume: number;
+  resolveBgmUrl: (bgmId?: string | null) => string | null;
+  resolveMediaUrl: (url?: string | null) => string | null;
+  setSelectedBgmId: React.Dispatch<React.SetStateAction<string>>;
+  setEnableBgm: React.Dispatch<React.SetStateAction<boolean>>;
+}
+
+export const useMediaPlayers = ({
+  bgmVolume,
+  resolveBgmUrl,
+  resolveMediaUrl,
+  setSelectedBgmId,
+  setEnableBgm,
+}: UseMediaPlayersOptions) => {
+  const [playingAudioId, setPlayingAudioId] = useState<string | null>(null);
+  const [playingBgmId, setPlayingBgmId] = useState<string | null>(null);
+  const audioPlayerRef = useRef<HTMLAudioElement | null>(null);
+  const bgmPlayerRef = useRef<HTMLAudioElement | null>(null);
+
+  const stopAudio = useCallback(() => {
+    if (audioPlayerRef.current) {
+      audioPlayerRef.current.pause();
+      audioPlayerRef.current.currentTime = 0;
+      audioPlayerRef.current = null;
+    }
+    setPlayingAudioId(null);
+  }, []);
+
+  const stopBgm = useCallback(() => {
+    if (bgmPlayerRef.current) {
+      bgmPlayerRef.current.pause();
+      bgmPlayerRef.current.currentTime = 0;
+      bgmPlayerRef.current = null;
+    }
+    setPlayingBgmId(null);
+  }, []);
+
+  const togglePlayPreview = useCallback((audio: RefAudio, e: React.MouseEvent) => {
+    e.stopPropagation();
+
+    if (bgmPlayerRef.current) {
+      stopBgm();
+    }
+
+    if (playingAudioId === audio.id) {
+      stopAudio();
+      return;
+    }
+
+    stopAudio();
+
+    const audioUrl = resolveMediaUrl(audio.path) || audio.path;
+    if (!audioUrl) {
+      alert("无法播放该参考音频");
+      return;
+    }
+    const player = new Audio(audioUrl);
+    player.onended = () => setPlayingAudioId(null);
+    player.play().catch((err) => { if (audioPlayerRef.current !== player) return; stopAudio(); alert("播放失败: " + err); });
+    audioPlayerRef.current = player;
+    setPlayingAudioId(audio.id);
+  }, [playingAudioId, resolveMediaUrl, stopAudio, stopBgm]);
+
+  const toggleBgmPreview = useCallback((bgm: BgmItem, e: React.MouseEvent) => {
+    e.stopPropagation();
+    setSelectedBgmId(bgm.id);
+    setEnableBgm(true);
+
+    const bgmUrl = resolveBgmUrl(bgm.id);
+    if (!bgmUrl) {
+      alert("无法播放该背景音乐");
+      return;
+    }
+
+    if (playingBgmId === bgm.id) {
+      stopBgm();
+      return;
+    }
+
+    stopAudio();
+    stopBgm();
+
+    const player = new Audio(bgmUrl);
+    player.volume = Math.max(0, Math.min(bgmVolume, 1));
+    player.onended = () => setPlayingBgmId(null);
+    player.play().catch((err) => { if (bgmPlayerRef.current !== player) return; stopBgm(); alert("播放失败: " + err); });
+    bgmPlayerRef.current = player;
+    setPlayingBgmId(bgm.id);
+  }, [bgmVolume, playingBgmId, resolveBgmUrl, setEnableBgm, setSelectedBgmId, stopAudio, stopBgm]);
+
+  useEffect(() => {
+    if (bgmPlayerRef.current) {
+      bgmPlayerRef.current.volume = Math.max(0, Math.min(bgmVolume, 1));
+    }
+  }, [bgmVolume]);
+
+  return {
+    playingAudioId,
+    playingBgmId,
+    togglePlayPreview,
+    toggleBgmPreview,
+  };
+};
--- a/frontend/src/features/home/model/useRefAudios.ts
+++ b/frontend/src/features/home/model/useRefAudios.ts
@@ -0,0 +1,91 @@
+import { useCallback, useState } from "react";
+import api from "@/shared/api/axios";
+
+interface RefAudio {
+  id: string;
+  name: string;
+  path: string;
+  ref_text: string;
+  duration_sec: number;
+  created_at: number;
+}
+
+interface UseRefAudiosOptions {
+  fixedRefText: string;
+  selectedRefAudio: RefAudio | null;
+  setSelectedRefAudio: React.Dispatch<React.SetStateAction<RefAudio | null>>;
+  setRefText: React.Dispatch<React.SetStateAction<string>>;
+}
+
+export const useRefAudios = ({
+  fixedRefText,
+  selectedRefAudio,
+  setSelectedRefAudio,
+  setRefText,
+}: UseRefAudiosOptions) => {
+  const [refAudios, setRefAudios] = useState<RefAudio[]>([]);
+  const [isUploadingRef, setIsUploadingRef] = useState(false);
+  const [uploadRefError, setUploadRefError] = useState<string | null>(null);
+
+  const fetchRefAudios = useCallback(async () => {
+    try {
+      const { data } = await api.get('/api/ref-audios');
+      const items: RefAudio[] = data.items || [];
+      items.sort((a, b) => b.created_at - a.created_at);
+      setRefAudios(items);
+    } catch (error) {
+      console.error("Failed to fetch reference audios:", error);
+    }
+  }, []);
+
+  const uploadRefAudio = useCallback(async (file: File) => {
+    const refTextInput = fixedRefText;
+
+    setIsUploadingRef(true);
+    setUploadRefError(null);
+
+    try {
+      const formData = new FormData();
+      formData.append('file', file);
+      formData.append('ref_text', refTextInput);
+
+      const { data } = await api.post('/api/ref-audios', formData, {
+        headers: { 'Content-Type': 'multipart/form-data' },
+      });
+
+      await fetchRefAudios();
+      setSelectedRefAudio(data);
+      setRefText(data.ref_text);
+      setIsUploadingRef(false);
+    } catch (err: any) {
+      console.error("Upload ref audio failed:", err);
+      setIsUploadingRef(false);
+      const errorMsg = err.response?.data?.detail || err.message || String(err);
+      setUploadRefError(`上传失败: ${errorMsg}`);
+    }
+  }, [fetchRefAudios, fixedRefText, setRefText, setSelectedRefAudio]);
+
+  const deleteRefAudio = useCallback(async (audioId: string) => {
+    if (!confirm("确定要删除这个参考音频吗？")) return;
+    try {
+      await api.delete(`/api/ref-audios/${encodeURIComponent(audioId)}`);
+      fetchRefAudios();
+      if (selectedRefAudio?.id === audioId) {
+        setSelectedRefAudio(null);
+        setRefText('');
+      }
+    } catch (error) {
+      alert("删除失败: " + error);
+    }
+  }, [fetchRefAudios, selectedRefAudio, setRefText, setSelectedRefAudio]);
+
+  return {
+    refAudios,
+    isUploadingRef,
+    uploadRefError,
+    setUploadRefError,
+    fetchRefAudios,
+    uploadRefAudio,
+    deleteRefAudio,
+  };
+};
--- a/frontend/src/features/home/model/useTitleSubtitleStyles.ts
+++ b/frontend/src/features/home/model/useTitleSubtitleStyles.ts
@@ -0,0 +1,98 @@
+import { useCallback, useEffect, useState } from "react";
+import api from "@/shared/api/axios";
+
+export interface SubtitleStyleOption {
+  id: string;
+  label: string;
+  font_family?: string;
+  font_file?: string;
+  font_size?: number;
+  highlight_color?: string;
+  normal_color?: string;
+  stroke_color?: string;
+  stroke_size?: number;
+  letter_spacing?: number;
+  bottom_margin?: number;
+  is_default?: boolean;
+}
+
+export interface TitleStyleOption {
+  id: string;
+  label: string;
+  font_family?: string;
+  font_file?: string;
+  font_size?: number;
+  color?: string;
+  stroke_color?: string;
+  stroke_size?: number;
+  letter_spacing?: number;
+  font_weight?: number;
+  top_margin?: number;
+  is_default?: boolean;
+}
+
+interface UseTitleSubtitleStylesOptions {
+  isAuthLoading: boolean;
+  storageKey: string;
+  setSelectedSubtitleStyleId: React.Dispatch<React.SetStateAction<string>>;
+  setSelectedTitleStyleId: React.Dispatch<React.SetStateAction<string>>;
+}
+
+export const useTitleSubtitleStyles = ({
+  isAuthLoading,
+  storageKey,
+  setSelectedSubtitleStyleId,
+  setSelectedTitleStyleId,
+}: UseTitleSubtitleStylesOptions) => {
+  const [subtitleStyles, setSubtitleStyles] = useState<SubtitleStyleOption[]>([]);
+  const [titleStyles, setTitleStyles] = useState<TitleStyleOption[]>([]);
+
+  const refreshSubtitleStyles = useCallback(async () => {
+    try {
+      const { data } = await api.get('/api/assets/subtitle-styles');
+      const styles: SubtitleStyleOption[] = data.styles || [];
+      setSubtitleStyles(styles);
+
+      const savedStyleId = localStorage.getItem(`vigent_${storageKey}_subtitleStyle`);
+      setSelectedSubtitleStyleId((prev) => {
+        if (prev && styles.some((s) => s.id === prev)) return prev;
+        if (savedStyleId && styles.some((s) => s.id === savedStyleId)) return savedStyleId;
+        const defaultStyle = styles.find((s) => s.is_default) || styles[0];
+        return defaultStyle?.id || "";
+      });
+    } catch (error) {
+      console.error("Failed to fetch subtitle styles:", error);
+    }
+  }, [setSelectedSubtitleStyleId, storageKey]);
+
+  const refreshTitleStyles = useCallback(async () => {
+    try {
+      const { data } = await api.get('/api/assets/title-styles');
+      const styles: TitleStyleOption[] = data.styles || [];
+      setTitleStyles(styles);
+
+      const savedStyleId = localStorage.getItem(`vigent_${storageKey}_titleStyle`);
+      setSelectedTitleStyleId((prev) => {
+        if (prev && styles.some((s) => s.id === prev)) return prev;
+        if (savedStyleId && styles.some((s) => s.id === savedStyleId)) return savedStyleId;
+        const defaultStyle = styles.find((s) => s.is_default) || styles[0];
+        return defaultStyle?.id || "";
+      });
+    } catch (error) {
+      console.error("Failed to fetch title styles:", error);
+    }
+  }, [setSelectedTitleStyleId, storageKey]);
+
+  useEffect(() => {
+    if (isAuthLoading) return;
+    refreshSubtitleStyles();
+    refreshTitleStyles();
+  }, [isAuthLoading, refreshSubtitleStyles, refreshTitleStyles]);
+
+  return {
+    subtitleStyles,
+    titleStyles,
+    refreshSubtitleStyles,
+    refreshTitleStyles,
+  };
+};
--- a/frontend/src/features/home/ui/BgmPanel.tsx
+++ b/frontend/src/features/home/ui/BgmPanel.tsx
@@ -0,0 +1,137 @@
+import type { RefObject, MouseEvent } from "react";
+import { RefreshCw, Play, Pause } from "lucide-react";
+
+interface BgmItem {
+  id: string;
+  name: string;
+  ext?: string;
+}
+
+interface BgmPanelProps {
+  bgmList: BgmItem[];
+  bgmLoading: boolean;
+  bgmError: string;
+  enableBgm: boolean;
+  onToggleEnable: (value: boolean) => void;
+  onRefresh: () => void;
+  selectedBgmId: string;
+  onSelectBgm: (id: string) => void;
+  playingBgmId: string | null;
+  onTogglePreview: (bgm: BgmItem, event: MouseEvent) => void;
+  bgmVolume: number;
+  onVolumeChange: (value: number) => void;
+  bgmListContainerRef: RefObject<HTMLDivElement | null>;
+  registerBgmItemRef: (id: string, element: HTMLDivElement | null) => void;
+}
+
+export function BgmPanel({
+  bgmList,
+  bgmLoading,
+  bgmError,
+  enableBgm,
+  onToggleEnable,
+  onRefresh,
+  selectedBgmId,
+  onSelectBgm,
+  playingBgmId,
+  onTogglePreview,
+  bgmVolume,
+  onVolumeChange,
+  bgmListContainerRef,
+  registerBgmItemRef,
+}: BgmPanelProps) {
+  return (
+    <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
+      <div className="flex items-center justify-between mb-4">
+        <h2 className="text-lg font-semibold text-white flex items-center gap-2">🎵 背景音乐</h2>
+        <div className="flex items-center gap-2">
+          <button
+            onClick={onRefresh}
+            className="px-2 py-1 text-xs bg-white/10 hover:bg-white/20 rounded text-gray-300 flex items-center gap-1"
+          >
+            <RefreshCw className="h-3.5 w-3.5" />
+            刷新
+          </button>
+          <label className="relative inline-flex items-center cursor-pointer">
+            <input
+              type="checkbox"
+              checked={enableBgm}
+              onChange={(e) => onToggleEnable(e.target.checked)}
+              className="sr-only peer"
+            />
+            <div className="w-11 h-6 bg-gray-600 peer-focus:outline-none rounded-full peer peer-checked:after:translate-x-full peer-checked:after:border-white after:content-[''] after:absolute after:top-[2px] after:left-[2px] after:bg-white after:border-gray-300 after:border after:rounded-full after:h-5 after:w-5 after:transition-all peer-checked:bg-purple-600"></div>
+          </label>
+        </div>
+      </div>
+
+      {bgmLoading ? (
+        <div className="text-center py-4 text-gray-400 text-sm">正在加载背景音乐...</div>
+      ) : bgmError ? (
+        <div className="text-center py-4 text-red-300 text-sm">
+          加载失败：{bgmError}
+          <button
+            onClick={onRefresh}
+            className="ml-2 px-2 py-1 text-xs bg-white/10 hover:bg-white/20 rounded text-gray-300"
+          >
+            重试
+          </button>
+        </div>
+      ) : bgmList.length === 0 ? (
+        <div className="text-center py-4 text-gray-500 text-sm">暂无背景音乐，请先导入素材</div>
+      ) : (
+        <div
+          ref={bgmListContainerRef}
+          className={`space-y-2 max-h-64 overflow-y-auto hide-scrollbar ${enableBgm ? '' : 'opacity-70'}`}
+        >
+          {bgmList.map((bgm) => (
+            <div
+              key={bgm.id}
+              ref={(el) => registerBgmItemRef(bgm.id, el)}
+              className={`p-3 rounded-lg border transition-all flex items-center justify-between group ${selectedBgmId === bgm.id
+                ? "border-purple-500 bg-purple-500/20"
+                : "border-white/10 bg-white/5 hover:border-white/30"
+                }`}
+            >
+              <button onClick={() => onSelectBgm(bgm.id)} className="flex-1 text-left">
+                <div className="text-white text-sm truncate">{bgm.name}</div>
+                <div className="text-xs text-gray-400">.{bgm.ext || 'audio'}</div>
+              </button>
+              <div className="flex items-center gap-2 pl-2">
+                <button
+                  onClick={(e) => onTogglePreview(bgm, e)}
+                  className="p-1 text-gray-500 hover:text-purple-400 transition-colors"
+                  title="试听"
+                >
+                  {playingBgmId === bgm.id ? (
+                    <Pause className="h-4 w-4" />
+                  ) : (
+                    <Play className="h-4 w-4" />
+                  )}
+                </button>
+                {selectedBgmId === bgm.id && (
+                  <span className="text-xs text-purple-300">已选</span>
+                )}
+              </div>
+            </div>
+          ))}
+        </div>
+      )}
+
+      {enableBgm && (
+        <div className="mt-4">
+          <label className="text-sm text-gray-300 mb-2 block">音量</label>
+          <input
+            type="range"
+            min="0"
+            max="1"
+            step="0.05"
+            value={bgmVolume}
+            onChange={(e) => onVolumeChange(parseFloat(e.target.value))}
+            className="w-full accent-purple-500"
+          />
+          <div className="text-xs text-gray-400 mt-1">当前: {Math.round(bgmVolume * 100)}%</div>
+        </div>
+      )}
+    </div>
+  );
+}
--- a/frontend/src/features/home/ui/GenerateActionBar.tsx
+++ b/frontend/src/features/home/ui/GenerateActionBar.tsx
@@ -0,0 +1,53 @@
+import { Rocket } from "lucide-react";
+
+interface GenerateActionBarProps {
+  isGenerating: boolean;
+  progress: number;
+  disabled: boolean;
+  onGenerate: () => void;
+}
+
+export function GenerateActionBar({
+  isGenerating,
+  progress,
+  disabled,
+  onGenerate,
+}: GenerateActionBarProps) {
+  return (
+    <button
+      onClick={onGenerate}
+      disabled={disabled}
+      className={`w-full py-4 rounded-xl font-bold text-lg transition-all ${disabled
+        ? "bg-gray-600 cursor-not-allowed text-gray-400"
+        : "bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-700 hover:to-pink-700 text-white shadow-lg hover:shadow-purple-500/25"
+        }`}
+    >
+      {isGenerating ? (
+        <span className="flex items-center justify-center gap-3">
+          <svg className="animate-spin h-5 w-5" viewBox="0 0 24 24">
+            <circle
+              className="opacity-25"
+              cx="12"
+              cy="12"
+              r="10"
+              stroke="currentColor"
+              strokeWidth="4"
+              fill="none"
+            />
+            <path
+              className="opacity-75"
+              fill="currentColor"
+              d="M4 12a8 8 0 018-8V0C5.373 0 0 5.373 0 12h4z"
+            />
+          </svg>
+          生成中... {progress}%
+        </span>
+      ) : (
+        <span className="flex items-center justify-center gap-2">
+          <Rocket className="h-5 w-5" />
+          生成视频
+        </span>
+      )}
+    </button>
+  );
+}
--- a/frontend/src/features/home/ui/HistoryList.tsx
+++ b/frontend/src/features/home/ui/HistoryList.tsx
@@ -0,0 +1,80 @@
+import { RefreshCw, Trash2 } from "lucide-react";
+
+interface GeneratedVideo {
+  id: string;
+  name: string;
+  path: string;
+  size_mb: number;
+  created_at: number;
+}
+
+interface HistoryListProps {
+  generatedVideos: GeneratedVideo[];
+  selectedVideoId: string | null;
+  onSelectVideo: (video: GeneratedVideo) => void;
+  onDeleteVideo: (id: string) => void;
+  onRefresh: () => void;
+  registerVideoRef: (id: string, element: HTMLDivElement | null) => void;
+  formatDate: (timestamp: number) => string;
+}
+
+export function HistoryList({
+  generatedVideos,
+  selectedVideoId,
+  onSelectVideo,
+  onDeleteVideo,
+  onRefresh,
+  registerVideoRef,
+  formatDate,
+}: HistoryListProps) {
+  return (
+    <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
+      <div className="flex justify-between items-center mb-4">
+        <h2 className="text-lg font-semibold text-white flex items-center gap-2">📂 历史作品</h2>
+        <button
+          onClick={onRefresh}
+          className="px-3 py-1 text-xs bg-white/10 hover:bg-white/20 rounded text-gray-300 flex items-center gap-1"
+        >
+          <RefreshCw className="h-3.5 w-3.5" />
+          刷新
+        </button>
+      </div>
+      {generatedVideos.length === 0 ? (
+        <div className="text-center py-4 text-gray-500">
+          <p>暂无生成的作品</p>
+        </div>
+      ) : (
+        <div
+          className="space-y-2 max-h-64 overflow-y-auto hide-scrollbar"
+          style={{ contentVisibility: 'auto' }}
+        >
+          {generatedVideos.map((v) => (
+            <div
+              key={v.id}
+              ref={(el) => registerVideoRef(v.id, el)}
+              className={`p-3 rounded-lg border transition-all flex items-center justify-between group ${selectedVideoId === v.id
+                ? "border-purple-500 bg-purple-500/20"
+                : "border-white/10 bg-white/5 hover:border-white/30"
+                }`}
+            >
+              <button onClick={() => onSelectVideo(v)} className="flex-1 text-left">
+                <div className="text-white text-sm truncate">{formatDate(v.created_at)}</div>
+                <div className="text-gray-400 text-xs">{v.size_mb.toFixed(1)} MB</div>
+              </button>
+              <button
+                onClick={(e) => {
+                  e.stopPropagation();
+                  onDeleteVideo(v.id);
+                }}
+                className="p-1 text-gray-500 hover:text-red-400 opacity-0 group-hover:opacity-100 transition-opacity"
+                title="删除视频"
+              >
+                <Trash2 className="h-4 w-4" />
+              </button>
+            </div>
+          ))}
+        </div>
+      )}
+    </div>
+  );
+}
--- a/frontend/src/features/home/ui/HomeHeader.tsx
+++ b/frontend/src/features/home/ui/HomeHeader.tsx
@@ -0,0 +1,30 @@
+import Link from "next/link";
+import AccountSettingsDropdown from "@/components/AccountSettingsDropdown";
+
+export function HomeHeader() {
+  return (
+    <header className="border-b border-white/10 bg-black/20 backdrop-blur-sm relative z-[100]">
+      <div className="max-w-6xl mx-auto px-4 sm:px-6 py-3 sm:py-4 flex items-center justify-between">
+        <Link
+          href="/"
+          className="text-xl sm:text-2xl font-bold text-white flex items-center gap-2 sm:gap-3 hover:opacity-80 transition-opacity"
+        >
+          <span className="text-3xl sm:text-4xl">🎬</span>
+          IPAgent
+        </Link>
+        <div className="flex items-center gap-1 sm:gap-4">
+          <span className="px-2 sm:px-4 py-1 sm:py-2 text-sm sm:text-base bg-gradient-to-r from-purple-600 to-pink-600 text-white rounded-lg font-semibold">
+            视频生成
+          </span>
+          <Link
+            href="/publish"
+            className="px-2 sm:px-4 py-1 sm:py-2 text-sm sm:text-base bg-white/10 hover:bg-white/20 text-white rounded-lg transition-colors"
+          >
+            发布管理
+          </Link>
+          <AccountSettingsDropdown />
+        </div>
+      </div>
+    </header>
+  );
+}
--- a/frontend/src/features/home/ui/HomePage.tsx
+++ b/frontend/src/features/home/ui/HomePage.tsx
@@ -0,0 +1,295 @@
+"use client";
+
+import VideoPreviewModal from "@/components/VideoPreviewModal";
+import ScriptExtractionModal from "@/components/ScriptExtractionModal";
+import { useHomeController } from "@/features/home/model/useHomeController";
+import { BgmPanel } from "@/features/home/ui/BgmPanel";
+import { GenerateActionBar } from "@/features/home/ui/GenerateActionBar";
+import { HistoryList } from "@/features/home/ui/HistoryList";
+import { HomeHeader } from "@/features/home/ui/HomeHeader";
+import { MaterialSelector } from "@/features/home/ui/MaterialSelector";
+import { PreviewPanel } from "@/features/home/ui/PreviewPanel";
+import { RefAudioPanel } from "@/features/home/ui/RefAudioPanel";
+import { ScriptEditor } from "@/features/home/ui/ScriptEditor";
+import { TitleSubtitlePanel } from "@/features/home/ui/TitleSubtitlePanel";
+import { VoiceSelector } from "@/features/home/ui/VoiceSelector";
+
+export function HomePage() {
+  const {
+    apiBase,
+    registerMaterialRef,
+    previewMaterial,
+    setPreviewMaterial,
+    materials,
+    fetchError,
+    isUploading,
+    uploadProgress,
+    uploadError,
+    setUploadError,
+    fetchMaterials,
+    deleteMaterial,
+    handleUpload,
+    selectedMaterial,
+    setSelectedMaterial,
+    handlePreviewMaterial,
+    text,
+    setText,
+    extractModalOpen,
+    setExtractModalOpen,
+    handleGenerateMeta,
+    isGeneratingMeta,
+    showStylePreview,
+    setShowStylePreview,
+    videoTitle,
+    titleInput,
+    titleStyles,
+    selectedTitleStyleId,
+    setSelectedTitleStyleId,
+    titleFontSize,
+    setTitleFontSize,
+    setTitleSizeLocked,
+    subtitleStyles,
+    selectedSubtitleStyleId,
+    setSelectedSubtitleStyleId,
+    subtitleFontSize,
+    setSubtitleFontSize,
+    setSubtitleSizeLocked,
+    enableSubtitles,
+    setEnableSubtitles,
+    resolveAssetUrl,
+    getFontFormat,
+    buildTextShadow,
+    previewContainerWidth,
+    materialDimensions,
+    titlePreviewContainerRef,
+    ttsMode,
+    setTtsMode,
+    voices,
+    voice,
+    setVoice,
+    refAudios,
+    selectedRefAudio,
+    handleSelectRefAudio,
+    isUploadingRef,
+    uploadRefError,
+    setUploadRefError,
+    uploadRefAudio,
+    fetchRefAudios,
+    playingAudioId,
+    togglePlayPreview,
+    editingAudioId,
+    editName,
+    setEditName,
+    startEditing,
+    saveEditing,
+    cancelEditing,
+    deleteRefAudio,
+    recordedBlob,
+    isRecording,
+    recordingTime,
+    startRecording,
+    stopRecording,
+    useRecording,
+    formatRecordingTime,
+    fixedRefText,
+    bgmList,
+    bgmLoading,
+    bgmError,
+    enableBgm,
+    setEnableBgm,
+    fetchBgmList,
+    selectedBgmId,
+    setSelectedBgmId,
+    playingBgmId,
+    toggleBgmPreview,
+    bgmVolume,
+    setBgmVolume,
+    bgmListContainerRef,
+    registerBgmItemRef,
+    currentTask,
+    isGenerating,
+    handleGenerate,
+    generatedVideo,
+    generatedVideos,
+    selectedVideoId,
+    handleSelectVideo,
+    deleteVideo,
+    fetchGeneratedVideos,
+    registerVideoRef,
+    formatDate,
+  } = useHomeController();
+
+  return (
+    <div className="min-h-dvh">
+      <HomeHeader />
+
+      <main className="max-w-6xl mx-auto px-6 py-8">
+        <div className="grid grid-cols-1 lg:grid-cols-2 gap-8">
+          {/* Left column: input area */}
+          <div className="space-y-6">
+            {/* Material selection */}
+            <MaterialSelector
+              materials={materials}
+              selectedMaterial={selectedMaterial}
+              isUploading={isUploading}
+              uploadProgress={uploadProgress}
+              uploadError={uploadError}
+              fetchError={fetchError}
+              apiBase={apiBase}
+              onUploadChange={handleUpload}
+              onRefresh={fetchMaterials}
+              onSelectMaterial={setSelectedMaterial}
+              onPreviewMaterial={handlePreviewMaterial}
+              onDeleteMaterial={deleteMaterial}
+              onClearUploadError={() => setUploadError(null)}
+              registerMaterialRef={registerMaterialRef}
+            />
+
+            {/* Script input */}
+            <ScriptEditor
+              text={text}
+              onChangeText={setText}
+              onOpenExtractModal={() => setExtractModalOpen(true)}
+              onGenerateMeta={handleGenerateMeta}
+              isGeneratingMeta={isGeneratingMeta}
+            />
+
+            {/* Title and subtitle settings */}
+            <TitleSubtitlePanel
+              showStylePreview={showStylePreview}
+              onTogglePreview={() => setShowStylePreview((prev) => !prev)}
+              videoTitle={videoTitle}
+              onTitleChange={titleInput.handleChange}
+              onTitleCompositionStart={titleInput.handleCompositionStart}
+              onTitleCompositionEnd={titleInput.handleCompositionEnd}
+              titleStyles={titleStyles}
+              selectedTitleStyleId={selectedTitleStyleId}
+              onSelectTitleStyle={setSelectedTitleStyleId}
+              titleFontSize={titleFontSize}
+              onTitleFontSizeChange={(value) => {
+                setTitleFontSize(value);
+                setTitleSizeLocked(true);
+              }}
+              subtitleStyles={subtitleStyles}
+              selectedSubtitleStyleId={selectedSubtitleStyleId}
+              onSelectSubtitleStyle={setSelectedSubtitleStyleId}
+              subtitleFontSize={subtitleFontSize}
+              onSubtitleFontSizeChange={(value) => {
+                setSubtitleFontSize(value);
+                setSubtitleSizeLocked(true);
+              }}
+              enableSubtitles={enableSubtitles}
+              onToggleSubtitles={setEnableSubtitles}
+              resolveAssetUrl={resolveAssetUrl}
+              getFontFormat={getFontFormat}
+              buildTextShadow={buildTextShadow}
+              previewScale={previewContainerWidth
+                ? previewContainerWidth / (materialDimensions?.width || 1280)
+                : 1}
+              previewAspectRatio={materialDimensions
+                ? `${materialDimensions.width} / ${materialDimensions.height}`
+                : "16 / 9"}
+              previewBaseWidth={materialDimensions?.width || 1280}
+              previewBaseHeight={materialDimensions?.height || 720}
+              previewContainerRef={titlePreviewContainerRef}
+            />
+
+            {/* Voice-over mode selection */}
+            <VoiceSelector
+              ttsMode={ttsMode}
+              onSelectTtsMode={setTtsMode}
+              voices={voices}
+              voice={voice}
+              onSelectVoice={setVoice}
+              voiceCloneSlot={(
+                <RefAudioPanel
+                  refAudios={refAudios}
+                  selectedRefAudio={selectedRefAudio}
+                  onSelectRefAudio={handleSelectRefAudio}
+                  isUploadingRef={isUploadingRef}
+                  uploadRefError={uploadRefError}
+                  onClearUploadRefError={() => setUploadRefError(null)}
+                  onUploadRefAudio={uploadRefAudio}
+                  onFetchRefAudios={fetchRefAudios}
+                  playingAudioId={playingAudioId}
+                  onTogglePlayPreview={togglePlayPreview}
+                  editingAudioId={editingAudioId}
+                  editName={editName}
+                  onEditNameChange={setEditName}
+                  onStartEditing={startEditing}
+                  onSaveEditing={saveEditing}
+                  onCancelEditing={cancelEditing}
+                  onDeleteRefAudio={deleteRefAudio}
+                  recordedBlob={recordedBlob}
+                  isRecording={isRecording}
+                  recordingTime={recordingTime}
+                  onStartRecording={startRecording}
+                  onStopRecording={stopRecording}
+                  onUseRecording={useRecording}
+                  formatRecordingTime={formatRecordingTime}
+                  fixedRefText={fixedRefText}
+                />
+              )}
+            />
+
+            {/* Background music */}
+            <BgmPanel
+              bgmList={bgmList}
+              bgmLoading={bgmLoading}
+              bgmError={bgmError}
+              enableBgm={enableBgm}
+              onToggleEnable={setEnableBgm}
+              onRefresh={fetchBgmList}
+              selectedBgmId={selectedBgmId}
+              onSelectBgm={setSelectedBgmId}
+              playingBgmId={playingBgmId}
+              onTogglePreview={toggleBgmPreview}
+              bgmVolume={bgmVolume}
+              onVolumeChange={setBgmVolume}
+              bgmListContainerRef={bgmListContainerRef}
+              registerBgmItemRef={registerBgmItemRef}
+            />
+
+            {/* Generate button */}
+            <GenerateActionBar
+              isGenerating={isGenerating}
+              progress={currentTask?.progress || 0}
+              disabled={isGenerating || !selectedMaterial || (ttsMode === "voiceclone" && !selectedRefAudio)}
+              onGenerate={handleGenerate}
+            />
+          </div>
+
+          {/* Right column: preview area */}
+          <div className="space-y-6">
+            <PreviewPanel
+              currentTask={currentTask}
+              isGenerating={isGenerating}
+              generatedVideo={generatedVideo}
+            />
+
+            <HistoryList
+              generatedVideos={generatedVideos}
+              selectedVideoId={selectedVideoId}
+              onSelectVideo={handleSelectVideo}
+              onDeleteVideo={deleteVideo}
+              onRefresh={() => fetchGeneratedVideos()}
+              registerVideoRef={registerVideoRef}
+              formatDate={formatDate}
+            />
+          </div>
+        </div>
+      </main>
+      <VideoPreviewModal
+        onClose={() => setPreviewMaterial(null)}
+        videoUrl={previewMaterial}
+        title="素材预览"
+      />
+
+      <ScriptExtractionModal
+        isOpen={extractModalOpen}
+        onClose={() => setExtractModalOpen(false)}
+        onApply={(nextText) => setText(nextText)}
+      />
+    </div>
+  );
+}
--- a/frontend/src/features/home/ui/MaterialSelector.tsx
+++ b/frontend/src/features/home/ui/MaterialSelector.tsx
@@ -0,0 +1,168 @@
+import type { ChangeEvent } from "react";
+import { Upload, RefreshCw, Eye, Trash2, X } from "lucide-react";
+
+interface Material {
+  id: string;
+  name: string;
+  scene: string;
+  size_mb: number;
+  path: string;
+}
+
+interface MaterialSelectorProps {
+  materials: Material[];
+  selectedMaterial: string;
+  isUploading: boolean;
+  uploadProgress: number;
+  uploadError: string | null;
+  fetchError: string | null;
+  apiBase: string;
+  onUploadChange: (event: ChangeEvent<HTMLInputElement>) => void;
+  onRefresh: () => void;
+  onSelectMaterial: (id: string) => void;
+  onPreviewMaterial: (path: string) => void;
+  onDeleteMaterial: (id: string) => void;
+  onClearUploadError: () => void;
+  registerMaterialRef: (id: string, element: HTMLDivElement | null) => void;
+}
+
+export function MaterialSelector({
+  materials,
+  selectedMaterial,
+  isUploading,
+  uploadProgress,
+  uploadError,
+  fetchError,
+  apiBase,
+  onUploadChange,
+  onRefresh,
+  onSelectMaterial,
+  onPreviewMaterial,
+  onDeleteMaterial,
+  onClearUploadError,
+  registerMaterialRef,
+}: MaterialSelectorProps) {
+  return (
+    <div className="bg-white/5 rounded-2xl p-4 sm:p-6 border border-white/10 backdrop-blur-sm">
+      <div className="flex justify-between items-center gap-2 mb-4">
+        <h2 className="text-base sm:text-lg font-semibold text-white flex items-center gap-2 whitespace-nowrap">
+          📹 视频素材
+          <span className="ml-1 text-[11px] sm:text-xs text-gray-400/90 font-normal">
+            (上传自拍视频)
+          </span>
+        </h2>
+        <div className="flex gap-1.5">
+          <input
+            type="file"
+            id="video-upload"
+            accept=".mp4,.mov,.avi"
+            onChange={onUploadChange}
+            className="hidden"
+          />
+          <label
+            htmlFor="video-upload"
+            className={`px-2 py-1 text-xs rounded cursor-pointer transition-all whitespace-nowrap flex items-center gap-1 ${isUploading
+              ? "bg-gray-600 cursor-not-allowed text-gray-400"
+              : "bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-700 hover:to-pink-700 text-white"
+              }`}
+          >
+            <Upload className="h-3.5 w-3.5" />
+            上传
+          </label>
+          <button
+            onClick={onRefresh}
+            className="px-2 py-1 text-xs bg-white/10 hover:bg-white/20 rounded text-gray-300 whitespace-nowrap flex items-center gap-1"
+          >
+            <RefreshCw className="h-3.5 w-3.5" />
+            刷新
+          </button>
+        </div>
+      </div>
+
+      {isUploading && (
+        <div className="mb-4 p-4 bg-purple-500/10 rounded-xl border border-purple-500/30">
+          <div className="flex justify-between text-sm text-purple-300 mb-2">
+            <span>📤 上传中...</span>
+            <span>{uploadProgress}%</span>
+          </div>
+          <div className="h-2 bg-black/30 rounded-full overflow-hidden">
+            <div
+              className="h-full bg-gradient-to-r from-purple-500 to-pink-500 transition-all duration-300"
+              style={{ width: `${uploadProgress}%` }}
+            />
+          </div>
+        </div>
+      )}
+
+      {uploadError && (
+        <div className="mb-4 p-4 bg-red-500/20 text-red-200 rounded-xl text-sm flex justify-between items-center">
+          <span>❌ {uploadError}</span>
+          <button onClick={onClearUploadError} className="text-red-300 hover:text-white">
+            <X className="h-3.5 w-3.5" />
+          </button>
+        </div>
+      )}
+
+      {fetchError ? (
+        <div className="p-4 bg-red-500/20 text-red-200 rounded-xl text-sm mb-4">
+          获取素材失败: {fetchError}
+          <br />
+          API: {apiBase}/api/materials/
+        </div>
+      ) : materials.length === 0 ? (
+        <div className="text-center py-8 text-gray-400">
+          <div className="text-5xl mb-4">📁</div>
+          <p>暂无视频素材</p>
+          <p className="text-sm mt-2">
+            点击上方「📤 上传视频」按钮添加视频素材
+          </p>
+        </div>
+      ) : (
+        <div
+          className="space-y-2 max-h-64 overflow-y-auto hide-scrollbar"
+          style={{ contentVisibility: 'auto' }}
+        >
+          {materials.map((m) => (
+            <div
+              key={m.id}
+              ref={(el) => registerMaterialRef(m.id, el)}
+              className={`p-3 rounded-lg border transition-all flex items-center justify-between group ${selectedMaterial === m.id
+                ? "border-purple-500 bg-purple-500/20"
+                : "border-white/10 bg-white/5 hover:border-white/30"
+                }`}
+            >
+              <button onClick={() => onSelectMaterial(m.id)} className="flex-1 text-left">
+                <div className="text-white text-sm truncate">{m.scene || m.name}</div>
+                <div className="text-gray-400 text-xs">{m.size_mb.toFixed(1)} MB</div>
+              </button>
+              <div className="flex items-center gap-2 pl-2">
+                <button
+                  onClick={(e) => {
+                    e.stopPropagation();
+                    if (m.path) {
+                      onPreviewMaterial(m.path);
+                    }
+                  }}
+                  className="p-1 text-gray-500 hover:text-white opacity-0 group-hover:opacity-100 transition-opacity"
+                  title="预览视频"
+                >
+                  <Eye className="h-4 w-4" />
+                </button>
+                <button
+                  onClick={(e) => {
+                    e.stopPropagation();
+                    onDeleteMaterial(m.id);
+                  }}
+                  className="p-1 text-gray-500 hover:text-red-400 opacity-0 group-hover:opacity-100 transition-opacity"
+                  title="删除素材"
+                >
+                  <Trash2 className="h-4 w-4" />
+                </button>
+              </div>
+            </div>
+          ))}
+        </div>
+      )}
+    </div>
+  );
+}
--- a/frontend/src/features/home/ui/PreviewPanel.tsx
+++ b/frontend/src/features/home/ui/PreviewPanel.tsx
@@ -0,0 +1,74 @@
+import Link from "next/link";
+import { Download, Send } from "lucide-react";
+
+interface Task {
+  task_id: string;
+  status: string;
+  progress: number;
+  message: string;
+}
+
+interface PreviewPanelProps {
+  currentTask: Task | null;
+  isGenerating: boolean;
+  generatedVideo: string | null;
+}
+
+export function PreviewPanel({
+  currentTask,
+  isGenerating,
+  generatedVideo,
+}: PreviewPanelProps) {
+  return (
+    <>
+      {currentTask && isGenerating && (
+        <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
+          <h2 className="text-lg font-semibold text-white mb-4">⏳ 生成进度</h2>
+          <div className="space-y-3">
+            <div className="h-3 bg-black/30 rounded-full overflow-hidden">
+              <div
+                className="h-full bg-gradient-to-r from-purple-500 to-pink-500 transition-all duration-300"
+                style={{ width: `${currentTask.progress}%` }}
+              />
+            </div>
+            <p className="text-gray-300">正在AI生成中...</p>
+          </div>
+        </div>
+      )}
+
+      <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
+        <h2 className="text-lg font-semibold text-white mb-4 flex items-center gap-2">🎥 作品预览</h2>
+        <div className="aspect-video bg-black/50 rounded-xl overflow-hidden flex items-center justify-center">
+          {generatedVideo ? (
+            <video src={generatedVideo} controls className="w-full h-full object-contain" />
+          ) : (
+            <div className="text-gray-500 text-center">
+              <div className="text-5xl mb-4">📹</div>
+              <p>生成的作品将在这里预览</p>
+            </div>
+          )}
+        </div>
+
+        {generatedVideo && (
+          <>
+            <a
+              href={generatedVideo}
+              download
+              className="mt-4 w-full py-3 rounded-xl bg-green-600 hover:bg-green-700 text-white font-medium flex items-center justify-center gap-2 transition-colors"
+            >
+              <Download className="h-4 w-4" />
+              下载视频
+            </a>
+            <Link
+              href="/publish"
+              className="mt-3 w-full py-3 rounded-xl bg-gradient-to-r from-purple-600 to-pink-600 hover:from-purple-700 hover:to-pink-700 text-white font-medium flex items-center justify-center gap-2 transition-colors"
+            >
+              <Send className="h-4 w-4" />
+              发布到社交平台
+            </Link>
+          </>
+        )}
+      </div>
+    </>
+  );
+}
--- a/frontend/src/features/home/ui/RefAudioPanel.tsx
+++ b/frontend/src/features/home/ui/RefAudioPanel.tsx
@@ -0,0 +1,277 @@
+import { useEffect, useState } from "react";
+import type { MouseEvent } from "react";
+import { Upload, RefreshCw, Play, Pause, Pencil, Trash2, Check, X, Mic, Square } from "lucide-react";
+
+interface RefAudio {
+  id: string;
+  name: string;
+  path: string;
+  ref_text: string;
+  duration_sec: number;
+  created_at: number;
+}
+
+interface RefAudioPanelProps {
+  refAudios: RefAudio[];
+  selectedRefAudio: RefAudio | null;
+  onSelectRefAudio: (audio: RefAudio) => void;
+  isUploadingRef: boolean;
+  uploadRefError: string | null;
+  onClearUploadRefError: () => void;
+  onUploadRefAudio: (file: File) => void;
+  onFetchRefAudios: () => void;
+  playingAudioId: string | null;
+  onTogglePlayPreview: (audio: RefAudio, event: MouseEvent) => void;
+  editingAudioId: string | null;
+  editName: string;
+  onEditNameChange: (value: string) => void;
+  onStartEditing: (audio: RefAudio, event: MouseEvent) => void;
+  onSaveEditing: (id: string, event: MouseEvent) => void;
+  onCancelEditing: (event: MouseEvent) => void;
+  onDeleteRefAudio: (id: string) => void;
+  recordedBlob: Blob | null;
+  isRecording: boolean;
+  recordingTime: number;
+  onStartRecording: () => void;
+  onStopRecording: () => void;
+  onUseRecording: () => void;
+  formatRecordingTime: (seconds: number) => string;
+  fixedRefText: string;
+}
+
+export function RefAudioPanel({
+  refAudios,
+  selectedRefAudio,
+  onSelectRefAudio,
+  isUploadingRef,
+  uploadRefError,
+  onClearUploadRefError,
+  onUploadRefAudio,
+  onFetchRefAudios,
+  playingAudioId,
+  onTogglePlayPreview,
+  editingAudioId,
+  editName,
+  onEditNameChange,
+  onStartEditing,
+  onSaveEditing,
+  onCancelEditing,
+  onDeleteRefAudio,
+  recordedBlob,
+  isRecording,
+  recordingTime,
+  onStartRecording,
+  onStopRecording,
+  onUseRecording,
+  formatRecordingTime,
+  fixedRefText,
+}: RefAudioPanelProps) {
+  const [recordedUrl, setRecordedUrl] = useState<string | null>(null);
+
+  useEffect(() => {
+    if (!recordedBlob) {
+      setRecordedUrl(null);
+      return;
+    }
+    const url = URL.createObjectURL(recordedBlob);
+    setRecordedUrl(url);
+    return () => {
+      URL.revokeObjectURL(url);
+    };
+  }, [recordedBlob]);
+
+  return (
+    <div className="space-y-4">
+      <div>
+        <div className="flex justify-between items-center mb-2">
+          <span className="text-sm text-gray-300">📁 我的参考音频</span>
+          <div className="flex gap-2">
+            <input
+              type="file"
+              id="ref-audio-upload"
+              accept=".wav,.mp3,.m4a,.webm,.ogg,.flac,.aac"
+              onChange={(e) => {
+                const file = e.target.files?.[0];
+                if (file) {
+                  onUploadRefAudio(file);
+                }
+                e.target.value = '';
+              }}
+              className="hidden"
+            />
+            <label
+              htmlFor="ref-audio-upload"
+              className={`px-2 py-1 text-xs rounded cursor-pointer transition-all flex items-center gap-1 ${isUploadingRef
+                ? "bg-gray-600 cursor-not-allowed text-gray-400"
+                : "bg-purple-600 hover:bg-purple-700 text-white"
+                }`}
+            >
+              <Upload className="h-3.5 w-3.5" />
+              上传
+            </label>
+            <button
+              onClick={onFetchRefAudios}
+              className="px-2 py-1 text-xs bg-white/10 hover:bg-white/20 rounded text-gray-300 flex items-center gap-1"
+            >
+              <RefreshCw className="h-3.5 w-3.5" />
+              刷新
+            </button>
+          </div>
+        </div>
+
+        {isUploadingRef && (
+          <div className="mb-2 p-2 bg-purple-500/10 rounded text-sm text-purple-300">
+            ⏳ 上传中...
+          </div>
+        )}
+
+        {uploadRefError && (
+          <div className="mb-2 p-2 bg-red-500/20 text-red-200 rounded text-xs flex justify-between">
+            <span>❌ {uploadRefError}</span>
+            <button onClick={onClearUploadRefError} className="text-red-300 hover:text-white">
+              <X className="h-3.5 w-3.5" />
+            </button>
+          </div>
+        )}
+
+        {refAudios.length === 0 ? (
+          <div className="text-center py-4 text-gray-500 text-sm">
+            暂无参考音频，请上传或录制
+          </div>
+        ) : (
+          <div className="grid grid-cols-2 gap-2" style={{ contentVisibility: 'auto' }}>
+            {refAudios.map((audio) => (
+              <div
+                key={audio.id}
+                className={`p-2 rounded-lg border transition-all relative group cursor-pointer ${selectedRefAudio?.id === audio.id
+                  ? "border-purple-500 bg-purple-500/20"
+                  : "border-white/10 bg-white/5 hover:border-white/30"
+                  }`}
+                onClick={() => {
+                  if (editingAudioId !== audio.id) {
+                    onSelectRefAudio(audio);
+                  }
+                }}
+              >
+                {editingAudioId === audio.id ? (
+                  <div className="flex items-center gap-1" onClick={(e) => e.stopPropagation()}>
+                    <input
+                      type="text"
+                      value={editName}
+                      onChange={(e) => onEditNameChange(e.target.value)}
+                      className="w-full bg-black/50 text-white text-xs px-1 py-0.5 rounded border border-purple-500 focus:outline-none"
+                      autoFocus
+                      onKeyDown={(e) => {
+                        if (e.key === 'Enter') onSaveEditing(audio.id, e as unknown as MouseEvent);
+                        if (e.key === 'Escape') onCancelEditing(e as unknown as MouseEvent);
+                      }}
+                    />
+                    <button onClick={(e) => onSaveEditing(audio.id, e)} className="text-green-400 hover:text-green-300 text-xs">
+                      <Check className="h-3 w-3" />
+                    </button>
+                    <button onClick={(e) => onCancelEditing(e)} className="text-gray-400 hover:text-gray-300 text-xs">
+                      <X className="h-3 w-3" />
+                    </button>
+                  </div>
+                ) : (
+                  <>
+                    <div className="flex justify-between items-start mb-1">
+                      <div className="text-white text-xs truncate pr-1 flex-1" title={audio.name}>
+                        {audio.name}
+                      </div>
+                      <div className="flex gap-1 opacity-0 group-hover:opacity-100 transition-opacity">
+                        <button
+                          onClick={(e) => onTogglePlayPreview(audio, e)}
+                          className="text-gray-400 hover:text-purple-400 text-xs"
+                          title="试听"
+                        >
+                          {playingAudioId === audio.id ? (
+                            <Pause className="h-3.5 w-3.5" />
+                          ) : (
+                            <Play className="h-3.5 w-3.5" />
+                          )}
+                        </button>
+                        <button
+                          onClick={(e) => onStartEditing(audio, e)}
+                          className="text-gray-400 hover:text-blue-400 text-xs"
+                          title="重命名"
+                        >
+                          <Pencil className="h-3.5 w-3.5" />
+                        </button>
+                        <button
+                          onClick={(e) => {
+                            e.stopPropagation();
+                            onDeleteRefAudio(audio.id);
+                          }}
+                          className="text-gray-400 hover:text-red-400 text-xs"
+                          title="删除"
+                        >
+                          <Trash2 className="h-3.5 w-3.5" />
+                        </button>
+                      </div>
+                    </div>
+                    <div className="text-gray-400 text-xs">{audio.duration_sec.toFixed(1)}s</div>
+                  </>
+                )}
+              </div>
+            ))}
+          </div>
+        )}
+      </div>
+
+      <div className="border-t border-white/10 pt-4">
+        <span className="text-sm text-gray-300 mb-2 block">🎤 或在线录音</span>
+        <div className="flex gap-2 items-center">
+          {!isRecording ? (
+            <button
+              onClick={onStartRecording}
+              className="px-4 py-2 bg-red-600 hover:bg-red-700 text-white rounded-lg text-sm font-medium transition-colors flex items-center gap-2"
+            >
+              <Mic className="h-4 w-4" />
+              开始录音
+            </button>
+          ) : (
+            <button
+              onClick={onStopRecording}
+              className="px-4 py-2 bg-gray-600 hover:bg-gray-700 text-white rounded-lg text-sm font-medium transition-colors flex items-center gap-2"
+            >
+              <Square className="h-4 w-4" />
+              停止
+            </button>
+          )}
+          {isRecording && (
+            <span className="text-red-400 text-sm animate-pulse">
+              🔴 录音中 {formatRecordingTime(recordingTime)}
+            </span>
+          )}
+        </div>
+
+        {recordedBlob && !isRecording && (
+          <div className="mt-3 p-3 bg-green-500/10 border border-green-500/30 rounded-lg">
+            <div className="flex items-center gap-2 mb-2">
+              <span className="text-green-300 text-sm">✅ 录音完成 ({formatRecordingTime(recordingTime)})</span>
+              <audio src={recordedUrl ?? undefined} controls className="h-8" />
+            </div>
+            <button
+              onClick={onUseRecording}
+              disabled={isUploadingRef}
+              className="px-3 py-1 bg-green-600 hover:bg-green-700 text-white rounded text-sm disabled:bg-gray-600"
+            >
+              使用此录音
+            </button>
+          </div>
+        )}
+      </div>
+
+      <div className="border-t border-white/10 pt-4">
+        <label className="text-sm text-gray-300 mb-2 block">📝 录音/上传时请朗读以下内容：</label>
+        <div className="w-full bg-black/30 border border-white/10 rounded-lg p-3 text-white text-sm">
+          {fixedRefText}
+        </div>
+        <p className="text-xs text-gray-500 mt-1">
+          请清晰朗读上述内容完成录音，系统将以此为参考克隆您的声音
+        </p>
+      </div>
+    </div>
+  );
+}
--- a/frontend/src/features/home/ui/ScriptEditor.tsx
+++ b/frontend/src/features/home/ui/ScriptEditor.tsx
@@ -0,0 +1,66 @@
+import { FileText, Loader2, Sparkles } from "lucide-react";
+
+interface ScriptEditorProps {
+  text: string;
+  onChangeText: (value: string) => void;
+  onOpenExtractModal: () => void;
+  onGenerateMeta: () => void;
+  isGeneratingMeta: boolean;
+}
+
+export function ScriptEditor({
+  text,
+  onChangeText,
+  onOpenExtractModal,
+  onGenerateMeta,
+  isGeneratingMeta,
+}: ScriptEditorProps) {
+  return (
+    <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
+      <div className="flex justify-between items-center gap-2 mb-4">
+        <h2 className="text-base sm:text-lg font-semibold text-white flex items-center gap-2 whitespace-nowrap">
+          ✍️ 文案提取与编辑
+        </h2>
+        <div className="flex gap-2">
+          <button
+            onClick={onOpenExtractModal}
+            className="px-2 py-1 text-xs rounded transition-all whitespace-nowrap bg-purple-600 hover:bg-purple-700 text-white flex items-center gap-1"
+          >
+            <FileText className="h-3.5 w-3.5" />
+            文案提取助手
+          </button>
+          <button
+            onClick={onGenerateMeta}
+            disabled={isGeneratingMeta || !text.trim()}
+            className={`px-2 py-1 text-xs rounded transition-all whitespace-nowrap ${isGeneratingMeta || !text.trim()
+              ? "bg-gray-600 cursor-not-allowed text-gray-400"
+              : "bg-gradient-to-r from-blue-600 to-cyan-600 hover:from-blue-700 hover:to-cyan-700 text-white"
+              }`}
+          >
+            {isGeneratingMeta ? (
+              <span className="flex items-center gap-1">
+                <Loader2 className="h-3.5 w-3.5 animate-spin" />
+                生成中...
+              </span>
+            ) : (
+              <span className="flex items-center gap-1">
+                <Sparkles className="h-3.5 w-3.5" />
+                AI生成标题标签
+              </span>
+            )}
+          </button>
+        </div>
+      </div>
+      <textarea
+        value={text}
+        onChange={(e) => onChangeText(e.target.value)}
+        placeholder="请输入你想说的话..."
+        className="w-full h-40 bg-black/30 border border-white/10 rounded-xl p-4 text-white placeholder-gray-500 resize-none focus:outline-none focus:border-purple-500 transition-colors hide-scrollbar"
+      />
+      <div className="flex justify-between mt-2 text-sm text-gray-400">
+        <span>{text.length} 字</span>
+        <span>预计时长: ~{Math.ceil(text.length / 4)} 秒</span>
+      </div>
+    </div>
+  );
+}
--- a/frontend/src/features/home/ui/TitleSubtitlePanel.tsx
+++ b/frontend/src/features/home/ui/TitleSubtitlePanel.tsx
@@ -0,0 +1,315 @@
+import type { RefObject } from "react";
+import { Eye } from "lucide-react";
+
+interface SubtitleStyleOption {
+  id: string;
+  label: string;
+  font_family?: string;
+  font_file?: string;
+  font_size?: number;
+  highlight_color?: string;
+  normal_color?: string;
+  stroke_color?: string;
+  stroke_size?: number;
+  letter_spacing?: number;
+  bottom_margin?: number;
+  is_default?: boolean;
+}
+
+interface TitleStyleOption {
+  id: string;
+  label: string;
+  font_family?: string;
+  font_file?: string;
+  font_size?: number;
+  color?: string;
+  stroke_color?: string;
+  stroke_size?: number;
+  letter_spacing?: number;
+  font_weight?: number;
+  top_margin?: number;
+  is_default?: boolean;
+}
+
+interface TitleSubtitlePanelProps {
+  showStylePreview: boolean;
+  onTogglePreview: () => void;
+  videoTitle: string;
+  onTitleChange: (value: string) => void;
+  onTitleCompositionStart?: () => void;
+  onTitleCompositionEnd?: (value: string) => void;
+  titleStyles: TitleStyleOption[];
+  selectedTitleStyleId: string;
+  onSelectTitleStyle: (id: string) => void;
+  titleFontSize: number;
+  onTitleFontSizeChange: (value: number) => void;
+  subtitleStyles: SubtitleStyleOption[];
+  selectedSubtitleStyleId: string;
+  onSelectSubtitleStyle: (id: string) => void;
+  subtitleFontSize: number;
+  onSubtitleFontSizeChange: (value: number) => void;
+  enableSubtitles: boolean;
+  onToggleSubtitles: (value: boolean) => void;
+  resolveAssetUrl: (path?: string | null) => string | null;
+  getFontFormat: (fontFile?: string) => string;
+  buildTextShadow: (color: string, size: number) => string;
+  previewScale?: number;
+  previewAspectRatio?: string;
+  previewBaseWidth?: number;
+  previewBaseHeight?: number;
+  previewContainerRef?: RefObject<HTMLDivElement | null>;
+}
+
+export function TitleSubtitlePanel({
+  showStylePreview,
+  onTogglePreview,
+  videoTitle,
+  onTitleChange,
+  onTitleCompositionStart,
+  onTitleCompositionEnd,
+  titleStyles,
+  selectedTitleStyleId,
+  onSelectTitleStyle,
+  titleFontSize,
+  onTitleFontSizeChange,
+  subtitleStyles,
+  selectedSubtitleStyleId,
+  onSelectSubtitleStyle,
+  subtitleFontSize,
+  onSubtitleFontSizeChange,
+  enableSubtitles,
+  onToggleSubtitles,
+  resolveAssetUrl,
+  getFontFormat,
+  buildTextShadow,
+  previewScale = 1,
+  previewAspectRatio = '16 / 9',
+  previewBaseWidth = 1280,
+  previewBaseHeight = 720,
+  previewContainerRef,
+}: TitleSubtitlePanelProps) {
+  const activeSubtitleStyle = subtitleStyles.find((s) => s.id === selectedSubtitleStyleId)
+    || subtitleStyles.find((s) => s.is_default)
+    || subtitleStyles[0];
+
+  const activeTitleStyle = titleStyles.find((s) => s.id === selectedTitleStyleId)
+    || titleStyles.find((s) => s.is_default)
+    || titleStyles[0];
+
+  const previewTitleText = videoTitle.trim() || "这里是标题预览";
+  const subtitleHighlightText = "最近，一个叫Cloudbot";
+  const subtitleNormalText = "的开源项目在GitHub上彻底火了";
+
+  const subtitleHighlightColor = activeSubtitleStyle?.highlight_color || "#FFE600";
+  const subtitleNormalColor = activeSubtitleStyle?.normal_color || "#FFFFFF";
+  const subtitleStrokeColor = activeSubtitleStyle?.stroke_color || "#000000";
+  const subtitleStrokeSize = activeSubtitleStyle?.stroke_size ?? 3;
+  const subtitleLetterSpacing = activeSubtitleStyle?.letter_spacing ?? 2;
+  const subtitleBottomMargin = activeSubtitleStyle?.bottom_margin ?? 0;
+  const subtitleFontFamilyName = `SubtitlePreview-${activeSubtitleStyle?.id || "default"}`;
+  const subtitleFontUrl = activeSubtitleStyle?.font_file
+    ? resolveAssetUrl(`fonts/${activeSubtitleStyle.font_file}`)
+    : null;
+
+  const titleColor = activeTitleStyle?.color || "#FFFFFF";
+  const titleStrokeColor = activeTitleStyle?.stroke_color || "#000000";
+  const titleStrokeSize = activeTitleStyle?.stroke_size ?? 8;
+  const titleLetterSpacing = activeTitleStyle?.letter_spacing ?? 4;
+  const titleTopMargin = activeTitleStyle?.top_margin ?? 0;
+  const titleFontWeight = activeTitleStyle?.font_weight ?? 900;
+  const titleFontFamilyName = `TitlePreview-${activeTitleStyle?.id || "default"}`;
+  const titleFontUrl = activeTitleStyle?.font_file
+    ? resolveAssetUrl(`fonts/${activeTitleStyle.font_file}`)
+    : null;
+
+  return (
+    <div className="bg-white/5 rounded-2xl p-4 sm:p-6 border border-white/10 backdrop-blur-sm">
+      <div className="flex items-center justify-between mb-4 gap-2">
+        <h2 className="text-base sm:text-lg font-semibold text-white flex items-center gap-2">
+          🎬 标题与字幕
+        </h2>
+        <button
+          onClick={onTogglePreview}
+          className="px-2 py-1 text-xs bg-white/10 hover:bg-white/20 rounded text-gray-300 flex items-center gap-1"
+        >
+          <Eye className="h-3.5 w-3.5" />
+          {showStylePreview ? "收起预览" : "预览样式"}
+        </button>
+      </div>
+
+      {showStylePreview && (
+        <div
+          ref={previewContainerRef}
+          className="mb-4 rounded-xl border border-white/10 bg-black/40 relative overflow-hidden"
+          style={{ aspectRatio: previewAspectRatio, minHeight: '180px' }}
+        >
+          {(titleFontUrl || subtitleFontUrl) && (
+            <style>{`
+              ${titleFontUrl ? `@font-face { font-family: '${titleFontFamilyName}'; src: url('${titleFontUrl}') format('${getFontFormat(activeTitleStyle?.font_file)}'); font-weight: 400; font-style: normal; }` : ''}
+              ${subtitleFontUrl ? `@font-face { font-family: '${subtitleFontFamilyName}'; src: url('${subtitleFontUrl}') format('${getFontFormat(activeSubtitleStyle?.font_file)}'); font-weight: 400; font-style: normal; }` : ''}
+            `}</style>
+          )}
+          <div className="absolute inset-0 opacity-20 bg-gradient-to-br from-purple-500/40 via-transparent to-pink-500/30" />
+          <div
+            className="absolute top-0 left-0"
+            style={{
+              width: `${previewBaseWidth}px`,
+              height: `${previewBaseHeight}px`,
+              transform: `scale(${previewScale})`,
+              transformOrigin: 'top left',
+            }}
+          >
+            <div
+              className="w-full text-center"
+              style={{
+                position: 'absolute',
+                top: `${titleTopMargin}px`,
+                left: 0,
+                right: 0,
+                color: titleColor,
+                fontSize: `${titleFontSize}px`,
+                fontWeight: titleFontWeight,
+                fontFamily: titleFontUrl
+                  ? `'${titleFontFamilyName}', "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans SC", sans-serif`
+                  : '"PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans SC", sans-serif',
+                textShadow: buildTextShadow(titleStrokeColor, titleStrokeSize),
+                letterSpacing: `${titleLetterSpacing}px`,
+                lineHeight: 1.2,
+                opacity: videoTitle.trim() ? 1 : 0.7,
+                padding: '0 5%',
+              }}
+            >
+              {previewTitleText}
+            </div>
+
+            <div
+              className="w-full text-center"
+              style={{
+                position: 'absolute',
+                bottom: `${subtitleBottomMargin}px`,
+                left: 0,
+                right: 0,
+                fontSize: `${subtitleFontSize}px`,
+                fontFamily: subtitleFontUrl
+                  ? `'${subtitleFontFamilyName}', "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans SC", sans-serif`
+                  : '"PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans SC", sans-serif',
+                textShadow: buildTextShadow(subtitleStrokeColor, subtitleStrokeSize),
+                letterSpacing: `${subtitleLetterSpacing}px`,
+                lineHeight: 1.35,
+                padding: '0 6%',
+              }}
+            >
+              {enableSubtitles ? (
+                <>
+                  <span style={{ color: subtitleHighlightColor }}>{subtitleHighlightText}</span>
+                  <span style={{ color: subtitleNormalColor }}>{subtitleNormalText}</span>
+                </>
+              ) : (
+                <span className="text-gray-400 text-sm">字幕已关闭</span>
+              )}
+            </div>
+          </div>
+        </div>
+      )}
+
+      <div className="mb-4">
+        <label className="text-sm text-gray-300 mb-2 block">片头标题（限制15个字）</label>
+        <input
+          type="text"
+          value={videoTitle}
+          onChange={(e) => onTitleChange(e.target.value)}
+          onCompositionStart={onTitleCompositionStart}
+          onCompositionEnd={(e) => onTitleCompositionEnd?.(e.currentTarget.value)}
+          placeholder="输入视频标题，将在片头显示"
+          className="w-full px-3 sm:px-4 py-2 text-sm sm:text-base bg-black/30 border border-white/10 rounded-xl text-white placeholder-gray-500 focus:outline-none focus:border-purple-500 transition-colors"
+        />
+      </div>
+
+      {titleStyles.length > 0 && (
+        <div className="mb-4">
+          <label className="text-sm text-gray-300 mb-2 block">标题样式</label>
+          <div className="grid grid-cols-2 gap-2">
+            {titleStyles.map((style) => (
+              <button
+                key={style.id}
+                onClick={() => onSelectTitleStyle(style.id)}
+                className={`p-2 rounded-lg border transition-all text-left ${selectedTitleStyleId === style.id
+                  ? "border-purple-500 bg-purple-500/20"
+                  : "border-white/10 bg-white/5 hover:border-white/30"
+                  }`}
+              >
+                <div className="text-white text-sm truncate">{style.label}</div>
+                <div className="text-xs text-gray-400 truncate">
+                  {style.font_family || style.font_file || ""}
+                </div>
+              </button>
+            ))}
+          </div>
+          <div className="mt-3">
+            <label className="text-xs text-gray-400 mb-2 block">标题字号: {titleFontSize}px</label>
+            <input
+              type="range"
+              min="48"
+              max="110"
+              step="1"
+              value={titleFontSize}
+              onChange={(e) => onTitleFontSizeChange(parseInt(e.target.value, 10))}
+              className="w-full accent-purple-500"
+            />
+          </div>
+        </div>
+      )}
+
+      {enableSubtitles && subtitleStyles.length > 0 && (
+        <div className="mt-4">
+          <label className="text-sm text-gray-300 mb-2 block">字幕样式</label>
+          <div className="grid grid-cols-2 gap-2">
+            {subtitleStyles.map((style) => (
+              <button
+                key={style.id}
+                onClick={() => onSelectSubtitleStyle(style.id)}
+                className={`p-2 rounded-lg border transition-all text-left ${selectedSubtitleStyleId === style.id
+                  ? "border-purple-500 bg-purple-500/20"
+                  : "border-white/10 bg-white/5 hover:border-white/30"
+                  }`}
+              >
+                <div className="text-white text-sm truncate">{style.label}</div>
+                <div className="text-xs text-gray-400 truncate">
+                  {style.font_family || style.font_file || ""}
+                </div>
+              </button>
+            ))}
+          </div>
+          <div className="mt-3">
+            <label className="text-xs text-gray-400 mb-2 block">字幕字号: {subtitleFontSize}px</label>
+            <input
+              type="range"
+              min="32"
+              max="90"
+              step="1"
+              value={subtitleFontSize}
+              onChange={(e) => onSubtitleFontSizeChange(parseInt(e.target.value, 10))}
+              className="w-full accent-purple-500"
+            />
+          </div>
+        </div>
+      )}
+
+      <div className="mt-4 pt-4 border-t border-white/10 flex items-center justify-between">
+        <div>
+          <span className="text-sm text-gray-300">逐字高亮字幕</span>
+          <p className="text-xs text-gray-500 mt-1">自动生成卡拉OK效果字幕</p>
+        </div>
+        <label className="relative inline-flex items-center cursor-pointer">
+          <input
+            type="checkbox"
+            checked={enableSubtitles}
+            onChange={(e) => onToggleSubtitles(e.target.checked)}
+            className="sr-only peer"
+          />
+          <div className="w-11 h-6 bg-gray-600 peer-focus:outline-none rounded-full peer peer-checked:after:translate-x-full peer-checked:after:border-white after:content-[''] after:absolute after:top-[2px] after:left-[2px] after:bg-white after:border-gray-300 after:border after:rounded-full after:h-5 after:w-5 after:transition-all peer-checked:bg-purple-600"></div>
+        </label>
+      </div>
+    </div>
+  );
+}
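
Review note: the preview above calls `getFontFormat(...)` and `buildTextShadow(...)`, but their definitions are not part of this hunk. The sketch below is one plausible shape for them, not the author's implementation. The extension-to-format mapping, the TrueType fallback, and the eight-direction shadow layout are all assumptions made for illustration.

```typescript
// Hypothetical sketches: the real helpers are defined outside this hunk.

/** Map a font file name to the keyword used inside CSS `format(...)`. */
function getFontFormat(fontFile?: string): string {
  const ext = fontFile?.split(".").pop()?.toLowerCase();
  switch (ext) {
    case "otf":
      return "opentype";
    case "woff":
      return "woff";
    case "woff2":
      return "woff2";
    default:
      // .ttf and anything unknown fall back to TrueType (an assumption).
      return "truetype";
  }
}

/** Approximate a text stroke by stacking hard shadows around the glyph. */
function buildTextShadow(color: string, size: number): string {
  if (size <= 0) return "none";
  const offsets: Array<[number, number]> = [
    [-size, -size], [0, -size], [size, -size],
    [-size, 0], [size, 0],
    [-size, size], [0, size], [size, size],
  ];
  return offsets.map(([x, y]) => `${x}px ${y}px 0 ${color}`).join(", ");
}
```

Stacked shadows are only an approximation of a stroke. They tend to look acceptable at small stroke sizes and become visibly blocky as the size grows, which may be worth checking against the rendered video output.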
--- a/frontend/src/features/home/ui/VoiceSelector.tsx
+++ b/frontend/src/features/home/ui/VoiceSelector.tsx
@@ -0,0 +1,75 @@
+import type { ReactNode } from "react";
+import { Mic, Volume2 } from "lucide-react";
+
+interface VoiceOption {
+  id: string;
+  name: string;
+}
+
+interface VoiceSelectorProps {
+  ttsMode: "edgetts" | "voiceclone";
+  onSelectTtsMode: (mode: "edgetts" | "voiceclone") => void;
+  voices: VoiceOption[];
+  voice: string;
+  onSelectVoice: (id: string) => void;
+  voiceCloneSlot: ReactNode;
+}
+
+export function VoiceSelector({
+  ttsMode,
+  onSelectTtsMode,
+  voices,
+  voice,
+  onSelectVoice,
+  voiceCloneSlot,
+}: VoiceSelectorProps) {
+  return (
+    <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
+      <h2 className="text-lg font-semibold text-white mb-4 flex items-center gap-2">
+        🎙️ 配音方式
+      </h2>
+
+      <div className="flex gap-2 mb-4">
+        <button
+          onClick={() => onSelectTtsMode("edgetts")}
+          className={`flex-1 py-2 px-4 rounded-lg font-medium transition-all flex items-center justify-center gap-2 ${ttsMode === "edgetts"
+            ? "bg-purple-600 text-white"
+            : "bg-white/10 text-gray-300 hover:bg-white/20"
+            }`}
+        >
+          <Volume2 className="h-4 w-4" />
+          选择声音
+        </button>
+        <button
+          onClick={() => onSelectTtsMode("voiceclone")}
+          className={`flex-1 py-2 px-4 rounded-lg font-medium transition-all flex items-center justify-center gap-2 ${ttsMode === "voiceclone"
+            ? "bg-purple-600 text-white"
+            : "bg-white/10 text-gray-300 hover:bg-white/20"
+            }`}
+        >
+          <Mic className="h-4 w-4" />
+          克隆声音
+        </button>
+      </div>
+
+      {ttsMode === "edgetts" && (
+        <div className="grid grid-cols-2 gap-3">
+          {voices.map((v) => (
+            <button
+              key={v.id}
+              onClick={() => onSelectVoice(v.id)}
+              className={`p-3 rounded-xl border-2 transition-all text-left ${voice === v.id
+                ? "border-purple-500 bg-purple-500/20"
+                : "border-white/10 bg-white/5 hover:border-white/30"
+                }`}
+            >
+              <span className="text-white text-sm">{v.name}</span>
+            </button>
+          ))}
+        </div>
+      )}
+
+      {ttsMode === "voiceclone" && voiceCloneSlot}
+    </div>
+  );
+}
--- a/frontend/src/features/publish/model/usePublishController.ts
+++ b/frontend/src/features/publish/model/usePublishController.ts
@@ -0,0 +1,323 @@
+import { useEffect, useMemo, useState } from "react";
+import useSWR from "swr";
+import api from "@/shared/api/axios";
+import { formatDate, getApiBaseUrl, isAbsoluteUrl, resolveMediaUrl } from "@/shared/lib/media";
+import { clampTitle } from "@/shared/lib/title";
+import { useTitleInput } from "@/shared/hooks/useTitleInput";
+import { useAuth } from "@/contexts/AuthContext";
+
+interface Account {
+  platform: string;
+  name: string;
+  logged_in: boolean;
+  enabled: boolean;
+}
+
+interface Video {
+  name: string;
+  path: string;
+}
+
+const fetcher = (url: string) => api.get(url).then((res) => res.data);
+
+export const usePublishController = () => {
+  const apiBase = getApiBaseUrl();
+
+  const [accounts, setAccounts] = useState<Account[]>([]);
+  const [videos, setVideos] = useState<Video[]>([]);
+  const [selectedVideo, setSelectedVideo] = useState<string>("");
+  const [videoFilter, setVideoFilter] = useState<string>("");
+  const [previewVideoUrl, setPreviewVideoUrl] = useState<string | null>(null);
+  const [selectedPlatforms, setSelectedPlatforms] = useState<string[]>([]);
+  const [title, setTitle] = useState<string>("");
+  const [tags, setTags] = useState<string>("");
+  const [isPublishing, setIsPublishing] = useState(false);
+  const [publishResults, setPublishResults] = useState<any[]>([]);
+  const [scheduleMode, setScheduleMode] = useState<"now" | "scheduled">("now");
+  const [publishTime, setPublishTime] = useState<string>("");
+  const [qrCodeImage, setQrCodeImage] = useState<string | null>(null);
+  const [qrPlatform, setQrPlatform] = useState<string | null>(null);
+  const [isLoadingQR, setIsLoadingQR] = useState(false);
+
+  // Use the global auth state
+  const { userId, isLoading: isAuthLoading } = useAuth();
+  // Whether restoring from localStorage has finished
+  const [isRestored, setIsRestored] = useState(false);
+
+  const titleInput = useTitleInput({
+    value: title,
+    onChange: setTitle,
+  });
+
+  const fetchAccounts = async () => {
+    try {
+      const { data } = await api.get("/api/publish/accounts");
+      setAccounts(data.accounts || []);
+    } catch (error) {
+      console.error("获取账号失败:", error);
+    }
+  };
+
+  const fetchVideos = async () => {
+    try {
+      const { data } = await api.get("/api/videos/generated");
+
+      const nextVideos = (data.videos || []).map((v: any) => ({
+        name: `${formatDate(v.created_at)} (${Number(v.size_mb ?? 0).toFixed(1)}MB)`,
+        path: v.path.startsWith("/") ? v.path.slice(1) : v.path,
+      }));
+
+      setVideos(nextVideos);
+      if (nextVideos.length > 0) {
+        setSelectedVideo(nextVideos[0].path);
+      }
+    } catch (error) {
+      console.error("获取视频失败:", error);
+    }
+  };
+
+  useEffect(() => {
+    void Promise.allSettled([
+      fetchAccounts(),
+      fetchVideos(),
+    ]);
+  }, []);
+
+  useEffect(() => {
+    if (typeof window === "undefined") return;
+    if ("scrollRestoration" in window.history) {
+      window.history.scrollRestoration = "manual";
+    }
+    window.scrollTo({ top: 0, left: 0, behavior: "auto" });
+  }, []);
+
+  // Storage key prefix (userId for logged-in users, "guest" otherwise)
+  const storageKey = userId || "guest";
+
+  // Restore user input from localStorage (after auth has finished loading)
+  useEffect(() => {
+    if (isAuthLoading) return;
+
+    // Keys are scoped per user; logged-out users share the "guest" scope
+    const savedTitle = localStorage.getItem(`vigent_${storageKey}_publish_title`);
+    const savedTags = localStorage.getItem(`vigent_${storageKey}_publish_tags`);
+
+    if (savedTitle) setTitle(clampTitle(savedTitle));
+    if (savedTags) {
+      // Accept both a JSON array (AI-generated) and a plain string (typed by hand)
+      try {
+        const parsed = JSON.parse(savedTags);
+        if (Array.isArray(parsed)) {
+          setTags(parsed.join(", "));
+        } else {
+          setTags(savedTags);
+        }
+      } catch {
+        setTags(savedTags);
+      }
+    }
+
+    // Only allow saving once the restore has finished
+    setIsRestored(true);
+  }, [storageKey, isAuthLoading]);
+
+  // Persist user input to localStorage (only after the restore; logged-out users can save too)
+  useEffect(() => {
+    if (!isRestored) return;
+    const timeout = setTimeout(() => {
+      localStorage.setItem(`vigent_${storageKey}_publish_title`, title);
+    }, 300);
+    return () => clearTimeout(timeout);
+  }, [title, storageKey, isRestored]);
+
+  useEffect(() => {
+    if (!isRestored) return;
+    const timeout = setTimeout(() => {
+      localStorage.setItem(`vigent_${storageKey}_publish_tags`, tags);
+    }, 300);
+    return () => clearTimeout(timeout);
+  }, [tags, storageKey, isRestored]);
+
+  const togglePlatform = (platform: string) => {
+    if (selectedPlatforms.includes(platform)) {
+      setSelectedPlatforms(selectedPlatforms.filter((p) => p !== platform));
+    } else {
+      setSelectedPlatforms([...selectedPlatforms, platform]);
+    }
+  };
+
+  const handlePublish = async () => {
+    if (!selectedVideo || !title.trim() || selectedPlatforms.length === 0) {
+      alert("请选择视频、填写标题并选择至少一个平台");
+      return;
+    }
+
+    setIsPublishing(true);
+    setPublishResults([]);
+
+    const tagList = tags.split(/[,，\s]+/).filter((t) => t.trim());
+
+    for (const platform of selectedPlatforms) {
+      try {
+        const { data: result } = await api.post("/api/publish", {
+          video_path: selectedVideo,
+          platform,
+          title,
+          tags: tagList,
+          description: "",
+          publish_time: scheduleMode === "scheduled" && publishTime
+            ? new Date(publishTime).toISOString()
+            : null,
+        });
+
+        setPublishResults((prev) => [...prev, result]);
+        // Automatically clear a successful result after 10 seconds
+        if (result.success) {
+          setTimeout(() => {
+            setPublishResults((prev) => prev.filter((r) => r !== result));
+          }, 10000);
+        }
+      } catch (error: any) {
+        const message = error.response?.data?.detail || String(error);
+        setPublishResults((prev) => [
+          ...prev,
+          { platform, success: false, message },
+        ]);
+      }
+    }
+
+    setIsPublishing(false);
+  };
+
+  // SWR Polling for Login Status
+  useSWR(
+    qrPlatform ? `${apiBase}/api/publish/login/status/${qrPlatform}` : null,
+    fetcher,
+    {
+      refreshInterval: 2000,
+      onSuccess: (data) => {
+        if (data.success) {
+          setQrCodeImage(null);
+          setQrPlatform(null);
+          alert("✅ 登录成功！");
+          fetchAccounts();
+        }
+      },
+    }
+  );
+
+  // Timeout logic for QR code (business logic: stop after 2 mins)
+  useEffect(() => {
+    let timer: ReturnType<typeof setTimeout> | undefined;
+    if (qrPlatform) {
+      timer = setTimeout(() => {
+        if (qrPlatform) {
+          setQrPlatform(null);
+          setQrCodeImage(null);
+          alert("登录超时，请重试");
+        }
+      }, 120000);
+    }
+    return () => clearTimeout(timer);
+  }, [qrPlatform]);
+
+  const handleLogin = async (platform: string) => {
+    setIsLoadingQR(true);
+    setQrPlatform(platform);
+    setQrCodeImage(null);
+    try {
+      const { data: result } = await api.post(`/api/publish/login/${platform}`);
+
+      if (result.success && result.qr_code) {
+        setQrCodeImage(result.qr_code);
+      } else {
+        setQrPlatform(null);
+        alert(result.message || "登录失败");
+      }
+    } catch (error: any) {
+      setQrPlatform(null);
+      alert(`登录失败: ${error.response?.data?.detail || error.message}`);
+    } finally {
+      setIsLoadingQR(false);
+    }
+  };
+
+  const handleLogout = async (platform: string) => {
+    if (!confirm("确定要注销登录吗？")) return;
+    try {
+      const { data: result } = await api.post(`/api/publish/logout/${platform}`);
+      if (result.success) {
+        alert("已注销");
+        fetchAccounts();
+      } else {
+        alert(result.message || "注销失败");
+      }
+    } catch (error: any) {
+      alert(`注销失败: ${error.response?.data?.detail || error.message}`);
+    }
+  };
+
+  const platformIcons: Record<string, string> = {
+    douyin: "🎵",
+    xiaohongshu: "📕",
+    weixin: "💬",
+    kuaishou: "⚡",
+    bilibili: "📺",
+  };
+
+  const filteredVideos = useMemo(() => {
+    const query = videoFilter.trim().toLowerCase();
+    if (!query) return videos;
+    return videos.filter((v) => v.name.toLowerCase().includes(query));
+  }, [videos, videoFilter]);
+
+  const handlePreviewVideo = (path: string) => {
+    const previewPath = isAbsoluteUrl(path)
+      ? path
+      : path.startsWith("/")
+        ? path
+        : `/${path}`;
+    setPreviewVideoUrl(resolveMediaUrl(previewPath) || previewPath);
+  };
+
+  const closeQrModal = () => {
+    setQrCodeImage(null);
+    setQrPlatform(null);
+  };
+
+  return {
+    apiBase,
+    accounts,
+    videos,
+    selectedVideo,
+    setSelectedVideo,
+    videoFilter,
+    setVideoFilter,
+    previewVideoUrl,
+    setPreviewVideoUrl,
+    selectedPlatforms,
+    title,
+    titleInput,
+    tags,
+    setTags,
+    isPublishing,
+    publishResults,
+    scheduleMode,
+    setScheduleMode,
+    publishTime,
+    setPublishTime,
+    qrCodeImage,
+    qrPlatform,
+    isLoadingQR,
+    fetchAccounts,
+    fetchVideos,
+    togglePlatform,
+    handlePublish,
+    handleLogin,
+    handleLogout,
+    platformIcons,
+    filteredVideos,
+    handlePreviewVideo,
+    closeQrModal,
+  };
+};
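
Review note: the tag handling in `usePublishController` is split between the restore effect (JSON array or plain string) and `handlePublish` (split on commas and whitespace). The sketch below pulls both into pure functions so they can be unit-tested without React. The function names `restoreTags` and `splitTags` are hypothetical and do not appear in the diff; the behavior is intended to match the inline code above.

```typescript
/** Restore tags saved either as a JSON array or as a plain string. */
function restoreTags(saved: string): string {
  try {
    const parsed: unknown = JSON.parse(saved);
    if (Array.isArray(parsed)) return parsed.join(", ");
  } catch {
    // Not JSON: fall through and treat it as hand-typed text.
  }
  return saved;
}

/** Split the tag input on ASCII commas, full-width commas, and whitespace. */
function splitTags(input: string): string[] {
  return input.split(/[,，\s]+/).filter((t) => t.trim());
}
```

With these in place, the effect body could reduce to `setTags(restoreTags(savedTags))` and the publish handler to `const tagList = splitTags(tags)`, assuming the extraction is considered worthwhile.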
--- a/frontend/src/features/publish/ui/PublishPage.tsx
+++ b/frontend/src/features/publish/ui/PublishPage.tsx
@@ -0,0 +1,381 @@
+"use client";
+
+import Link from "next/link";
+import VideoPreviewModal from "@/components/VideoPreviewModal";
+import AccountSettingsDropdown from "@/components/AccountSettingsDropdown";
+import { usePublishController } from "@/features/publish/model/usePublishController";
+import {
+  ArrowLeft,
+  RotateCcw,
+  LogOut,
+  QrCode,
+  Rocket,
+  Clock,
+  Search,
+  Eye,
+} from "lucide-react";
+
+export function PublishPage() {
+  const {
+    accounts,
+    selectedVideo,
+    setSelectedVideo,
+    videoFilter,
+    setVideoFilter,
+    previewVideoUrl,
+    setPreviewVideoUrl,
+    selectedPlatforms,
+    title,
+    titleInput,
+    tags,
+    setTags,
+    isPublishing,
+    publishResults,
+    scheduleMode,
+    setScheduleMode,
+    publishTime,
+    setPublishTime,
+    qrCodeImage,
+    qrPlatform,
+    isLoadingQR,
+    togglePlatform,
+    handlePublish,
+    handleLogin,
+    handleLogout,
+    platformIcons,
+    filteredVideos,
+    handlePreviewVideo,
+    closeQrModal,
+  } = usePublishController();
+
+  return (
+    <div className="min-h-dvh">
+      <VideoPreviewModal
+        onClose={() => setPreviewVideoUrl(null)}
+        videoUrl={previewVideoUrl}
+        title="发布视频预览"
+      />
+      {/* QR code modal */}
+      {qrPlatform && (
+        <div className="fixed inset-0 bg-black/80 flex items-center justify-center z-50">
+          <div className="bg-white rounded-2xl p-8 max-w-md min-w-[320px]">
+            <h2 className="text-2xl font-bold mb-4 text-center">🔐 扫码登录 {qrPlatform}</h2>
+            {isLoadingQR ? (
+              <div className="flex flex-col items-center py-8">
+                <div className="animate-spin w-16 h-16 border-4 border-purple-500 border-t-transparent rounded-full" />
+                <p className="text-gray-600 mt-4">正在获取二维码...</p>
+              </div>
+            ) : qrCodeImage ? (
+              <>
+                <img
+                  src={`data:image/png;base64,${qrCodeImage}`}
+                  alt="QR Code"
+                  className="w-full h-auto"
+                />
+                <p className="text-center text-gray-600 mt-4">
+                  请使用手机扫码登录
+                </p>
+              </>
+            ) : null}
+            <button
+              onClick={closeQrModal}
+              className="w-full mt-4 px-4 py-2 bg-gray-200 rounded-lg hover:bg-gray-300"
+            >
+              取消
+            </button>
+          </div>
+        </div>
+      )}
+
+      {/* Header - unified style */}
+      <header className="border-b border-white/10 bg-black/20 backdrop-blur-sm relative z-[100]">
+        <div className="max-w-6xl mx-auto px-4 sm:px-6 py-3 sm:py-4 flex items-center justify-between">
+          <Link
+            href="/"
+            className="text-xl sm:text-2xl font-bold text-white flex items-center gap-2 sm:gap-3 hover:opacity-80 transition-opacity"
+          >
+            <span className="text-3xl sm:text-4xl">🎬</span>
+            IPAgent
+          </Link>
+          <div className="flex items-center gap-1 sm:gap-4">
+            <Link
+              href="/"
+              className="px-2 sm:px-4 py-1 sm:py-2 text-sm sm:text-base bg-white/10 hover:bg-white/20 text-white rounded-lg transition-colors flex items-center gap-1"
+            >
+              <ArrowLeft className="h-4 w-4" />
+              返回创作
+            </Link>
+            <span className="px-2 sm:px-4 py-1 sm:py-2 text-sm sm:text-base bg-gradient-to-r from-purple-600 to-pink-600 text-white rounded-lg font-semibold">
+              发布管理
+            </span>
+            <AccountSettingsDropdown />
+          </div>
+        </div>
+      </header>
+
+      <main className="max-w-6xl mx-auto px-6 py-8">
+        <div className="grid grid-cols-1 lg:grid-cols-2 gap-8">
+          {/* Left: account management */}
+          <div className="space-y-6">
+            <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
+              <h2 className="text-lg font-semibold text-white mb-4 flex items-center gap-2">
+                👤 平台账号
+              </h2>
+
+              <div className="space-y-3">
+                {accounts.map((account) => (
+                  <div
+                    key={account.platform}
+                    className="flex items-center justify-between p-4 bg-black/30 rounded-xl"
+                  >
+                    <div className="flex items-center gap-3">
+                      <span className="text-2xl">
+                        {platformIcons[account.platform]}
+                      </span>
+                      <div>
+                        <div className="text-white font-medium">
+                          {account.name}
+                        </div>
+                        <div
+                          className={`text-sm ${account.logged_in
+                            ? "text-green-400"
+                            : "text-gray-500"
+                            }`}
+                        >
+                          {account.logged_in ? "✓ 已登录" : "未登录"}
+                        </div>
+                      </div>
+                    </div>
+                    <div className="flex gap-2">
+                      {account.logged_in ? (
+                        <>
+                          <button
+                            onClick={() => handleLogin(account.platform)}
+                            className="px-3 py-1 bg-white/10 hover:bg-white/20 text-white text-sm rounded-lg transition-colors flex items-center gap-1"
+                          >
+                            <RotateCcw className="h-3.5 w-3.5" />
+                            重新登录
+                          </button>
+                          <button
+                            onClick={() => handleLogout(account.platform)}
+                            className="px-3 py-1 bg-red-500/80 hover:bg-red-600 text-white text-sm rounded-lg transition-colors flex items-center gap-1"
+                          >
+                            <LogOut className="h-3.5 w-3.5" />
+                            注销
+                          </button>
+                        </>
+                      ) : (
+                        <button
+                          onClick={() => handleLogin(account.platform)}
+                          className="px-3 py-1 bg-purple-500/80 hover:bg-purple-600 text-white text-sm rounded-lg transition-colors flex items-center gap-1"
+                        >
+                          <QrCode className="h-3.5 w-3.5" />
+                          登录
+                        </button>
+                      )}
+                    </div>
+                  </div>
+                ))}
+              </div>
+            </div>
+          </div>
+
+          {/* Right: publish settings */}
+          <div className="space-y-6">
+            {/* Select video */}
+            <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
+              <h2 className="text-lg font-semibold text-white mb-4">📹 选择发布作品</h2>
+
+              <div className="flex items-center gap-3 mb-4">
+                <Search className="text-gray-400 w-4 h-4" />
+                <input
+                  type="text"
+                  value={videoFilter}
+                  onChange={(e) => setVideoFilter(e.target.value)}
+                  placeholder="搜索视频名称..."
+                  className="flex-1 bg-black/30 border border-white/10 rounded-lg px-3 py-2 text-sm text-white placeholder-gray-500 focus:outline-none focus:border-purple-500"
+                />
+              </div>
+
+              {filteredVideos.length === 0 ? (
+                <div className="text-center py-8 text-gray-400">
+                  暂无可发布的视频
+                </div>
+              ) : (
+                <div className="space-y-2 max-h-64 overflow-y-auto hide-scrollbar" style={{ contentVisibility: "auto" }}>
+                  {filteredVideos.map((v) => (
+                    <div
+                      key={v.path}
+                      onClick={() => setSelectedVideo(v.path)}
+                      className={`p-3 rounded-lg border transition-all flex items-center justify-between group cursor-pointer ${selectedVideo === v.path
+                        ? "border-purple-500 bg-purple-500/20"
+                        : "border-white/10 bg-white/5 hover:border-white/30"
+                        }`}
+                    >
+                      <div className="flex flex-col">
+                        <span className="text-sm text-white">{v.name}</span>
+                      </div>
+                      <div className="flex items-center gap-2 pl-2">
+                        <button
+                          onClick={(e) => {
+                            e.stopPropagation();
+                            handlePreviewVideo(v.path);
+                          }}
+                          className="p-1 text-gray-500 hover:text-purple-400 transition-colors"
+                          title="预览"
+                        >
+                          <Eye className="h-4 w-4" />
+                        </button>
+                        {selectedVideo === v.path && (
+                          <span className="text-xs text-purple-300">已选</span>
+                        )}
+                      </div>
+                    </div>
+                  ))}
+                </div>
+              )}
+            </div>
+
+            {/* Publish details */}
+            <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
+              <h2 className="text-lg font-semibold text-white mb-4">✍️ 发布信息</h2>
+
+              <div className="space-y-4">
+                <div>
+                  <label className="block text-gray-400 text-sm mb-2">
+                    标题
+                  </label>
+                  <input
+                    type="text"
+                    value={title}
+                    onChange={(e) => titleInput.handleChange(e.target.value)}
+                    onCompositionStart={titleInput.handleCompositionStart}
+                    onCompositionEnd={(e) => titleInput.handleCompositionEnd(e.currentTarget.value)}
+                    placeholder="输入视频标题..."
+                    className="w-full p-3 bg-black/30 border border-white/10 rounded-xl text-white placeholder-gray-500"
+                  />
+                </div>
+                <div>
+                  <label className="block text-gray-400 text-sm mb-2">
+                    标签 (用逗号分隔)
+                  </label>
+                  <input
+                    type="text"
+                    value={tags}
+                    onChange={(e) => setTags(e.target.value)}
+                    placeholder="AI, 数字人, 口播..."
+                    className="w-full p-3 bg-black/30 border border-white/10 rounded-xl text-white placeholder-gray-500"
+                  />
+                </div>
+              </div>
+            </div>
+
+            {/* Select platforms */}
+            <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
+              <h2 className="text-lg font-semibold text-white mb-4">📱 选择发布平台</h2>
+
+              <div className="grid grid-cols-3 gap-3">
+                {accounts
+                  .filter((a) => a.logged_in)
+                  .map((account) => (
+                    <button
+                      key={account.platform}
+                      onClick={() => togglePlatform(account.platform)}
+                      className={`p-3 rounded-xl border-2 transition-all ${selectedPlatforms.includes(account.platform)
+                        ? "border-purple-500 bg-purple-500/20"
+                        : "border-white/10 bg-white/5 hover:border-white/30"
+                        }`}
+                    >
+                      <span className="text-2xl block mb-1">
+                        {platformIcons[account.platform]}
+                      </span>
+                      <span className="text-white text-sm">{account.name}</span>
+                    </button>
+                  ))}
+              </div>
+            </div>
+
+            {/* Scheduled publishing */}
+            <div className="bg-white/5 rounded-2xl p-6 border border-white/10 backdrop-blur-sm">
+              <h2 className="text-lg font-semibold text-white mb-4 flex items-center gap-2">
+                ⏰ 发布设置
+              </h2>
+
+              <div className="space-y-4">
+                <div className="flex items-center gap-3">
+                  <button
+                    onClick={() => setScheduleMode("now")}
+                    className={`flex-1 p-3 rounded-xl border-2 transition-all ${scheduleMode === "now"
+                      ? "border-purple-500 bg-purple-500/20"
+                      : "border-white/10 bg-white/5 hover:border-white/30"
+                      }`}
+                  >
+                    <Rocket className="h-5 w-5 mx-auto mb-1" />
+                    <span className="text-white text-sm">立即发布</span>
+                  </button>
+                  <button
+                    onClick={() => setScheduleMode("scheduled")}
+                    className={`flex-1 p-3 rounded-xl border-2 transition-all ${scheduleMode === "scheduled"
+                      ? "border-purple-500 bg-purple-500/20"
+                      : "border-white/10 bg-white/5 hover:border-white/30"
+                      }`}
+                  >
+                    <Clock className="h-5 w-5 mx-auto mb-1" />
+                    <span className="text-white text-sm">定时发布</span>
+                  </button>
+                </div>
+
+                {scheduleMode === "scheduled" && (
+                  <input
+                    type="datetime-local"
+                    value={publishTime}
+                    onChange={(e) => setPublishTime(e.target.value)}
+                    className="w-full p-3 bg-black/30 border border-white/10 rounded-xl text-white"
+                  />
+                )}
+              </div>
+            </div>
+
+            {/* Publish button */}
+            <button
+              onClick={handlePublish}
+              disabled={isPublishing || (scheduleMode === "scheduled" && !publishTime)}
+              className="w-full py-4 bg-gradient-to-r from-purple-600 to-pink-600 text-white rounded-xl font-bold text-lg hover:shadow-lg hover:from-purple-500 hover:to-pink-500 transition-all disabled:opacity-50 disabled:cursor-not-allowed"
+            >
+              {isPublishing
+                ? "正在发布..."
+                : scheduleMode === "scheduled"
+                  ? "定时发布"
+                  : "立即发布"}
+            </button>
+
+            {/* Publish results */}
+            {publishResults.length > 0 && (
+              <div className="space-y-3">
+                {publishResults.map((result, index) => (
+                  <div
+                    key={index}
+                    className={`p-4 rounded-xl border ${result.success
+                      ? "border-green-500/50 bg-green-500/10"
+                      : "border-red-500/50 bg-red-500/10"
+                      }`}
+                  >
+                    <div className="flex items-center gap-2 mb-1">
+                      <span className="text-lg">
+                        {platformIcons[result.platform]}
+                      </span>
+                      <span className={`font-medium ${result.success ? "text-green-400" : "text-red-400"}`}>
+                        {result.success ? "发布成功" : "发布失败"}
+                      </span>
+                    </div>
+                    <p className="text-sm text-gray-300">{result.message}</p>
+                  </div>
+                ))}
+              </div>
+            )}
+          </div>
+        </div>
+      </main>
+    </div>
+  );
+}
--- a/frontend/src/middleware.ts
+++ b/frontend/src/middleware.ts
@@ -7,7 +7,7 @@ const protectedPaths = ['/', '/publish', '/admin'];
 // Public paths (no login required)
 const publicPaths = ['/login', '/register'];

-export function middleware(request: NextRequest) {
+export function proxy(request: NextRequest) {
    const { pathname } = request.nextUrl;

    // Check for an access_token cookie
--- a/frontend/src/shared/api/axios.ts
+++ b/frontend/src/shared/api/axios.ts
@@ -0,0 +1,54 @@
+/**
+ * Axios instance configuration
+ * Intercepts 401/403 responses globally and redirects to the login page
+ */
+import axios from 'axios';
+
+// Resolve the API base URL: localhost on the server, the current origin (relative URLs) on the client
+const API_BASE = typeof window === 'undefined'
+    ? 'http://localhost:8006'
+    : '';
+
+// Guard against repeated redirects
+let isRedirecting = false;
+
+const PUBLIC_PATHS = new Set(['/login', '/register']);
+
+// Create the axios instance
+const api = axios.create({
+    baseURL: API_BASE,
+    withCredentials: true, // send cookies automatically
+    headers: {
+        'Content-Type': 'application/json',
+    },
+});
+
+// Response interceptor: handle 401/403 globally
+api.interceptors.response.use(
+    (response) => response,
+    async (error) => {
+        const status = error.response?.status;
+
+        const isPublicPath = typeof window !== 'undefined' && PUBLIC_PATHS.has(window.location.pathname);
+
+        if ((status === 401 || status === 403) && !isRedirecting && !isPublicPath) {
+            isRedirecting = true;
+
+            // Call the logout API to clear the HttpOnly cookie
+            try {
+                await fetch('/api/auth/logout', { method: 'POST' });
+            } catch (e) {
+                // Ignore errors; redirect regardless
+            }
+
+            // Redirect to the login page
+            if (typeof window !== 'undefined') {
+                window.location.replace('/login');
+            }
+        }
+
+        return Promise.reject(error);
+    }
+);
+
+export default api;
--- a/frontend/src/shared/hooks/useTitleInput.ts
+++ b/frontend/src/shared/hooks/useTitleInput.ts
@@ -0,0 +1,66 @@
+import { useCallback, useEffect, useRef } from "react";
+import { applyTitleLimit, TITLE_MAX_LENGTH } from "@/shared/lib/title";
+
+interface UseTitleInputOptions {
+  value: string;
+  onChange: (value: string) => void;
+  onCommit?: (value: string) => void;
+  maxLength?: number;
+}
+
+export const useTitleInput = ({
+  value,
+  onChange,
+  onCommit,
+  maxLength = TITLE_MAX_LENGTH,
+}: UseTitleInputOptions) => {
+  const isComposingRef = useRef(false);
+  const committedRef = useRef(value);
+
+  useEffect(() => {
+    if (isComposingRef.current) return;
+    committedRef.current = value;
+  }, [value]);
+
+  const commitValue = useCallback(
+    (nextValue: string) => {
+      committedRef.current = nextValue;
+      onChange(nextValue);
+      onCommit?.(nextValue);
+    },
+    [onChange, onCommit]
+  );
+
+  const handleChange = useCallback(
+    (nextValue: string) => {
+      if (isComposingRef.current) {
+        onChange(nextValue);
+        return;
+      }
+      const limited = applyTitleLimit(committedRef.current, nextValue, maxLength);
+      commitValue(limited);
+    },
+    [maxLength, onChange, commitValue]
+  );
+
+  const handleCompositionStart = useCallback(() => {
+    isComposingRef.current = true;
+  }, []);
+
+  const handleCompositionEnd = useCallback(
+    (nextValue: string) => {
+      isComposingRef.current = false;
+      const limited = applyTitleLimit(committedRef.current, nextValue, maxLength);
+      commitValue(limited);
+    },
+    [maxLength, commitValue]
+  );
+
+  return {
+    handleChange,
+    handleCompositionStart,
+    handleCompositionEnd,
+    commitValue,
+    maxLength,
+  };
+};
--- a/frontend/src/shared/lib/auth.ts
+++ b/frontend/src/shared/lib/auth.ts
@@ -8,10 +8,11 @@ const API_BASE = typeof window === 'undefined'

 export interface User {
    id: string;
-    email: string;
+    phone: string;
    username: string | null;
    role: string;
    is_active: boolean;
+    expires_at: string | null;
 }

 export interface AuthResponse {
@@ -23,12 +24,12 @@ export interface AuthResponse {
 /**
 * User registration
 */
-export async function register(email: string, password: string, username?: string): Promise<AuthResponse> {
+export async function register(phone: string, password: string, username?: string): Promise<AuthResponse> {
    const res = await fetch(`${API_BASE}/api/auth/register`, {
        method: 'POST',
        headers: { 'Content-Type': 'application/json' },
        credentials: 'include',
-        body: JSON.stringify({ email, password, username })
+        body: JSON.stringify({ phone, password, username })
    });
    return res.json();
 }
@@ -36,12 +37,12 @@ export async function register(email: string, password: string, username?: strin
 /**
 * User login
 */
-export async function login(email: string, password: string): Promise<AuthResponse> {
+export async function login(phone: string, password: string): Promise<AuthResponse> {
    const res = await fetch(`${API_BASE}/api/auth/login`, {
        method: 'POST',
        headers: { 'Content-Type': 'application/json' },
        credentials: 'include',
-        body: JSON.stringify({ email, password })
+        body: JSON.stringify({ phone, password })
    });
    return res.json();
 }
@@ -57,6 +58,19 @@ export async function logout(): Promise<AuthResponse> {
    return res.json();
 }

+/**
+ * Change password
+ */
+export async function changePassword(oldPassword: string, newPassword: string): Promise<AuthResponse> {
+    const res = await fetch(`${API_BASE}/api/auth/change-password`, {
+        method: 'POST',
+        headers: { 'Content-Type': 'application/json' },
+        credentials: 'include',
+        body: JSON.stringify({ old_password: oldPassword, new_password: newPassword })
+    });
+    return res.json();
+}
+
 /**
 * Get the current user
 */
--- a/frontend/src/shared/lib/media.ts
+++ b/frontend/src/shared/lib/media.ts
@@ -0,0 +1,61 @@
+const DEFAULT_API_BASE = process.env.NEXT_PUBLIC_API_BASE_URL || 'http://localhost:8006';
+
+export const getApiBaseUrl = () => {
+  return typeof window === 'undefined' ? DEFAULT_API_BASE : '';
+};
+
+export const isAbsoluteUrl = (url: string) => /^https?:\/\//i.test(url);
+
+export const joinBaseUrl = (base: string, path: string) => {
+  if (!base) return path;
+  if (!path.startsWith('/')) return `${base}/${path}`;
+  return `${base}${path}`;
+};
+
+export const resolveMediaUrl = (url?: string | null) => {
+  if (!url) return null;
+  if (isAbsoluteUrl(url)) return url;
+  return joinBaseUrl(getApiBaseUrl(), url);
+};
+
+export const encodePathSegments = (value: string) =>
+  value.split('/').map(encodeURIComponent).join('/');
+
+export const resolveAssetUrl = (assetPath?: string | null) => {
+  if (!assetPath) return null;
+  const encoded = encodePathSegments(assetPath);
+  return joinBaseUrl(getApiBaseUrl(), `/assets/${encoded}`);
+};
+
+export const resolveBgmUrl = (bgmId?: string | null) => {
+  if (!bgmId) return null;
+  return resolveAssetUrl(`bgm/${bgmId}`);
+};
+
+export const getFontFormat = (fontFile?: string) => {
+  if (!fontFile) return 'truetype';
+  const ext = fontFile.split('.').pop()?.toLowerCase();
+  if (ext === 'otf') return 'opentype';
+  return 'truetype';
+};
+
+export const buildTextShadow = (color: string, size: number) => {
+  return [
+    `-${size}px -${size}px 0 ${color}`,
+    `${size}px -${size}px 0 ${color}`,
+    `-${size}px ${size}px 0 ${color}`,
+    `${size}px ${size}px 0 ${color}`,
+    `0 0 ${size * 4}px rgba(0,0,0,0.9)`,
+    `0 4px 8px rgba(0,0,0,0.6)`
+  ].join(',');
+};
+
+export const formatDate = (timestamp: number) => {
+  const d = new Date(timestamp * 1000);
+  const year = d.getFullYear();
+  const month = String(d.getMonth() + 1).padStart(2, '0');
+  const day = String(d.getDate()).padStart(2, '0');
+  const hour = String(d.getHours()).padStart(2, '0');
+  const minute = String(d.getMinutes()).padStart(2, '0');
+  return `${year}/${month}/${day} ${hour}:${minute}`;
+};
--- a/frontend/src/shared/lib/title.ts
+++ b/frontend/src/shared/lib/title.ts
@@ -0,0 +1,14 @@
+export const TITLE_MAX_LENGTH = 15;
+
+export const clampTitle = (value: string, maxLength: number = TITLE_MAX_LENGTH) =>
+  value.slice(0, maxLength);
+
+export const applyTitleLimit = (
+  prev: string,
+  next: string,
+  maxLength: number = TITLE_MAX_LENGTH
+) => {
+  if (next.length <= maxLength) return next;
+  if (prev.length >= maxLength) return prev;
+  return next.slice(0, maxLength);
+};
--- a/models/Qwen3-TTS/.gitignore
+++ b/models/Qwen3-TTS/.gitignore
@@ -0,0 +1,24 @@
+__pycache__/
+*.py[cod]
+*$py.class
+*.so
+.Python
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+.idea/
+.vscode/
+venv/
+env/
--- a/models/Qwen3-TTS/LICENSE
+++ b/models/Qwen3-TTS/LICENSE
@@ -0,0 +1,201 @@
+                                 Apache License
+                           Version 2.0, January 2004
+                        http://www.apache.org/licenses/
+
+   TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
+
+   1. Definitions.
+
+      "License" shall mean the terms and conditions for use, reproduction,
+      and distribution as defined by Sections 1 through 9 of this document.
+
+      "Licensor" shall mean the copyright owner or entity authorized by
+      the copyright owner that is granting the License.
+
+      "Legal Entity" shall mean the union of the acting entity and all
+      other entities that control, are controlled by, or are under common
+      control with that entity. For the purposes of this definition,
+      "control" means (i) the power, direct or indirect, to cause the
+      direction or management of such entity, whether by contract or
+      otherwise, or (ii) ownership of fifty percent (50%) or more of the
+      outstanding shares, or (iii) beneficial ownership of such entity.
+
+      "You" (or "Your") shall mean an individual or Legal Entity
+      exercising permissions granted by this License.
+
+      "Source" form shall mean the preferred form for making modifications,
+      including but not limited to software source code, documentation
+      source, and configuration files.
+
+      "Object" form shall mean any form resulting from mechanical
+      transformation or translation of a Source form, including but
+      not limited to compiled object code, generated documentation,
+      and conversions to other media types.
+
+      "Work" shall mean the work of authorship, whether in Source or
+      Object form, made available under the License, as indicated by a
+      copyright notice that is included in or attached to the work
+      (an example is provided in the Appendix below).
+
+      "Derivative Works" shall mean any work, whether in Source or Object
+      form, that is based on (or derived from) the Work and for which the
+      editorial revisions, annotations, elaborations, or other modifications
+      represent, as a whole, an original work of authorship. For the purposes
+      of this License, Derivative Works shall not include works that remain
+      separable from, or merely link (or bind by name) to the interfaces of,
+      the Work and Derivative Works thereof.
+
+      "Contribution" shall mean any work of authorship, including
+      the original version of the Work and any modifications or additions
+      to that Work or Derivative Works thereof, that is intentionally
+      submitted to Licensor for inclusion in the Work by the copyright owner
+      or by an individual or Legal Entity authorized to submit on behalf of
+      the copyright owner. For the purposes of this definition, "submitted"
+      means any form of electronic, verbal, or written communication sent
+      to the Licensor or its representatives, including but not limited to
+      communication on electronic mailing lists, source code control systems,
+      and issue tracking systems that are managed by, or on behalf of, the
+      Licensor for the purpose of discussing and improving the Work, but
+      excluding communication that is conspicuously marked or otherwise
+      designated in writing by the copyright owner as "Not a Contribution."
+
+      "Contributor" shall mean Licensor and any individual or Legal Entity
+      on behalf of whom a Contribution has been received by Licensor and
+      subsequently incorporated within the Work.
+
+   2. Grant of Copyright License. Subject to the terms and conditions of
+      this License, each Contributor hereby grants to You a perpetual,
+      worldwide, non-exclusive, no-charge, royalty-free, irrevocable
+      copyright license to reproduce, prepare Derivative Works of,
+      publicly display, publicly perform, sublicense, and distribute the
+      Work and such Derivative Works in Source or Object form.
+
+   3. Grant of Patent License. Subject to the terms and conditions of
+      this License, each Contributor hereby grants to You a perpetual,
+      worldwide, non-exclusive, no-charge, royalty-free, irrevocable
+      (except as stated in this section) patent license to make, have made,
+      use, offer to sell, sell, import, and otherwise transfer the Work,
+      where such license applies only to those patent claims licensable
+      by such Contributor that are necessarily infringed by their
+      Contribution(s) alone or by combination of their Contribution(s)
+      with the Work to which such Contribution(s) was submitted. If You
+      institute patent litigation against any entity (including a
+      cross-claim or counterclaim in a lawsuit) alleging that the Work
+      or a Contribution incorporated within the Work constitutes direct
+      or contributory patent infringement, then any patent licenses
+      granted to You under this License for that Work shall terminate
+      as of the date such litigation is filed.
+
+   4. Redistribution. You may reproduce and distribute copies of the
+      Work or Derivative Works thereof in any medium, with or without
+      modifications, and in Source or Object form, provided that You
+      meet the following conditions:
+
+      (a) You must give any other recipients of the Work or
+          Derivative Works a copy of this License; and
+
+      (b) You must cause any modified files to carry prominent notices
+          stating that You changed the files; and
+
+      (c) You must retain, in the Source form of any Derivative Works
+          that You distribute, all copyright, patent, trademark, and
+          attribution notices from the Source form of the Work,
+          excluding those notices that do not pertain to any part of
+          the Derivative Works; and
+
+      (d) If the Work includes a "NOTICE" text file as part of its
+          distribution, then any Derivative Works that You distribute must
+          include a readable copy of the attribution notices contained
+          within such NOTICE file, excluding those notices that do not
+          pertain to any part of the Derivative Works, in at least one
+          of the following places: within a NOTICE text file distributed
+          as part of the Derivative Works; within the Source form or
+          documentation, if provided along with the Derivative Works; or,
+          within a display generated by the Derivative Works, if and
+          wherever such third-party notices normally appear. The contents
+          of the NOTICE file are for informational purposes only and
+          do not modify the License. You may add Your own attribution
+          notices within Derivative Works that You distribute, alongside
+          or as an addendum to the NOTICE text from the Work, provided
+          that such additional attribution notices cannot be construed
+          as modifying the License.
+
+      You may add Your own copyright statement to Your modifications and
+      may provide additional or different license terms and conditions
+      for use, reproduction, or distribution of Your modifications, or
+      for any such Derivative Works as a whole, provided Your use,
+      reproduction, and distribution of the Work otherwise complies with
+      the conditions stated in this License.
+
+   5. Submission of Contributions. Unless You explicitly state otherwise,
+      any Contribution intentionally submitted for inclusion in the Work
+      by You to the Licensor shall be under the terms and conditions of
+      this License, without any additional terms or conditions.
+      Notwithstanding the above, nothing herein shall supersede or modify
+      the terms of any separate license agreement you may have executed
+      with Licensor regarding such Contributions.
+
+   6. Trademarks. This License does not grant permission to use the trade
+      names, trademarks, service marks, or product names of the Licensor,
+      except as required for reasonable and customary use in describing the
+      origin of the Work and reproducing the content of the NOTICE file.
+
+   7. Disclaimer of Warranty. Unless required by applicable law or
+      agreed to in writing, Licensor provides the Work (and each
+      Contributor provides its Contributions) on an "AS IS" BASIS,
+      WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
+      implied, including, without limitation, any warranties or conditions
+      of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
+      PARTICULAR PURPOSE. You are solely responsible for determining the
+      appropriateness of using or redistributing the Work and assume any
+      risks associated with Your exercise of permissions under this License.
+
+   8. Limitation of Liability. In no event and under no legal theory,
+      whether in tort (including negligence), contract, or otherwise,
+      unless required by applicable law (such as deliberate and grossly
+      negligent acts) or agreed to in writing, shall any Contributor be
+      liable to You for damages, including any direct, indirect, special,
+      incidental, or consequential damages of any character arising as a
+      result of this License or out of the use or inability to use the
+      Work (including but not limited to damages for loss of goodwill,
+      work stoppage, computer failure or malfunction, or any and all
+      other commercial damages or losses), even if such Contributor
+      has been advised of the possibility of such damages.
+
+   9. Accepting Warranty or Additional Liability. While redistributing
+      the Work or Derivative Works thereof, You may choose to offer,
+      and charge a fee for, acceptance of support, warranty, indemnity,
+      or other liability obligations and/or rights consistent with this
+      License. However, in accepting such obligations, You may act only
+      on Your own behalf and on Your sole responsibility, not on behalf
+      of any other Contributor, and only if You agree to indemnify,
+      defend, and hold each Contributor harmless for any liability
+      incurred by, or claims asserted against, such Contributor by reason
+      of your accepting any such warranty or additional liability.
+
+   END OF TERMS AND CONDITIONS
+
+   APPENDIX: How to apply the Apache License to your work.
+
+      To apply the Apache License to your work, attach the following
+      boilerplate notice, with the fields enclosed by brackets "[]"
+      replaced with your own identifying information. (Don't include
+      the brackets!)  The text should be enclosed in the appropriate
+      comment syntax for the file format. We also recommend that a
+      file or class name and description of purpose be included on the
+      same "printed page" as the copyright notice for easier
+      identification within third-party archives.
+
+   Copyright 2026 Alibaba Cloud
+
+   Licensed under the Apache License, Version 2.0 (the "License");
+   you may not use this file except in compliance with the License.
+   You may obtain a copy of the License at
+
+       http://www.apache.org/licenses/LICENSE-2.0
+
+   Unless required by applicable law or agreed to in writing, software
+   distributed under the License is distributed on an "AS IS" BASIS,
+   WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+   See the License for the specific language governing permissions and
+   limitations under the License.
--- a/models/Qwen3-TTS/MANIFEST.in
+++ b/models/Qwen3-TTS/MANIFEST.in
@@ -0,0 +1,13 @@
+global-exclude *
+
+recursive-include qwen_tts *.py *.pyi py.typed
+recursive-include qwen_tts *.npz
+
+include LICENSE
+include MANIFEST.in
+include pyproject.toml
+
+prune assets
+prune examples
+prune finetuning
+prune qwen_tts.egg-info
--- a/models/Qwen3-TTS/README.md
+++ b/models/Qwen3-TTS/README.md
--- a/models/Qwen3-TTS/finetuning/README.md
+++ b/models/Qwen3-TTS/finetuning/README.md
@@ -0,0 +1,121 @@
+## Fine Tuning Qwen3-TTS-12Hz-1.7B/0.6B-Base
+
+The Qwen3-TTS-12Hz-1.7B/0.6B-Base model series currently supports single-speaker fine-tuning. Run `pip install qwen-tts` first, then clone the repository and enter the `finetuning` directory:
+
+```
+git clone https://github.com/QwenLM/Qwen3-TTS.git
+cd Qwen3-TTS/finetuning
+```
+
+Then follow the steps below to complete the entire fine-tuning workflow. Multi-speaker fine-tuning and other advanced fine-tuning features will be supported in future releases.
+
+### 1) Input JSONL format
+
+Prepare your training file as JSONL (one JSON object per line); the steps below assume it is named `train_raw.jsonl`. Each line must contain:
+
+- `audio`: path to the target training audio (wav)
+- `text`: transcript corresponding to `audio`
+- `ref_audio`: path to the reference speaker audio (wav)
+
+Example:
+```jsonl
+{"audio":"./data/utt0001.wav","text":"其实我真的有发现，我是一个特别善于观察别人情绪的人。","ref_audio":"./data/ref.wav"}
+{"audio":"./data/utt0002.wav","text":"She said she would be here by noon.","ref_audio":"./data/ref.wav"}
+```
+
+`ref_audio` recommendation:
+- Strongly recommended: use the same `ref_audio` for all samples.
+- Keeping `ref_audio` identical across the dataset usually improves speaker consistency and stability during generation.
+
+
+### 2) Prepare data (extract `audio_codes`)
+
+Convert `train_raw.jsonl` into a training JSONL that includes `audio_codes`:
+
+```bash
+python prepare_data.py \
+  --device cuda:0 \
+  --tokenizer_model_path Qwen/Qwen3-TTS-Tokenizer-12Hz \
+  --input_jsonl train_raw.jsonl \
+  --output_jsonl train_with_codes.jsonl
+```
+
+
+### 3) Fine-tune
+
+Run SFT using the prepared JSONL:
+
+```bash
+python sft_12hz.py \
+  --init_model_path Qwen/Qwen3-TTS-12Hz-1.7B-Base \
+  --output_model_path output \
+  --train_jsonl train_with_codes.jsonl \
+  --batch_size 2 \
+  --lr 2e-5 \
+  --num_epochs 3 \
+  --speaker_name speaker_test
+```
+
+Checkpoints will be written to:
+- `output/checkpoint-epoch-0`
+- `output/checkpoint-epoch-1`
+- `output/checkpoint-epoch-2`
+- ...
+
+
+### 4) Quick inference test
+
+```python
+import torch
+import soundfile as sf
+from qwen_tts import Qwen3TTSModel
+
+device = "cuda:0"
+tts = Qwen3TTSModel.from_pretrained(
+    "output/checkpoint-epoch-2",
+    device_map=device,
+    dtype=torch.bfloat16,
+    attn_implementation="flash_attention_2",
+)
+
+wavs, sr = tts.generate_custom_voice(
+    text="She said she would be here by noon.",
+    speaker="speaker_test",
+)
+sf.write("output.wav", wavs[0], sr)
+```
+
+### One-click shell script example
+
+```bash
+#!/usr/bin/env bash
+set -e
+
+DEVICE="cuda:0"
+TOKENIZER_MODEL_PATH="Qwen/Qwen3-TTS-Tokenizer-12Hz"
+INIT_MODEL_PATH="Qwen/Qwen3-TTS-12Hz-1.7B-Base"
+
+RAW_JSONL="train_raw.jsonl"
+TRAIN_JSONL="train_with_codes.jsonl"
+OUTPUT_DIR="output"
+
+BATCH_SIZE=2
+LR=2e-5
+EPOCHS=3
+SPEAKER_NAME="speaker_test"
+
+python prepare_data.py \
+  --device "${DEVICE}" \
+  --tokenizer_model_path "${TOKENIZER_MODEL_PATH}" \
+  --input_jsonl "${RAW_JSONL}" \
+  --output_jsonl "${TRAIN_JSONL}"
+
+python sft_12hz.py \
+  --init_model_path "${INIT_MODEL_PATH}" \
+  --output_model_path "${OUTPUT_DIR}" \
+  --train_jsonl "${TRAIN_JSONL}" \
+  --batch_size "${BATCH_SIZE}" \
+  --lr "${LR}" \
+  --num_epochs "${EPOCHS}" \
+  --speaker_name "${SPEAKER_NAME}"
+```
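
Reviewer sketch (not part of this commit): step 1 of the fine-tuning README above requires every JSONL line to carry `audio`, `text`, and `ref_audio`. A pre-flight check run before `prepare_data.py` might look like the following. The names `check_jsonl_lines` and `REQUIRED_KEYS` are hypothetical and do not exist in the Qwen3-TTS repository; the check only inspects keys, and does not open the audio files or verify their sample rate.

```python
import json

# Keys that the fine-tuning README says each training record must contain.
REQUIRED_KEYS = ("audio", "text", "ref_audio")


def check_jsonl_lines(lines):
    """Return a list of (line_number, problem) tuples for malformed records.

    `lines` is any iterable of strings, e.g. an open file handle.
    Blank lines are skipped; line numbers are 1-based.
    """
    problems = []
    for lineno, raw in enumerate(lines, start=1):
        raw = raw.strip()
        if not raw:
            continue
        try:
            record = json.loads(raw)
        except json.JSONDecodeError as exc:
            problems.append((lineno, f"invalid JSON: {exc.msg}"))
            continue
        if not isinstance(record, dict):
            problems.append((lineno, "not a JSON object"))
            continue
        missing = [key for key in REQUIRED_KEYS if key not in record]
        if missing:
            problems.append((lineno, "missing keys: " + ", ".join(missing)))
    return problems
```

Typical use would be `with open("train_raw.jsonl", encoding="utf-8") as f: print(check_jsonl_lines(f))`, where an empty list means every record has the three required keys.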
--- a/models/Qwen3-TTS/finetuning/dataset.py
+++ b/models/Qwen3-TTS/finetuning/dataset.py
@@ -0,0 +1,218 @@
+# coding=utf-8
+# Copyright 2026 The Alibaba Qwen team.
+# SPDX-License-Identifier: Apache-2.0
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import Any, List, Tuple, Union
+
+import librosa
+import numpy as np
+import torch
+from qwen_tts.core.models.configuration_qwen3_tts import Qwen3TTSConfig
+from qwen_tts.core.models.modeling_qwen3_tts import mel_spectrogram
+from torch.utils.data import Dataset
+
+AudioLike = Union[
+    str,                     # wav path, URL, base64
+    np.ndarray,              # waveform (requires sr)
+    Tuple[np.ndarray, int],  # (waveform, sr)
+]
+
+MaybeList = Union[Any, List[Any]]
+
+class TTSDataset(Dataset):
+    def __init__(self, data_list, processor, config:Qwen3TTSConfig, lag_num = -1):
+        self.data_list = data_list
+        self.processor = processor
+        self.lag_num = lag_num
+        self.config = config
+
+    def __len__(self):
+        return len(self.data_list)
+    
+    def _load_audio_to_np(self, x: str) -> Tuple[np.ndarray, int]:
+        
+        audio, sr = librosa.load(x, sr=None, mono=True)
+
+        if audio.ndim > 1:
+            audio = np.mean(audio, axis=-1)
+
+        return audio.astype(np.float32), int(sr)
+
+    def _normalize_audio_inputs(self, audios: Union[AudioLike, List[AudioLike]]) -> List[Tuple[np.ndarray, int]]:
+        """
+        Normalize audio inputs into a list of (waveform, sr).
+
+        Supported forms:
+          - str: path to an audio file (loaded with `librosa.load`)
+          - np.ndarray: waveform (NOT allowed alone here because sr is unknown)
+          - (np.ndarray, sr): waveform + sampling rate
+          - list of the above
+
+        Args:
+            audios:
+                Audio input(s).
+
+        Returns:
+            List[Tuple[np.ndarray, int]]:
+                List of (float32 waveform, original sr).
+
+        Raises:
+            ValueError: If a numpy waveform is provided without sr.
+        """
+        if isinstance(audios, list):
+            items = audios
+        else:
+            items = [audios]
+
+        out: List[Tuple[np.ndarray, int]] = []
+        for a in items:
+            if isinstance(a, str):
+                out.append(self._load_audio_to_np(a))
+            elif isinstance(a, tuple) and len(a) == 2 and isinstance(a[0], np.ndarray):
+                out.append((a[0].astype(np.float32), int(a[1])))
+            elif isinstance(a, np.ndarray):
+                raise ValueError("For numpy waveform input, pass a tuple (audio, sr).")
+            else:
+                raise TypeError(f"Unsupported audio input type: {type(a)}")
+        return out
+
+    
+    def _build_assistant_text(self, text: str) -> str:
+        return f"<|im_start|>assistant\n{text}<|im_end|>\n<|im_start|>assistant\n"
+    
+    def _ensure_list(self, x: MaybeList) -> List[Any]:
+        return x if isinstance(x, list) else [x]
+    
+    def _tokenize_texts(self, text) -> torch.Tensor:
+        encoded = self.processor(text=text, return_tensors="pt", padding=True)
+        input_id = encoded["input_ids"]
+        input_id = input_id.unsqueeze(0) if input_id.dim() == 1 else input_id
+        return input_id
+    
+    @torch.inference_mode()
+    def extract_mels(self, audio, sr):
+        assert sr == 24000, "Only support 24kHz audio"
+        mels = mel_spectrogram(
+            torch.from_numpy(audio).unsqueeze(0), 
+            n_fft=1024, 
+            num_mels=128, 
+            sampling_rate=24000,
+            hop_size=256, 
+            win_size=1024, 
+            fmin=0, 
+            fmax=12000
+        ).transpose(1, 2)
+        return mels
+
+
+
+    def __getitem__(self, idx):
+        item = self.data_list[idx]
+
+        audio_path  = item["audio"]
+        text        = item["text"]
+        audio_codes = item["audio_codes"]
+        language        = item.get('language','Auto')
+        ref_audio_path  = item['ref_audio']
+
+        text = self._build_assistant_text(text)
+        text_ids = self._tokenize_texts(text)
+
+        audio_codes = torch.tensor(audio_codes, dtype=torch.long)
+
+        ref_audio_list = self._ensure_list(ref_audio_path)
+        normalized = self._normalize_audio_inputs(ref_audio_list)
+        wav,sr = normalized[0]
+
+        ref_mel = self.extract_mels(audio=wav, sr=sr)
+
+        return {
+            "text_ids": text_ids[:,:-5],    # 1 , t
+            "audio_codes":audio_codes,      # t, 16
+            "ref_mel":ref_mel
+        }
+        
+    def collate_fn(self, batch):
+        assert self.lag_num == -1
+
+        item_length = [b['text_ids'].shape[1] + b['audio_codes'].shape[0] for b in batch]
+        max_length = max(item_length) + 8
+        b,t = len(batch),max_length
+
+        input_ids   = torch.zeros((b,t,2),dtype=torch.long)
+        codec_ids   = torch.zeros((b,t,16),dtype=torch.long)
+        text_embedding_mask     = torch.zeros((b,t),dtype=torch.bool)
+        codec_embedding_mask    = torch.zeros((b,t),dtype=torch.bool)
+        codec_mask      = torch.zeros((b,t),dtype=torch.bool)
+        attention_mask  = torch.zeros((b,t),dtype=torch.long)
+        codec_0_labels  = torch.full((b, t), -100, dtype=torch.long)
+
+        for i,data in enumerate(batch):
+            text_ids        = data['text_ids']
+            audio_codec_0   = data['audio_codes'][:,0]
+            audio_codecs    = data['audio_codes']
+
+            text_ids_len = text_ids.shape[1]
+            codec_ids_len = audio_codec_0.shape[0]
+            
+            # text channel
+            input_ids[i,  :3, 0] = text_ids[0,:3]
+            input_ids[i, 3:7, 0] = self.config.tts_pad_token_id
+            input_ids[i,   7, 0] = self.config.tts_bos_token_id
+            input_ids[i, 8:8+text_ids_len-3, 0] = text_ids[0,3:]
+            input_ids[i,   8+text_ids_len-3, 0] = self.config.tts_eos_token_id
+            input_ids[i, 8+text_ids_len-2:8+text_ids_len+codec_ids_len , 0] = self.config.tts_pad_token_id
+            text_embedding_mask[i,  :8+text_ids_len+codec_ids_len] = True
+
+            # codec channel
+            # input_ids[i,   :3, 1] = 0
+            input_ids[i,    3:8 ,1] = torch.tensor(
+                                        [
+                                            self.config.talker_config.codec_nothink_id,
+                                            self.config.talker_config.codec_think_bos_id,
+                                            self.config.talker_config.codec_think_eos_id,
+                                            0,     # for speaker embedding
+                                            self.config.talker_config.codec_pad_id       
+                                        ]
+                                    )
+            input_ids[i,    8:8+text_ids_len-3  ,1] = self.config.talker_config.codec_pad_id
+            input_ids[i,    8+text_ids_len-3    ,1] = self.config.talker_config.codec_pad_id
+            input_ids[i,    8+text_ids_len-2    ,1] = self.config.talker_config.codec_bos_id
+            input_ids[i,    8+text_ids_len-1:8+text_ids_len-1+codec_ids_len,    1] = audio_codec_0
+            input_ids[i,    8+text_ids_len-1+codec_ids_len,    1] = self.config.talker_config.codec_eos_token_id
+
+            codec_0_labels[i,    8+text_ids_len-1:8+text_ids_len-1+codec_ids_len] = audio_codec_0
+            codec_0_labels[i,    8+text_ids_len-1+codec_ids_len] = self.config.talker_config.codec_eos_token_id
+
+            codec_ids[i, 8+text_ids_len-1:8+text_ids_len-1+codec_ids_len,:] = audio_codecs
+
+            codec_embedding_mask[i, 3:8+text_ids_len+codec_ids_len] = True
+            codec_embedding_mask[i, 6] = False       # for speaker embedding
+
+            codec_mask[i,   8+text_ids_len-1:8+text_ids_len-1+codec_ids_len] = True
+            attention_mask[i, :8+text_ids_len+codec_ids_len] = True
+        
+        ref_mels = [data['ref_mel'] for data in batch]
+        ref_mels = torch.cat(ref_mels,dim=0)
+
+        return {
+            'input_ids':input_ids,
+            'ref_mels':ref_mels,
+            'attention_mask':attention_mask,
+            'text_embedding_mask':text_embedding_mask.unsqueeze(-1),
+            'codec_embedding_mask':codec_embedding_mask.unsqueeze(-1),
+            'codec_0_labels':codec_0_labels,
+            'codec_ids': codec_ids,
+            'codec_mask':codec_mask
+        }
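Review note on `collate_fn`: the index arithmetic above is easier to check when the offsets are computed in one place. The following is a minimal sketch, not part of the commit. The helper name `sequence_layout` and its returned keys are hypothetical; the offsets are copied from the slicing expressions in `collate_fn`, where `text_len` corresponds to `text_ids.shape[1]` and `codec_len` to `audio_codes.shape[0]`.

```python
def sequence_layout(text_len: int, codec_len: int) -> dict:
    """Return the positions that collate_fn assigns for one sample.

    Hypothetical helper for review purposes; ranges are half-open (start, stop).
    """
    body = text_len - 3              # text tokens after the 3-token prefix
    codec_start = 8 + body + 2       # same as 8 + text_len - 1 in collate_fn
    return {
        "text_prefix": (0, 3),                            # text_ids[0, :3]
        "tts_bos": 7,                                     # tts_bos_token_id
        "text_body": (8, 8 + body),                       # text_ids[0, 3:]
        "tts_eos": 8 + body,                              # tts_eos_token_id
        "speaker_slot": 6,                                # filled with the speaker embedding
        "codec_bos": 8 + body + 1,                        # codec_bos_id
        "codec_frames": (codec_start, codec_start + codec_len),
        "codec_eos": codec_start + codec_len,             # codec_eos_token_id
        "total_len": 8 + text_len + codec_len,            # span covered by attention_mask
    }


if __name__ == "__main__":
    for name, value in sequence_layout(text_len=10, codec_len=5).items():
        print(f"{name:13s} {value}")
```

Under these offsets the codec EOS position is always `total_len - 1`, which matches the `max(item_length) + 8` padding length used for the batch.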
--- a/models/Qwen3-TTS/finetuning/prepare_data.py
+++ b/models/Qwen3-TTS/finetuning/prepare_data.py
@@ -0,0 +1,71 @@
+# coding=utf-8
+# Copyright 2026 The Alibaba Qwen team.
+# SPDX-License-Identifier: Apache-2.0
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+import argparse
+import json
+
+from qwen_tts import Qwen3TTSTokenizer
+
+BATCH_INFER_NUM = 32
+
+def main():
+    parser = argparse.ArgumentParser()
+    parser.add_argument("--device", type=str, default="cuda:0")
+    parser.add_argument("--tokenizer_model_path", type=str, default="Qwen/Qwen3-TTS-Tokenizer-12Hz")
+    parser.add_argument("--input_jsonl", type=str, required=True)
+    parser.add_argument("--output_jsonl", type=str, required=True)
+    args = parser.parse_args()
+
+    tokenizer_12hz = Qwen3TTSTokenizer.from_pretrained(
+        args.tokenizer_model_path,
+        device_map=args.device,
+    )
+
+    with open(args.input_jsonl, encoding='utf-8') as f:
+        total_lines = [json.loads(line) for line in f if line.strip()]
+
+    final_lines = []
+    batch_lines = []
+    batch_audios = []
+    for line in total_lines:
+
+        batch_lines.append(line)
+        batch_audios.append(line['audio'])
+
+        if len(batch_lines) >= BATCH_INFER_NUM:
+            enc_res = tokenizer_12hz.encode(batch_audios)
+            for code, line in zip(enc_res.audio_codes, batch_lines):
+                line['audio_codes'] = code.cpu().tolist()
+                final_lines.append(line)
+            batch_lines.clear()
+            batch_audios.clear()
+
+    if len(batch_audios) > 0:
+        enc_res = tokenizer_12hz.encode(batch_audios)
+        for code, line in zip(enc_res.audio_codes, batch_lines):
+            line['audio_codes'] = code.cpu().tolist()
+            final_lines.append(line)
+        batch_lines.clear()
+        batch_audios.clear()
+
+    final_lines = [json.dumps(line, ensure_ascii=False) for line in final_lines]
+
+    with open(args.output_jsonl, 'w', encoding='utf-8') as f:
+        for line in final_lines:
+            f.write(line + '\n')
+
+if __name__ == "__main__":
+    main()
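Review note on `prepare_data.py`: the main loop and the trailing flush repeat the same encode-and-append step. One possible way to remove the duplication is a small chunking generator. This is a sketch only; `iter_batches` is a hypothetical name that does not appear in the commit, and it makes no assumption about the tokenizer API.

```python
from typing import Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def iter_batches(items: Iterable[T], batch_size: int) -> Iterator[List[T]]:
    """Yield consecutive lists of at most batch_size items, including the last partial one."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    batch: List[T] = []
    for item in items:
        batch.append(item)
        if len(batch) >= batch_size:
            yield batch
            batch = []
    if batch:  # trailing partial batch
        yield batch


if __name__ == "__main__":
    for chunk in iter_batches(range(5), 2):
        print(chunk)
```

With such a helper, the script could loop over `iter_batches(total_lines, BATCH_INFER_NUM)` and call the encoder once per chunk, so the final partial batch needs no separate branch.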
--- a/models/Qwen3-TTS/finetuning/sft_12hz.py
+++ b/models/Qwen3-TTS/finetuning/sft_12hz.py
@@ -0,0 +1,161 @@
+# coding=utf-8
+# Copyright 2026 The Alibaba Qwen team.
+# SPDX-License-Identifier: Apache-2.0
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+import argparse
+import json
+import os
+import shutil
+
+import torch
+from accelerate import Accelerator
+from dataset import TTSDataset
+from qwen_tts.inference.qwen3_tts_model import Qwen3TTSModel
+from safetensors.torch import save_file
+from torch.optim import AdamW
+from torch.utils.data import DataLoader
+from transformers import AutoConfig
+
+target_speaker_embedding = None
+def train():
+    global target_speaker_embedding
+
+    parser = argparse.ArgumentParser()
+    parser.add_argument("--init_model_path", type=str, default="Qwen/Qwen3-TTS-12Hz-1.7B-Base")
+    parser.add_argument("--output_model_path", type=str, default="output")
+    parser.add_argument("--train_jsonl", type=str, required=True)
+    parser.add_argument("--batch_size", type=int, default=2)
+    parser.add_argument("--lr", type=float, default=2e-5)
+    parser.add_argument("--num_epochs", type=int, default=3)
+    parser.add_argument("--speaker_name", type=str, default="speaker_test")
+    args = parser.parse_args()
+
+    accelerator = Accelerator(gradient_accumulation_steps=4, mixed_precision="bf16", log_with="tensorboard")
+
+    MODEL_PATH = args.init_model_path
+
+    qwen3tts = Qwen3TTSModel.from_pretrained(
+        MODEL_PATH,
+        torch_dtype=torch.bfloat16,
+        attn_implementation="flash_attention_2",
+    )
+    config = AutoConfig.from_pretrained(MODEL_PATH)
+
+    with open(args.train_jsonl, encoding='utf-8') as f:
+        train_data = [json.loads(line) for line in f if line.strip()]
+    dataset = TTSDataset(train_data, qwen3tts.processor, config)
+    train_dataloader = DataLoader(dataset, batch_size=args.batch_size, shuffle=True, collate_fn=dataset.collate_fn)
+
+    optimizer = AdamW(qwen3tts.model.parameters(), lr=args.lr, weight_decay=0.01)
+
+    model, optimizer, train_dataloader = accelerator.prepare(
+        qwen3tts.model, optimizer, train_dataloader
+    )
+
+    num_epochs = args.num_epochs
+    model.train()
+
+    for epoch in range(num_epochs):
+        for step, batch in enumerate(train_dataloader):
+            with accelerator.accumulate(model):
+
+                input_ids = batch['input_ids']
+                codec_ids = batch['codec_ids']
+                ref_mels = batch['ref_mels']
+                text_embedding_mask = batch['text_embedding_mask']
+                codec_embedding_mask = batch['codec_embedding_mask']
+                attention_mask = batch['attention_mask']
+                codec_0_labels = batch['codec_0_labels']
+                codec_mask = batch['codec_mask']
+
+                speaker_embedding = model.speaker_encoder(ref_mels.to(model.device).to(model.dtype)).detach()
+                if target_speaker_embedding is None:
+                    target_speaker_embedding = speaker_embedding
+
+                input_text_ids = input_ids[:, :, 0]
+                input_codec_ids = input_ids[:, :, 1]
+
+                input_text_embedding = model.talker.model.text_embedding(input_text_ids) * text_embedding_mask
+                input_codec_embedding = model.talker.model.codec_embedding(input_codec_ids) * codec_embedding_mask
+                input_codec_embedding[:, 6, :] = speaker_embedding
+
+                input_embeddings = input_text_embedding + input_codec_embedding
+
+                for i in range(1, 16):
+                    codec_i_embedding = model.talker.code_predictor.get_input_embeddings()[i - 1](codec_ids[:, :, i])
+                    codec_i_embedding = codec_i_embedding * codec_mask.unsqueeze(-1)
+                    input_embeddings = input_embeddings + codec_i_embedding
+
+                outputs = model.talker(
+                    inputs_embeds=input_embeddings[:, :-1, :],
+                    attention_mask=attention_mask[:, :-1],
+                    labels=codec_0_labels[:, 1:],
+                    output_hidden_states=True
+                )
+
+                hidden_states = outputs.hidden_states[0][-1]
+                talker_hidden_states = hidden_states[codec_mask[:, 1:]]
+                talker_codec_ids = codec_ids[codec_mask]
+
+                sub_talker_logits, sub_talker_loss = model.talker.forward_sub_talker_finetune(talker_codec_ids, talker_hidden_states)
+
+                loss = outputs.loss + sub_talker_loss
+
+                accelerator.backward(loss)
+
+                if accelerator.sync_gradients:
+                    accelerator.clip_grad_norm_(model.parameters(), 1.0)
+
+                optimizer.step()
+                optimizer.zero_grad()
+
+            if step % 10 == 0:
+                accelerator.print(f"Epoch {epoch} | Step {step} | Loss: {loss.item():.4f}")
+
+        if accelerator.is_main_process:
+            output_dir = os.path.join(args.output_model_path, f"checkpoint-epoch-{epoch}")
+            shutil.copytree(MODEL_PATH, output_dir, dirs_exist_ok=True)
+
+            input_config_file = os.path.join(MODEL_PATH, "config.json")
+            output_config_file = os.path.join(output_dir, "config.json")
+            with open(input_config_file, 'r', encoding='utf-8') as f:
+                config_dict = json.load(f)
+            config_dict["tts_model_type"] = "custom_voice"
+            talker_config = config_dict.get("talker_config", {})
+            talker_config["spk_id"] = {
+                args.speaker_name: 3000
+            }
+            talker_config["spk_is_dialect"] = {
+                args.speaker_name: False
+            }
+            config_dict["talker_config"] = talker_config
+
+            with open(output_config_file, 'w', encoding='utf-8') as f:
+                json.dump(config_dict, f, indent=2, ensure_ascii=False)
+
+            unwrapped_model = accelerator.unwrap_model(model)
+            state_dict = {k: v.detach().to("cpu") for k, v in unwrapped_model.state_dict().items()}
+
+            drop_prefix = "speaker_encoder"
+            keys_to_drop = [k for k in state_dict.keys() if k.startswith(drop_prefix)]
+            for k in keys_to_drop:
+                del state_dict[k]
+
+            weight = state_dict['talker.model.codec_embedding.weight']
+            state_dict['talker.model.codec_embedding.weight'][3000] = target_speaker_embedding[0].detach().to(weight.device).to(weight.dtype)
+            save_path = os.path.join(output_dir, "model.safetensors")
+            save_file(state_dict, save_path)
+
+if __name__ == "__main__":
+    train()
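Review note on `sft_12hz.py`: the checkpoint step rewrites `config.json` inline. A minimal sketch of the same transformation as a pure function is shown here, which may make it easier to unit test. The name `patch_speaker_config` and the `spk_row` parameter are hypothetical; the keys and the default row `3000` are taken from the diff above.

```python
import copy


def patch_speaker_config(config_dict: dict, speaker_name: str, spk_row: int = 3000) -> dict:
    """Return a copy of config_dict marked as a custom-voice model for one speaker.

    Hypothetical helper; mirrors the inline edits made before json.dump in the diff.
    """
    patched = copy.deepcopy(config_dict)       # leave the caller's dict untouched
    patched["tts_model_type"] = "custom_voice"
    talker_config = patched.get("talker_config", {})
    talker_config["spk_id"] = {speaker_name: spk_row}        # row in codec_embedding
    talker_config["spk_is_dialect"] = {speaker_name: False}
    patched["talker_config"] = talker_config
    return patched


if __name__ == "__main__":
    base = {"talker_config": {"hidden_size": 8}}
    print(patch_speaker_config(base, "speaker_test"))
```

The row index written to `spk_id` has to agree with the row of `talker.model.codec_embedding.weight` that receives the speaker embedding, so passing one value to both places would keep them in sync.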
--- a/models/Qwen3-TTS/pyproject.toml
+++ b/models/Qwen3-TTS/pyproject.toml
@@ -0,0 +1,46 @@
+[build-system]
+requires = ["setuptools>=68", "wheel"]
+build-backend = "setuptools.build_meta"
+
+[project]
+name = "qwen-tts"
+version = "0.0.4"
+description = "Qwen-TTS python package"
+readme = "README.md"
+requires-python = ">=3.9"
+classifiers = [
+  "Programming Language :: Python :: 3",
+  "Programming Language :: Python :: 3.9",
+  "Programming Language :: Python :: 3.10",
+  "Programming Language :: Python :: 3.11",
+  "Programming Language :: Python :: 3.12",
+  "Programming Language :: Python :: 3.13",
+]
+license = { text = "Apache-2.0" }
+authors = [{ name = "Alibaba Qwen Team" }]
+
+dependencies = [
+  "transformers==4.57.3",
+  "accelerate==1.12.0",
+  "gradio",
+  "librosa",
+  "torchaudio",
+  "soundfile",
+  "sox",
+  "onnxruntime",
+  "einops",
+]
+
+[project.urls]
+Homepage = "https://github.com/Qwen/Qwen3-TTS"
+Repository = "https://github.com/Qwen/Qwen3-TTS"
+
+[project.scripts]
+qwen-tts-demo = "qwen_tts.cli.demo:main"
+
+[tool.setuptools]
+packages = { find = { where = ["."] , include = ["qwen_tts*"] } }
+include-package-data = true
+
+[tool.setuptools.package-data]
+qwen_tts = ["py.typed", "**/*.npz"]
Author	SHA1	Message	Date
Kevin Wong	b2c1042c5c	Update	2026-02-04 18:04:17 +08:00
Kevin Wong	aaa8088c82	Update	2026-02-04 17:19:24 +08:00
Kevin Wong	31469ca01d	Update	2026-02-04 16:56:16 +08:00
Kevin Wong	22ea3dd0db	Update	2026-02-04 16:54:59 +08:00
Kevin Wong	8a5912c517	Update	2026-02-04 15:59:45 +08:00
Kevin Wong	74516dbcdb	Update	2026-02-04 11:56:37 +08:00
Kevin Wong	5357d97012	Update	2026-02-04 11:41:55 +08:00
Kevin Wong	33d8e52802	Update	2026-02-03 17:42:04 +08:00
Kevin Wong	9af50a9066	Update	2026-02-03 17:15:35 +08:00
Kevin Wong	6c6fbae13a	Update	2026-02-03 17:12:30 +08:00
Kevin Wong	cb10da52fc	Update	2026-02-03 13:46:52 +08:00
Kevin Wong	eb3ed23326	Update	2026-02-02 17:34:36 +08:00
Kevin Wong	6e58f4bbe7	Update	2026-02-02 17:16:07 +08:00
Kevin Wong	7bfd6bf862	Update	2026-02-02 14:28:48 +08:00
Kevin Wong	569736d05b	Update code	2026-02-02 11:49:22 +08:00
Kevin Wong	ec16e08bdb	Update code	2026-02-02 10:58:21 +08:00
Kevin Wong	6801d3e8aa	Update code	2026-02-02 10:51:27 +08:00
Kevin Wong	cf679b34bf	Update	2026-01-29 17:58:07 +08:00
Kevin Wong	b74bacb0b5	Update	2026-01-29 17:54:43 +08:00
Kevin Wong	661a8f357c	Update	2026-01-29 12:16:41 +08:00
Kevin Wong	4a3dd2b225	Update	2026-01-28 17:22:31 +08:00
Kevin Wong	ee8cb9cfd2	Update	2026-01-27 16:52:40 +08:00