Spaces:

lisonallen
/

framepack-i2v

Running on Zero

App Files Files Community

lisonallen commited on 9 days ago

Commit

8336ddb

1 Parent(s): 1082c60

Limit video length to maximum 5 seconds

Browse files

Files changed (2) hide show

README.md +27 -5
app.py +10 -7

README.md CHANGED Viewed

@@ -1,5 +1,5 @@
 ---
-title: FramePack图像到视频生成
 emoji: 🎬
 colorFrom: indigo
 colorTo: purple
@@ -10,23 +10,45 @@ pinned: false
 license: mit
 ---
-# FramePack
-FramePack是一个图像到视频生成工具，利用扩散模型将静态图像转换为动态视频。
 ## 特点
 - 使用单张图片生成流畅的动作视频
 - 基于HunyuanVideo和FramePack架构
 - 支持低显存GPU（最低6GB）运行
-- 可以生成最长120秒的视频
 - 使用TeaCache技术加速生成过程
 ## 使用方法
 1. 上传一张人物图像
 2. 输入描述所需动作的提示词
-3. 设置所需视频长度（秒）
 4. 点击"开始生成"按钮
 5. 等待视频生成（生成过程是渐进式的，会不断扩展视频长度）

 ---
+title: FramePack图像到视频生成(5秒限制版)
 emoji: 🎬
 colorFrom: indigo
 colorTo: purple
 license: mit
 ---
+# FramePack - Image to Video Generation
+This is a modified version of the FramePack model with a 5-second maximum video length limit.
+## Features
+- Generate realistic videos from still images
+- Simple and intuitive interface
+- Bilingual support (English/Chinese)
+- Maximum video length of 5 seconds to ensure quick generation times
+## Usage
+1. Upload an image
+2. Enter a prompt describing the desired motion
+3. Adjust parameters if needed (seed, video length, etc.)
+4. Click "Generate" and wait for the result
+## Technical Details
+This application uses the HunyuanVideo transformer model for image-to-video generation. The model has been optimized to work efficiently with videos up to 5 seconds in length.
+## Credits
+Based on the original FramePack model by lllyasviel.
 ## 特点
 - 使用单张图片生成流畅的动作视频
 - 基于HunyuanVideo和FramePack架构
 - 支持低显存GPU（最低6GB）运行
+- 可以生成最长5秒的视频
 - 使用TeaCache技术加速生成过程
 ## 使用方法
 1. 上传一张人物图像
 2. 输入描述所需动作的提示词
+3. 设置所需视频长度（最大5秒）
 4. 点击"开始生成"按钮
 5. 等待视频生成（生成过程是渐进式的，会不断扩展视频长度）

app.py CHANGED Viewed

@@ -23,7 +23,7 @@ translations = {
         "teacache_info": "Faster speed, but may result in slightly worse finger and hand generation.",
         "negative_prompt": "Negative Prompt",
         "seed": "Seed",
-        "video_length": "Video Length (seconds)",
         "latent_window": "Latent Window Size",
         "steps": "Inference Steps",
         "steps_info": "Changing this value is not recommended.",
@@ -55,7 +55,7 @@ translations = {
         "teacache_info": "速度更快，但可能会使手指和手的生成效果稍差。",
         "negative_prompt": "负面提示词",
         "seed": "随机种子",
-        "video_length": "视频长度(秒)",
         "latent_window": "潜在窗口大小",
         "steps": "推理步数",
         "steps_info": "不建议修改此值。",
@@ -420,6 +420,9 @@ def worker(input_image, prompt, n_prompt, seed, total_second_length, latent_wind
     global last_update_time
     last_update_time = time.time()
     # 获取模型
     try:
         models = get_models()
@@ -456,7 +459,7 @@ def worker(input_image, prompt, n_prompt, seed, total_second_length, latent_wind
         # 减小处理大小以加快CPU处理
         latent_window_size = min(latent_window_size, 5)
         steps = min(steps, 15)  # 减少步数
-        total_second_length = min(total_second_length, 2.0)  # 限制视频长度
     total_latent_sections = (total_second_length * 30) / (latent_window_size * 4)
     total_latent_sections = int(max(round(total_latent_sections), 1))
@@ -1302,7 +1305,7 @@ with block:
                             "teacache_info": "Faster speed, but may result in slightly worse finger and hand generation.",
                             "negative_prompt": "Negative Prompt",
                             "seed": "Seed",
-                            "video_length": "Video Length (seconds)",
                             "latent_window": "Latent Window Size",
                             "steps": "Inference Steps",
                             "steps_info": "Changing this value is not recommended.",
@@ -1334,7 +1337,7 @@ with block:
                             "teacache_info": "速度更快，但可能会使手指和手的生成效果稍差。",
                             "negative_prompt": "负面提示词",
                             "seed": "随机种子",
-                            "video_length": "视频长度(秒)",
                             "latent_window": "潜在窗口大小",
                             "steps": "推理步数",
                             "steps_info": "不建议修改此值。",
@@ -1486,9 +1489,9 @@ with block:
                 # 添加slider-container类以便CSS触摸优化
                 with gr.Group(elem_classes="slider-container"):
                     total_second_length = gr.Slider(
-                        label="Video Length (seconds) / 视频长度(秒)",
                         minimum=1,
-                        maximum=120,
                         value=5,
                         step=0.1
                     )

         "teacache_info": "Faster speed, but may result in slightly worse finger and hand generation.",
         "negative_prompt": "Negative Prompt",
         "seed": "Seed",
+        "video_length": "Video Length (max 5 seconds)",
         "latent_window": "Latent Window Size",
         "steps": "Inference Steps",
         "steps_info": "Changing this value is not recommended.",
         "teacache_info": "速度更快，但可能会使手指和手的生成效果稍差。",
         "negative_prompt": "负面提示词",
         "seed": "随机种子",
+        "video_length": "视频长度(最大5秒)",
         "latent_window": "潜在窗口大小",
         "steps": "推理步数",
         "steps_info": "不建议修改此值。",
     global last_update_time
     last_update_time = time.time()
+    # 限制视频长度不超过5秒
+    total_second_length = min(total_second_length, 5.0)
     # 获取模型
     try:
         models = get_models()
         # 减小处理大小以加快CPU处理
         latent_window_size = min(latent_window_size, 5)
         steps = min(steps, 15)  # 减少步数
+        total_second_length = min(total_second_length, 2.0)  # CPU模式下进一步限制视频长度
     total_latent_sections = (total_second_length * 30) / (latent_window_size * 4)
     total_latent_sections = int(max(round(total_latent_sections), 1))
                             "teacache_info": "Faster speed, but may result in slightly worse finger and hand generation.",
                             "negative_prompt": "Negative Prompt",
                             "seed": "Seed",
+                            "video_length": "Video Length (max 5 seconds)",
                             "latent_window": "Latent Window Size",
                             "steps": "Inference Steps",
                             "steps_info": "Changing this value is not recommended.",
                             "teacache_info": "速度更快，但可能会使手指和手的生成效果稍差。",
                             "negative_prompt": "负面提示词",
                             "seed": "随机种子",
+                            "video_length": "视频长度(最大5秒)",
                             "latent_window": "潜在窗口大小",
                             "steps": "推理步数",
                             "steps_info": "不建议修改此值。",
                 # 添加slider-container类以便CSS触摸优化
                 with gr.Group(elem_classes="slider-container"):
                     total_second_length = gr.Slider(
+                        label="Video Length (max 5 seconds) / 视频长度(最大5秒)",
                         minimum=1,
+                        maximum=5,
                         value=5,
                         step=0.1
                     )