Commits · aimeri/Qwen2.5-Omni-7B-Demo

Update README.md to specify Python version as 3.12 and modify requirements.txt to include a direct link for flash-attn package installation, enhancing dependency management.

0c57f30

aimeri commited on 25 days ago

Update process_input function in app.py to handle audio generation output more robustly, introducing a fallback mechanism for text generation in case of unexpected output formats. Improve error handling during audio and text generation processes. Additionally, update requirements.txt to include flash-attn for enhanced performance.

c98fc82

aimeri commited on 25 days ago

Refactor chat history processing in app.py to support both dictionary and list formats for messages, enhancing compatibility with previous data structures. Update text input handling to align with new message format, ensuring consistent user experience.

9f83467

aimeri commited on 25 days ago

Refactor chat history management in process_input function of app.py to utilize dictionaries for user and assistant messages, improving clarity and structure. Update error handling to maintain consistent message formatting in chat history.

417050f

aimeri commited on 25 days ago

Refactor process_input function in app.py to improve error handling during audio and text generation. Introduce try-except blocks for better exception management and ensure audio is only saved if generated successfully. Enhance text response processing with additional error handling for decoding.

c777d2d

aimeri commited on 25 days ago

Enhance process_input function in app.py by initializing user_message_for_display at the start and improving error handling for text responses. Ensure proper formatting for multimodal inputs and update chat history management to reflect user messages accurately.

d4eafc5

aimeri commited on 25 days ago

Refactor process_input and create_demo functions in app.py to enhance chat history management and improve text response handling, including the addition of user and assistant avatars. Update input clearing logic for better user experience with multimodal inputs.

8ddcd3c

aimeri commited on 25 days ago

Enhance process_input and create_demo functions in app.py to improve multimodal input handling, including better formatting for user messages and integration of TextStreamer for text response generation.

e4a9a7a

aimeri commited on 25 days ago

Refactor process_input and create_demo functions in app.py to improve chat history handling and ensure proper formatting for multimodal inputs, addressing audio processing issues.

1638860

aimeri commited on 25 days ago

Refactor chat history handling in process_input function to ensure it initializes correctly when not a list; improve user message formatting for multimodal inputs in app.py.

6a119c7

aimeri commited on Mar 28

Improve chat history formatting in process_input and create_demo functions in app.py to enhance user experience with multimodal inputs, ensuring clear display of uploaded content types.

9e14c66

aimeri commited on Mar 28

Enhance process_input function in app.py to ensure multimodal inputs are handled correctly when None; update demo creation to include hidden placeholders for image, audio, and video inputs.

513e0c6

aimeri commited on Mar 28

Refactor process_input function in app.py to handle multimodal inputs (image, audio, video, text) and update demo creation logic accordingly.

f0ab3ba

aimeri commited on Mar 28

Add accelerate dependency to requirements.txt

f14a0ac

aimeri commited on Mar 28

Update requirements.txt to add torchvision and torchaudio dependencies.

c40a960

aimeri commited on Mar 28

Update app.py to change torch data type to float16 and comment out attention implementation; modify requirements.txt to reorder dependencies and remove flash-attn.

b7fa0fa

aimeri commited on Mar 28

Refactor app.py to import 'spaces' and add GPU decorator; update requirements.txt to remove no-build-isolation flag from flash-attn

c7db60b

aimeri commited on Mar 28

Add application file

039d869

aimeri commited on Mar 28

initial commit

ca99eca
verified

aimeri commited on Mar 28

Spaces:

aimeri
/

Qwen2.5-Omni-7B-Demo

Build error

Commit History

Update README.md to specify Python version as 3.12 and modify requirements.txt to include a direct link for flash-attn package installation, enhancing dependency management.

0c57f30

Refactor chat history processing in app.py to support both dictionary and list formats for messages, enhancing compatibility with previous data structures. Update text input handling to align with new message format, ensuring consistent user experience.

9f83467

Refactor chat history management in process_input function of app.py to utilize dictionaries for user and assistant messages, improving clarity and structure. Update error handling to maintain consistent message formatting in chat history.

417050f

Enhance process_input function in app.py by initializing user_message_for_display at the start and improving error handling for text responses. Ensure proper formatting for multimodal inputs and update chat history management to reflect user messages accurately.

d4eafc5

Refactor process_input and create_demo functions in app.py to enhance chat history management and improve text response handling, including the addition of user and assistant avatars. Update input clearing logic for better user experience with multimodal inputs.

8ddcd3c

Enhance process_input and create_demo functions in app.py to improve multimodal input handling, including better formatting for user messages and integration of TextStreamer for text response generation.

e4a9a7a

Refactor process_input and create_demo functions in app.py to improve chat history handling and ensure proper formatting for multimodal inputs, addressing audio processing issues.

1638860

Refactor chat history handling in process_input function to ensure it initializes correctly when not a list; improve user message formatting for multimodal inputs in app.py.

6a119c7

Improve chat history formatting in process_input and create_demo functions in app.py to enhance user experience with multimodal inputs, ensuring clear display of uploaded content types.

9e14c66

Enhance process_input function in app.py to ensure multimodal inputs are handled correctly when None; update demo creation to include hidden placeholders for image, audio, and video inputs.

513e0c6

Refactor process_input function in app.py to handle multimodal inputs (image, audio, video, text) and update demo creation logic accordingly.

f0ab3ba

Add accelerate dependency to requirements.txt

f14a0ac

Update requirements.txt to add torchvision and torchaudio dependencies.

c40a960

Update app.py to change torch data type to float16 and comment out attention implementation; modify requirements.txt to reorder dependencies and remove flash-attn.

b7fa0fa

Refactor app.py to import 'spaces' and add GPU decorator; update requirements.txt to remove no-build-isolation flag from flash-attn

c7db60b

Add application file

039d869

initial commit

ca99eca
verified

Commit History

Update README.md to specify Python version as 3.12 and modify requirements.txt to include a direct link for flash-attn package installation, enhancing dependency management. 0c57f30

Refactor chat history processing in app.py to support both dictionary and list formats for messages, enhancing compatibility with previous data structures. Update text input handling to align with new message format, ensuring consistent user experience. 9f83467

Refactor chat history management in process_input function of app.py to utilize dictionaries for user and assistant messages, improving clarity and structure. Update error handling to maintain consistent message formatting in chat history. 417050f

Enhance process_input function in app.py by initializing user_message_for_display at the start and improving error handling for text responses. Ensure proper formatting for multimodal inputs and update chat history management to reflect user messages accurately. d4eafc5

Refactor process_input and create_demo functions in app.py to enhance chat history management and improve text response handling, including the addition of user and assistant avatars. Update input clearing logic for better user experience with multimodal inputs. 8ddcd3c

Enhance process_input and create_demo functions in app.py to improve multimodal input handling, including better formatting for user messages and integration of TextStreamer for text response generation. e4a9a7a

Refactor process_input and create_demo functions in app.py to improve chat history handling and ensure proper formatting for multimodal inputs, addressing audio processing issues. 1638860

Refactor chat history handling in process_input function to ensure it initializes correctly when not a list; improve user message formatting for multimodal inputs in app.py. 6a119c7

Improve chat history formatting in process_input and create_demo functions in app.py to enhance user experience with multimodal inputs, ensuring clear display of uploaded content types. 9e14c66

Enhance process_input function in app.py to ensure multimodal inputs are handled correctly when None; update demo creation to include hidden placeholders for image, audio, and video inputs. 513e0c6

Refactor process_input function in app.py to handle multimodal inputs (image, audio, video, text) and update demo creation logic accordingly. f0ab3ba

Add accelerate dependency to requirements.txt f14a0ac

Update requirements.txt to add torchvision and torchaudio dependencies. c40a960

Update app.py to change torch data type to float16 and comment out attention implementation; modify requirements.txt to reorder dependencies and remove flash-attn. b7fa0fa

Refactor app.py to import 'spaces' and add GPU decorator; update requirements.txt to remove no-build-isolation flag from flash-attn c7db60b

Add application file 039d869

initial commit ca99eca verified

Update README.md to specify Python version as 3.12 and modify requirements.txt to include a direct link for flash-attn package installation, enhancing dependency management.

0c57f30

Refactor chat history processing in app.py to support both dictionary and list formats for messages, enhancing compatibility with previous data structures. Update text input handling to align with new message format, ensuring consistent user experience.

9f83467

Refactor chat history management in process_input function of app.py to utilize dictionaries for user and assistant messages, improving clarity and structure. Update error handling to maintain consistent message formatting in chat history.

417050f

Enhance process_input function in app.py by initializing user_message_for_display at the start and improving error handling for text responses. Ensure proper formatting for multimodal inputs and update chat history management to reflect user messages accurately.

d4eafc5

Refactor process_input and create_demo functions in app.py to enhance chat history management and improve text response handling, including the addition of user and assistant avatars. Update input clearing logic for better user experience with multimodal inputs.

8ddcd3c

Enhance process_input and create_demo functions in app.py to improve multimodal input handling, including better formatting for user messages and integration of TextStreamer for text response generation.

e4a9a7a

Refactor process_input and create_demo functions in app.py to improve chat history handling and ensure proper formatting for multimodal inputs, addressing audio processing issues.

1638860

Refactor chat history handling in process_input function to ensure it initializes correctly when not a list; improve user message formatting for multimodal inputs in app.py.

6a119c7

Improve chat history formatting in process_input and create_demo functions in app.py to enhance user experience with multimodal inputs, ensuring clear display of uploaded content types.

9e14c66

Enhance process_input function in app.py to ensure multimodal inputs are handled correctly when None; update demo creation to include hidden placeholders for image, audio, and video inputs.

513e0c6

Refactor process_input function in app.py to handle multimodal inputs (image, audio, video, text) and update demo creation logic accordingly.

f0ab3ba

Add accelerate dependency to requirements.txt

f14a0ac

Update requirements.txt to add torchvision and torchaudio dependencies.

c40a960

Update app.py to change torch data type to float16 and comment out attention implementation; modify requirements.txt to reorder dependencies and remove flash-attn.

b7fa0fa

Refactor app.py to import 'spaces' and add GPU decorator; update requirements.txt to remove no-build-isolation flag from flash-attn

c7db60b

Add application file

039d869

initial commit

ca99eca
verified