Commits · Jerich/TalklasApp

Fix TTS language issue by dynamically updating model in /translate-audio endpoint

2b35bda
verified

Jerich committed 16 days ago

Replaced librosa with torchaudio for audio loading and resampling. Added speech detection (energy-based or webrtcvad for accuracy). Improved /translate-audio endpoint to handle silent audio gracefully.

b0c2331
verified

Jerich committed 18 days ago

Fix TTS error by updating save_pcm_to_wav to handle 16-bit audio samples correctly using NumPy tobytes()

bf59705
verified

Jerich committed 21 days ago

Fix PermissionError by changing AUDIO_DIR to /tmp/audio_output for Hugging Face Spaces

e79c9c5
verified

Jerich committed 21 days ago

Fix PermissionError by setting AUDIO_DIR to /app/audio_output for Hugging Face Spaces compatibility

060630c
verified

Jerich committed 21 days ago

Update audio handling: Save synthesized audio as WAV, return URL, set 10-min expiration with 5-min cleanup interval

eacfbe2
verified

Jerich committed 21 days ago
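
The expiration scheme named in this commit message (10-minute lifetime, cleanup pass every 5 minutes) could look roughly like the sketch below. It is not the code from app.py: the function name `cleanup_expired_audio`, the constants, and the choice to use file modification time as the age are all assumptions made for illustration.

```python
import os
import time

EXPIRATION_SECONDS = 10 * 60       # assumed: files older than this are deleted
CLEANUP_INTERVAL_SECONDS = 5 * 60  # assumed: how often a scheduler would run the cleanup


def cleanup_expired_audio(audio_dir, max_age=EXPIRATION_SECONDS, now=None):
    """Delete .wav files in audio_dir older than max_age seconds.

    Age is measured from the file's modification time. Returns the sorted
    names of the files that were removed.
    """
    now = time.time() if now is None else now
    removed = []
    for name in os.listdir(audio_dir):
        path = os.path.join(audio_dir, name)
        # Only touch regular .wav files; leave everything else alone.
        if not (name.endswith(".wav") and os.path.isfile(path)):
            continue
        if now - os.path.getmtime(path) > max_age:
            os.remove(path)
            removed.append(name)
    return sorted(removed)
```

A background task or timer would call `cleanup_expired_audio` once every `CLEANUP_INTERVAL_SECONDS`, so with these values a file can live for up to about 15 minutes before it is removed.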

Switch STT model to facebook/mms-1b-all with Whisper fallback

98d2781
verified

Jerich committed 23 days ago

Fix STT error by removing return_attention_mask and address forced_decoder_ids warning

62f9c95
verified

Jerich committed 23 days ago

Fix MT error by removing clean_up_tokenization_spaces and address STT warnings

e656c37
verified

Jerich committed 23 days ago

Update app.py

989a3f5
verified

Jerich committed 23 days ago

Add MT with NLLB-200-distilled-600M and TTS with MMS-TTS

f56283f
verified

Jerich committed 23 days ago

Fix torch import error in translate-audio endpoint

f8dca01
verified

Jerich committed 23 days ago

Add STT functionality with openai/whisper-tiny

ec27d4d
verified

Jerich committed 23 days ago

Update app.py

aef3d4f
verified

Jerich committed 23 days ago

Deploy minimal FastAPI app to test Hugging Face Spaces compatibility

5f26e89
verified

Jerich committed 23 days ago

Update app.py

2fe79f5
verified

Jerich committed 23 days ago

Implement lazy loading of ML models to fix startup timeout on HF Spaces

352aa0e
verified

Jerich committed 23 days ago
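
Lazy loading, as named in this commit message, means the server starts without loading any model and loads each one on first use. A minimal sketch of that idea follows, assuming a cache keyed by model name. `_load_model` is a hypothetical stand-in for the expensive load and `get_model` is an illustrative name; neither is taken from app.py.

```python
from functools import lru_cache

LOAD_CALLS = []  # records each real load so the caching is observable


def _load_model(name):
    """Hypothetical stand-in for an expensive model load (e.g. reading weights)."""
    LOAD_CALLS.append(name)
    return {"name": name}


@lru_cache(maxsize=None)
def get_model(name):
    """Load the model on first request and return the cached object afterwards."""
    return _load_model(name)
```

Nothing is loaded at import time, so startup stays fast. The first request that calls `get_model("stt")` pays the loading cost, and later calls with the same name reuse the cached object.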

Fix MT model loading: Revert to nllb-200-distilled-600M and add fallback

6a1bf6c
verified

Jerich committed 23 days ago

Further optimize memory usage: Switch to nllb-200-distilled-200M and add Uvicorn debug logging

e868e42
verified

Jerich committed 23 days ago

Optimize app for Hugging Face Spaces: Switch to whisper-tiny, add health check, and improve model loading

1881cc0
verified

Jerich committed 23 days ago

Replaced the openai/whisper-tiny references with openai/whisper-small

4279e53
verified

Jerich committed 23 days ago

Modified the _initialize_tts_model method to include the clean_up_tokenization_spaces parameter; Added logging configuration to configure the logging level for transformers in app.py

f12e9dc
verified

Jerich committed 23 days ago

Modified the tokenization step to include clean_up_tokenization_spaces=True; Added clean_up_tokenization_spaces=True in the text_to_speech method; Added a print statement to confirm the TTS model is loaded

5b5fc47
verified

Jerich committed 23 days ago

Removed the TRANSFORMERS_CACHE environment variable and its associated debug print statement.

741c20d
verified

Jerich committed 23 days ago

Update app.py

52830c5
verified

Jerich committed 23 days ago

Set the Cache Directory Explicitly

de6eef4
verified

Jerich committed 23 days ago

Modified the code with API endpoints

4763326
verified

Jerich committed 23 days ago

Update app.py

fff0177
verified

Jerich committed 23 days ago

API integration for the Talklas pipeline

de2c35e
verified

Jerich committed 23 days ago

Expose the Hugging Face Code as an API

224fa8d
verified

Jerich committed 24 days ago

Update app.py

e978acd
verified

Jerich committed 30 days ago

Update app.py

851d8b5
verified

Jerich committed 30 days ago

Create app.py

f53ba4b
verified

Jerich committed 30 days ago
