kemuriririn
a26894de71
+回归测试脚本 ( #103 )
...
* deepspeed无法使用时回退到通常路径
* ninja支持中文路径编译补丁:BigVGAN fused cuda kernel
* 缓存参考音频的Mel
* ninja支持中文路径编译方案2:BigVGAN fused cuda kernel
* 增加批次推理:长句实现至少 2~10 倍以上的速度提升~
* fix上层目录为空时报错
* 批次推理:重要修复(漏句/丢句/音频空白)
* 批次推理:新增数据分桶机制,增强稳定性~
* +回归测试脚本
* update 回归测试脚本
* fix merge出错
---------
Co-authored-by: kemuriririn <10inspiral@gmail.com>
Co-authored-by: sunnyboxs <sjt2000@qq.com>
2025-04-18 18:09:13 +08:00
sunnyboxs
71c5295198
批次推理:修复(漏句/丢句/音频空白) ( #100 )
...
* 批次推理:重要修复(漏句/丢句/音频空白)
* 批次推理:新增数据分桶机制,增强稳定性~
2025-04-18 17:57:07 +08:00
jc-shu
919062dfb0
Update README.md
...
update readme
2025-04-17 19:22:37 +08:00
sunnyboxs
fc78f322ad
增加批次推理:长句实现至少 2~10 倍以上的速度提升~ ( #97 )
...
* ninja支持中文路径编译补丁:BigVGAN fused cuda kernel
* 缓存参考音频的Mel
* ninja支持中文路径编译方案2:BigVGAN fused cuda kernel
* 增加批次推理:长句实现至少 2~10 倍以上的速度提升~
---------
Co-authored-by: kemuriririn <nanaoto@foxmail.com>
2025-04-17 15:13:58 +08:00
kemuriririn
6783f22fe4
Feature/kemurin ( #99 )
...
* deepspeed无法使用时回退到通常路径
* ninja支持中文路径编译补丁:BigVGAN fused cuda kernel
* 缓存参考音频的Mel
* ninja支持中文路径编译方案2:BigVGAN fused cuda kernel
* 增加批次推理:长句实现至少 2~10 倍以上的速度提升~
* fix上层目录为空时报错
---------
Co-authored-by: kemuriririn <10inspiral@gmail.com>
Co-authored-by: sunnyboxs <sjt2000@qq.com>
2025-04-17 15:12:45 +08:00
sunnyboxs
91b7fa6148
ninja中文路径编译补丁支持:BigVGAN fused cuda kernel ( #93 )
...
* ninja支持中文路径编译补丁:BigVGAN fused cuda kernel
* 缓存参考音频的Mel
* ninja支持中文路径编译方案2:BigVGAN fused cuda kernel
2025-04-17 14:56:37 +08:00
root
ac96789ee1
update seed test results
2025-04-17 11:49:51 +08:00
root
b6c11dddb9
Add the calculation time of each module.
2025-04-15 12:48:47 +08:00
index-tts
6f830327d6
Merge pull request #92 from yrom/feat/activation-cuda
...
使用BigVGAN fused cuda kernel
2025-04-15 12:28:29 +08:00
yrom
06e445779d
Update README
2025-04-15 12:05:07 +08:00
Yrom
94d1353e4e
enable custom cuda kernel for BigVGAN
2025-04-15 12:04:59 +08:00
kemuriririn
21a3212a34
deepspeed无法使用时回退到通常路径 ( #90 )
...
Co-authored-by: kemuriririn <10inspiral@gmail.com>
2025-04-14 20:22:57 +08:00
Yrom Wang
18c32c06b1
修复拼音问题和分句问题,支持轻音声调(如yi1 shang5) ( #83 )
...
* Update Pinyin tone handling in TextNormalizer
* Enhance sentence splitting and improve tokenizer integration in inference
* Update character replacement mappings
test: "在电影《肖申克的救赎》中,安迪·杜佛兰被错误地判处终身监禁..."
* Refactor TextNormalizer and enhance testing with additional cases
2025-04-14 19:50:36 +08:00
index-tts
f07a9032c1
Merge pull request #78 from yrom/feat/mac-support
...
Add a new Command-Line Interface and support for mps device (Apple Silicon)
2025-04-11 22:01:00 +08:00
Yrom
879e270d39
Adds MPS support for Apple Silicon
2025-04-11 21:22:08 +08:00
Yrom
ec65755fc8
Support inference on CPU
2025-04-11 20:58:41 +08:00
Yrom
471a45435c
Add cli mode for inference
2025-04-11 20:33:54 +08:00
root
eff6eb8f43
fix bug.
2025-04-10 10:52:59 +08:00
root
702cfa905c
fix long silence bug.
2025-04-09 19:53:36 +08:00
root
999cf40258
fix long silence bug.
2025-04-09 19:52:49 +08:00
root
47ec591d40
fix long silence bug.
2025-04-09 19:45:18 +08:00
shujingchen
ea9acb5ca3
Merge from main
2025-04-09 12:19:44 +08:00
shujingchen
058be6f799
Merge from main
2025-04-09 12:02:28 +08:00
root
19be5dba2d
fix bug.
2025-04-09 10:38:51 +08:00
root
18e20ccbb4
enable front-end caching to speed up startup.
2025-04-09 10:35:47 +08:00
shujingchen
a649fe2bff
set replace_with_kernel_inject=False as default for gpt infer
2025-04-08 16:02:26 +08:00
root
ae395dc416
cleanup code
2025-04-08 11:54:31 +08:00
boostpapa
2523001bb4
support ultra-long silence filtering
2025-04-08 11:23:11 +08:00
shujingchen
e92bf90235
DeepSpeed acceleration and FP16 inference support, but bigvgan disable
2025-04-03 16:30:39 +08:00
index-tts
397fef2f14
Merge pull request #39 from index-tts/feature/kemurin
...
推理时加载bpe model使用相对于模型根目录的路径
2025-04-02 17:42:39 +08:00
kemuriririn
6286b0ffc9
推理时加载bpe model使用相对于模型根目录的路径
2025-04-02 17:40:41 +08:00
kemuriririn
5fd2fd2a5c
Merge remote-tracking branch 'origin/main' into feature/kemurin
2025-04-02 14:27:53 +08:00
index-tts
c9bea55903
Update README.md
2025-03-28 15:56:47 +08:00
kemuriririn
1734698f3d
一些输入文本处理的fix ( #18 )
...
* +简单前端
* 前端兼容arm机器
* fix
* fix
* WeTextProcessing: overwrite_cache=True 刷新前端缓存
* 恢复输入中的拼音
---------
Co-authored-by: kemuriririn <10inspiral@gmail.com>
2025-03-27 14:12:12 +08:00
kemuriririn
94004b5eb3
Merge remote-tracking branch 'origin/main' into feature/kemurin
2025-03-27 14:09:25 +08:00
kemuriririn
fd81f4a5bd
恢复输入中的拼音
2025-03-27 14:03:51 +08:00
wangyining02
1004452e95
WeTextProcessing: overwrite_cache=True 刷新前端缓存
2025-03-26 20:29:12 +08:00
kemuriririn
c73344ecc9
集成简单前端 ( #15 )
...
* +简单前端
* 前端兼容arm机器
* fix
* fix
---------
Co-authored-by: wangyining02 <wangyining02@bilibili.com>
2025-03-26 19:39:08 +08:00
wangyining02
f6e7b4acf6
fix
2025-03-26 19:33:12 +08:00
wangyining02
fb0bc6a486
fix
2025-03-26 19:29:31 +08:00
wangyining02
9a925a1497
前端兼容arm机器
2025-03-26 19:28:44 +08:00
wangyining02
46630ca45b
+简单前端
2025-03-26 19:14:47 +08:00
kemuriririn
3ad7b7b566
Update README.md
2025-03-26 18:23:17 +08:00
wangyining02
bd7530a7d9
add download models commands in README.md
2025-03-26 17:36:42 +08:00
index-tts
dec377d9a1
Update README.md
2025-03-26 12:49:41 +08:00
wangyining02
b28b4edbdd
Merge branch 'eschmidbauer-main'
2025-03-26 12:47:49 +08:00
wangyining02
de60f6829b
Merge branch 'main' of github.com:eschmidbauer/index-tts into eschmidbauer-main
2025-03-26 12:46:19 +08:00
wangyining02
b1bcd97022
Merge remote-tracking branch 'origin/main'
2025-03-26 12:20:29 +08:00
wangyining02
8031b5d654
fix import error in feature_extractors.py
2025-03-26 12:19:57 +08:00
wangyining02
b591e84bf9
rename utils.utils to utils.common
2025-03-26 12:15:48 +08:00