210 Commits

Author SHA1 Message Date
Arcitec
55b7d32149 fix: Fix character encoding in examples 2025-09-08 16:14:38 +02:00
Arcitec
d5cdb5eb3c fix: Suppress pandas PyArrow future dependency warning
Moving the import of pandas *after* we've suppressed FutureWarning, to hide a big and useless warning saying that "Pandas 3.x will require PyArrow".
2025-09-08 16:14:38 +02:00
Arcitec
3e64c4ac11 fix: Update pandas to fix Gradio errors
Gradio requires Pandas >= 2.2.0, otherwise it will throw errors in some situations (such as when GPU is Out of Memory).
2025-09-08 16:14:38 +02:00
Arcitec
5ffb84b427 fix: Improve .gitignore and re-add config file
- Improves organization and removes extra junk files.

- Unignores *.yaml files such as config.yaml from the /checkpoints/ directory since we need that file.
2025-09-08 16:14:38 +02:00
kemuriririn
050a4c821e
Dev kemurin (#287)
* update deps for windows

* update reqs & README

* update README.md

* update README.md

* use wetext to replace WeTextProcessing on windows

---------

Co-authored-by: wangyining02 <wangyining02@bilibili.com>
2025-09-08 22:14:34 +08:00
十字鱼
9d4776b082
Use without deepspeed (#280)
Use without deepspeed
2025-09-08 22:09:26 +08:00
kemuriririn
474ec9b6cf
Dev kemurin (#284)
* update deps for windows

* update reqs & README

* update README.md

* update README.md

---------

Co-authored-by: wangyining02 <wangyining02@bilibili.com>
2025-09-08 21:53:27 +08:00
kemuriririn
fd0a77d390
update deps for windows (#282)
Co-authored-by: wangyining02 <wangyining02@bilibili.com>
2025-09-08 20:59:31 +08:00
kemuriririn
c1a5e39716
Indextts2 (#278)
* indextts2

* update lfs for audio files

* fix pypi source & add python version

---------

Co-authored-by: wangyining02 <wangyining02@bilibili.com>
2025-09-08 18:42:02 +08:00
index-tts
92d50a6ba0
Merge pull request #277 from index-tts/tts2dev
Tts2dev fix video
2025-09-08 17:55:53 +08:00
shujingchen
f61d128893 update 2025-09-08 17:55:14 +08:00
shujingchen
3355074853 remove video 2025-09-08 17:52:14 +08:00
shujingchen
2c88c9731f add download link& update video 2025-09-08 17:51:20 +08:00
kemuriririn
72c09ec0b7
Indextts2 (#276)
* indextts2

* update lfs for audio files

---------

Co-authored-by: wangyining02 <wangyining02@bilibili.com>
2025-09-08 17:36:39 +08:00
root
9098497272 Fix the bug that 嗯 cannot be synthesized. 2025-06-17 17:48:04 +08:00
index-tts
10d557a15e
Merge pull request #180 from yrom/fix-bugs
Improve loading `alias_free_activation_cuda`
2025-05-29 11:06:53 +08:00
yrom
92bb2eb0c0 fix: 避免在 MinGW-w64 环境 jit compile cuda ext 2025-05-29 09:01:56 +08:00
kemuriririn
7a5c56bfe2
Update README.md 2025-05-26 13:42:51 +08:00
yrom
59c05c0765
fix: add force_rebuild flag for fused alias_free_activation and update installation instructions 2025-05-23 15:07:39 +08:00
index-tts
414f2a4052
Merge pull request #178 from yrom/fix-front
fix: 添加英语缩写处理
2025-05-23 14:07:50 +08:00
yrom
bb4d76aa2a fix: 添加英语缩写处理
临时解决WeTextProcessing 的部分badcase
2025-05-23 10:07:48 +08:00
index-tts
9d8a6a65fa
Merge pull request #169 from yrom/fix-text-tokenizer
Fixes issues of text normalizer
2025-05-21 15:31:31 +08:00
Yrom
7d943b362d
fix: update PINYIN_TONE_PATTERN and NormalizerZh 2025-05-21 13:01:25 +08:00
index-tts
d26310d7cf
Merge pull request #161 from yrom/fix-tokenizer
fix 分句逻辑,增加测试用例
2025-05-19 08:28:01 +08:00
yrom
3553a3755d fix 分句逻辑,增加测试用例
不包含。!?的句子没有被正确分句
2025-05-18 22:53:51 +08:00
index-tts
c0c17fe387
Merge pull request #160 from yrom/fix-infer
适配1.5版本模型,优化Webui,适配新版本transformers
2025-05-18 22:14:32 +08:00
yrom
c178198ed7 padding_test.py support model dir for test 2025-05-18 19:57:11 +08:00
yrom
76e7645a8d 更新WebUI,添加模型目录检查和必要文件验证
- 新增示例
- 新增模型版本提示
- 新增生成参数设置
- 新增分句预览
2025-05-18 19:41:06 +08:00
yrom
60a2238eac sample_kwargs 替换为 generation_kwargs 2025-05-18 19:34:41 +08:00
yrom
96d3b75708 添加警告提示:生成停止因超出 max_mel_tokens 限制 2025-05-18 19:34:41 +08:00
yrom
7e52976bd1 删除测试代码 2025-05-18 19:34:41 +08:00
yrom
1b7529cacd 适配新版本transformers 2025-05-18 19:34:41 +08:00
yrom
22eeb7625f 修正attention mask和positional embeddings
- 将之前只有text右侧填充改为cond+text 整体左侧填充
- 添加填充测试用例
2025-05-18 19:34:32 +08:00
yrom
a50cb8c287 优化文本掩码填充逻辑,改进句子桶化处理 2025-05-17 20:59:07 +08:00
yrom
4de7611bda fix 批量推理1.5版本模型问题,调整分句逻辑和参数设置
- 将pad 改为全 eos token
- 优化bucket_sentences 算法
2025-05-17 14:40:01 +08:00
yrom
8f7c1f3e93 优化inference attention mask 2025-05-17 14:38:01 +08:00
yrom
cb6c73d391 优化文本归一化和分句逻辑
修复可能的递归问题 (Fixes #124)
2025-05-17 11:16:54 +08:00
index-tts
9c584159c9
Update README.md 2025-05-14 17:07:05 +08:00
root
e31de1e119 release IndexTTS-1.5 2025-05-14 11:15:20 +08:00
index-tts
141599f04d
Merge pull request #122 from yrom/feat/opt-text-tokenizer
Introduce a new `TextTokenizer` class to enhance text normalization and tokenization
2025-04-25 11:42:03 +08:00
yrom
d3bd7eb8b2 Fix split_sentences_by_token 2025-04-24 23:58:16 +08:00
Yrom
475fb12574
Fix pinyin correction 2025-04-24 20:38:52 +08:00
Yrom
35b6514ee5
Enhance text normalization and tokenization
- Introduced `de_tokenized_by_CJK_char` for restoring original text from tokenized format.
- Added `TextTokenizer` class for improved tokenization, including sentence splitting and handling of special tokens.
- Enhanced `TextNormalizer` to handle names and pinyin tones with placeholder mechanisms.
- Added regression tests for new features in `regression_test.py`.
2025-04-24 20:28:44 +08:00
Yrom
dd2b7dd820
Fix autocast device type for compatibility 2025-04-24 11:00:49 +08:00
Yrom
bfd787eaa6
Update setup.py 2025-04-24 10:59:09 +08:00
index-tts
f9e464a35e
Merge pull request #107 from juntaosun/main
单句推理:RTF 性能提速小优化 ~
2025-04-21 11:33:01 +08:00
sunnyboxs
3fc7b31e10 单句推理:RTF性能至少提升 10% 2025-04-20 14:12:38 +08:00
kemuriririn
a26894de71
+回归测试脚本 (#103)
* deepspeed无法使用时回退到通常路径

* ninja支持中文路径编译补丁:BigVGAN fused cuda kernel

* 缓存参考音频的Mel

* ninja支持中文路径编译方案2:BigVGAN fused cuda kernel

* 增加批次推理:长句实现至少 2~10 倍以上的速度提升~

* fix上层目录为空时报错

* 批次推理:重要修复(漏句/丢句/音频空白)

* 批次推理:新增数据分桶机制,增强稳定性~

* +回归测试脚本

* update 回归测试脚本

* fix merge出错

---------

Co-authored-by: kemuriririn <10inspiral@gmail.com>
Co-authored-by: sunnyboxs <sjt2000@qq.com>
2025-04-18 18:09:13 +08:00
sunnyboxs
71c5295198
批次推理:修复(漏句/丢句/音频空白) (#100)
* 批次推理:重要修复(漏句/丢句/音频空白)

* 批次推理:新增数据分桶机制,增强稳定性~
2025-04-18 17:57:07 +08:00
jc-shu
919062dfb0
Update README.md
update readme
2025-04-17 19:22:37 +08:00