* Update Pinyin tone handling in TextNormalizer
* Enhance sentence splitting and improve tokenizer integration in inference
* Update character replacement mappings
test: "在电影《肖申克的救赎》中,安迪·杜佛兰被错误地判处终身监禁..."
* Refactor TextNormalizer and enhance testing with additional cases