SenseVoiceSmall: `text` and `words` in the returned result are inconsistent (#2822)

1. Code:

```python
# funasr==1.3.1
from funasr import AutoModel
from funasr.utils.postprocess_utils import rich_transcription_postprocess

model_dir = "iic/SenseVoiceSmall"


model = AutoModel(
    model=model_dir,
    trust_remote_code=True,
    remote_code="./model.py",
    vad_model="fsmn-vad",
    vad_kwargs={"max_single_segment_time": 30000},
    device="cuda:0",
)

res = model.generate(
    input="/content/1772091310572.mp3.opus",
    cache={},
    language="auto",  # "zh", "en", "yue", "ja", "ko", "nospeech"
    use_itn=True,
    batch_size_s=60,
    merge_vad=True,
    merge_length_s=15,
    output_timestamp=True
)
print(res)
```

2. Audio
[audio.zip](https://github.com/user-attachments/files/25591263/audio.zip)

3. Output

<img width="1522" height="679" alt="Image" src="https://github.com/user-attachments/assets/f89d8da3-6391-487c-9a6b-7590b946d2b9" />
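
To show the mismatch as text rather than only in a screenshot, a small check along the following lines may help. This is a minimal sketch under assumptions, not part of funasr: it assumes each element of `res` is a dict with a `text` string and a `words` list of strings, which is not confirmed here, and `normalize` and `compare_text_and_words` are hypothetical helper names. It removes `<|...|>` tags, whitespace, and punctuation from both sides before comparing.

```python
import re
import unicodedata


def normalize(s: str) -> str:
    """Drop <|...|> tags, whitespace, and punctuation; lowercase the rest."""
    s = re.sub(r"<\|[^|]*\|>", "", s)
    return "".join(
        ch.lower()
        for ch in s
        if not ch.isspace() and not unicodedata.category(ch).startswith("P")
    )


def compare_text_and_words(item: dict) -> dict:
    """Compare the normalized `text` with the concatenated, normalized `words`.

    Assumption: `item` looks like {"text": str, "words": list[str]}.
    """
    text_norm = normalize(item.get("text", ""))
    words_norm = normalize("".join(item.get("words", [])))
    return {"match": text_norm == words_norm, "text": text_norm, "words": words_norm}


# Synthetic data for illustration only, not real model output.
sample = {"text": "<|zh|><|NEUTRAL|>你好，世界。", "words": ["你", "好", "世"]}
report = compare_text_and_words(sample)
print(report)
```

With real output this would be called as `compare_text_and_words(res[0])`. Since the script sets `use_itn=True`, some differences (for example, numerals rewritten by inverse text normalization) may be expected rather than a bug, so rerunning with `use_itn=False` could help narrow down the cause.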
