WIP: Add shallow fusion to C API #149

csukuangfj · 2023-05-11T04:53:02Z

Integrate changes from #147

TODOs

Fix iOS demo
Fix .Net demo
Fix Python APIs

kamirdin · 2023-10-07T06:07:32Z

Hi, I'm using OnlineLMConfig by Python APIs, but it seams that it didn't work by set 'rnn lm onnx path' to 'model'，may I ask what should I do to using shallow fusion by Python APIs？

csukuangfj · 2023-10-07T06:26:55Z

but it seams that it didn't work by set 'rnn lm onnx path' to 'model'

How do you tell it does not work?

kamirdin · 2023-10-07T06:38:40Z

How do you tell it does not work?

By decoding a test set which has 1k samples, say 10k characters, but got exactly the same result compared to not using LM.

kamirdin · 2023-10-07T06:48:01Z

Before this, I obtained better results by using Python LM decode scripts from Icefall. This was with the same test set, ASR, and LM model. So, it was expected that better results would be achieved, even if only a few characters were changed.

csukuangfj · 2023-10-07T06:48:44Z

haracters, but got exactly the same result compared to not using LM.

How many lm scales have you tried?

kamirdin · 2023-10-07T07:07:51Z

python code ：

lm_config = OnlineLMConfig(
    model=lm,
    scale=scale,
)

print(lm_config)
print("="*30)

recognizer_config = OnlineRecognizerConfig(
    feat_config=feat_config,
    model_config=model_config,
    lm_config=lm_config,
    endpoint_config=endpoint_config,
    enable_endpoint=enable_endpoint_detection,
    decoding_method=decoding_method,
    max_active_paths=max_active_paths,
    context_score=context_score,
)
print(recognizer_config)

and than print out this:

OnlineLMConfig(model="base/with-state-epoch-21-avg-2.onnx", scale=1.1)
==============================
OnlineRecognizerConfig(feat_config=FeatureExtractorConfig(sampling_rate=16000, feature_dim=80), model_config=OnlineTransducerModelConfig(encoder_filename="./asr_model_chunk320/encoder.onnx", decoder_filename="./asr_model_chunk320/decoder.onnx", joiner_filename="./asr_model_chunk320/joiner.onnx", tokens="./asr_model_chunk320/tokens.txt", num_threads=8, provider="cpu", model_type="", debug=False), lm_config=OnlineLMConfig(model="", scale=0.5), endpoint_config=EndpointConfig(rule1=EndpointRule(must_contain_nonsilence=False, min_trailing_silence=2.4, min_utterance_length=0), rule2=EndpointRule(must_contain_nonsilence=True, min_trailing_silence=1.2, min_utterance_length=0), rule3=EndpointRule(must_contain_nonsilence=False, min_trailing_silence=0, min_utterance_length=20)), enable_endpoint=False, max_active_paths=4, context_score=1.5, decoding_method="modified_beam_search")

added .def_readwrite("lm_config", &PyClass::lm_config) at here:

sherpa-onnx/sherpa-onnx/python/csrc/online-recognizer.cc

Line 39 in 36017d4

.def_readwrite("model_config", &PyClass::model_config)

and rebuild, still print lm_config=OnlineLMConfig(model="", scale=0.5), of recognizer_config

kamirdin · 2023-10-07T08:12:09Z

by adding lm_config(lm_config), at
https://github.com/k2-fsa/sherpa-onnx/blob/36017d49c4f0b2f2f87feeeb0a40e54be4487b76/sherpa-onnx/csrc/online-recognizer.h#L97C16-L97C16
It appears to be functioning, but it is encountering more missing errors compared to the Python script. It seems like there is still some work to be done on it. In any case, thank you for your assistance!

csukuangfj · 2023-10-07T08:24:02Z

https://github.com/k2-fsa/sherpa-onnx/blob/36017d49c4f0b2f2f87feeeb0a40e54be4487b76/sherpa-onnx/csrc/online-recognizer.h#L97C16-L97C16

Thank you for identifying the bug. Would you mind creating a PR to fix it?

kamirdin · 2023-10-07T08:36:01Z

Sure, I will create PR after more test finish

rkjaran · 2023-12-21T10:51:41Z

I'd like to use the online LM with the C API. What's the status on this?

WIP: Add shallow fusion to C API

54f50d5

csukuangfj mentioned this pull request Oct 8, 2023

Fix typos/bugs #351

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Add shallow fusion to C API #149

WIP: Add shallow fusion to C API #149

csukuangfj commented May 11, 2023 •

edited

kamirdin commented Oct 7, 2023

csukuangfj commented Oct 7, 2023

kamirdin commented Oct 7, 2023

kamirdin commented Oct 7, 2023

csukuangfj commented Oct 7, 2023

kamirdin commented Oct 7, 2023

kamirdin commented Oct 7, 2023

csukuangfj commented Oct 7, 2023

kamirdin commented Oct 7, 2023

rkjaran commented Dec 21, 2023

WIP: Add shallow fusion to C API #149

Are you sure you want to change the base?

WIP: Add shallow fusion to C API #149

Conversation

csukuangfj commented May 11, 2023 • edited

kamirdin commented Oct 7, 2023

csukuangfj commented Oct 7, 2023

kamirdin commented Oct 7, 2023

kamirdin commented Oct 7, 2023

csukuangfj commented Oct 7, 2023

kamirdin commented Oct 7, 2023

kamirdin commented Oct 7, 2023

csukuangfj commented Oct 7, 2023

kamirdin commented Oct 7, 2023

rkjaran commented Dec 21, 2023

csukuangfj commented May 11, 2023 •

edited