You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
dynamic pruning and substitution restrictions (#60) * massive speed improvement, with some WER improvements too in large complicated files with lots of deletions * adding a flexible beam, adjustable via command line parameters * adding --strict-punctuation mode, that will allow punctuation marks to be substituted only within themselves * adding support for strict punctuation and favouring same words in alignments * fixing the default value of the beam to 50 * adding new test case file * adding new result for std composition * fixes, and new test cases updates * bumping version and adding release notes to the readme
Allow flexible forwarding of NLP columns to SBS (#56) * initial attempt * refactor token properties inside stitching * read from CLI * fix test * handle empty confidence * set confidence when present * use pinned kaldi dockerhub image * update version
Bug: WER sidecar info not appearing in SBS (#55) * add test * add test * fix * Add and use wer tag data structure * fix test * Remove debug log * remove unigram and bigram info from sbs output * fix log json missing unigram bigram info if output sbs not set * version bump
Nerd-1422: Add flag for reading punctuation from nlp as separate toke… …ns (#10) * Nerd-1422: Add flag for reading punctuation from nlp as separate tokens * test * version file --------- Co-authored-by: Nishchal Bhandari <nishchal2050@gmail.com>