Find Jobs
Hire Freelancers

Embedded Offline ASR Engine Development For English and Chinese Languages

$250-750 USD

В роботі
Опублікований about 4 years ago

$250-750 USD

Оплачується при отриманні
This project is to use Python and Mozilla’s DeepSpeech ASR (automatic speech recognition) engine on different platforms (such as Raspberry Pi 4 - 1 GB, Nvidia Jetson Nano, Windows PC, and Linux PC, Samsung Galaxy A50, Huawei P20) in order to develop a refined ASR engine for English and Chinese (Mandarin) languages with following functionalities, with source code, instruction, data and an API documentation delivered after development complete. The architecture of Deep Speech is an end-to-end trainable, character-level, deep recurrent neural network (RNN). It is a deep neural network with recurrent layers that gets audio features as input and outputs characters directly — the transcription of the audio, and uses LSTM (Long short-term memory) cells instead of GRU (gated recurrent unit) cells. This project targets <6% of Word Error Rate, and especially for key phrases and key words to <3% WER, close to human level performance for English and Chinese languages. 1. Use the latest Mozilla’s DeepSpeech ASR engine which comes with .tflite model (TensorFlow Lite), faster than real-time on a single core of a Raspberry Pi 4, and able to make our own audio transcription application with hot word detection function. 2. Generate Confidence Scores of high accuracy level. Be able to retrieve result confidence scores for each English word and for each English sentence in a transcription of audio into text (confidence score for both word and sentence level), and they should be highly reliable. 3. Support Keyword Spotting modes that can recognize in a continuous stream. User can configure a list of key phrases to search for and specify the detection threshold for each of them. This mode should reliably work in continuous speech stream and can be used for keyword activation. Equivalent to pocketsphinx -kws and –keyphrase options (The methods are ps_set_keyphrase and ps_set_kws). 4. Recognize accurately at least a thousand of commands and controls that user can define in a simple text editor in keyword spotting mode or keyword activation mode. (same as No. 3 requirement, with 3% WER) 5. Mozilla’s DeepSpeech ASR word error rate on LibriSpeech’s test-clean set is 6.5%, which we target to improve to 6% by this project, and for key phrases and key words to <3% WER, close to human level performance. 6. Use automatic phoneme alignment, VAD and other methods to detect the start time, the end time and position for each recognized phoneme, word and sentence, with an output data structure readable at real-time for the complete set of phoneme position information. Forced alignment refers to the process by which orthographic transcriptions are aligned to audio recordings to automatically generate phone level segmentation. As described at the following link:[login to view URL]。 When developing an ASR system, “good initial estimates … are essential” when training Gaussian Mixture Model (GMM) parameters (Rabiner and Juang, 1993, p. 370). Phoneme location information is also critical when building concatenative text-to-speech systems. 7. Implement CTC decoder as an important optimization: integrating appropriate language model into the decoder.
ID проекту: 24813208

Про проект

7 пропозицій(-ї)
Дистанційний проект
Активність 4 yrs ago

Хочете заробити?

Переваги подання заявок на Freelancer

Вкажіть свій бюджет та терміни
Отримайте гроші за свою роботу
Опишіть свою пропозицію
Реєстрація та подання заявок у проекти є безкоштовними
Доручений:
Аватарка користувача
Hello! I'd like to deliver asr engine for mobile platform. I'm familiar with CTC loss, Speech recognition, DeepSpeech, pytorch, tflite. I'll do the job blazingly fast. Please, give me a try!
$1 792 USD за 45 дні(-в)
5,0 (6 відгуки(-ів))
4,6
4,6
7 фрілансерів(-и) готові виконати цю роботу у середньому за $1 335 USD
Аватарка користувача
Hi there, I have read the brief details on the job listing. You can check my experience, customer feed backs and my portfolio here: https://www.freelancer.com/u/AwaisChaudhry?w=f I believe its a doable job I have great experience doing projects with Tensorflow and Python. Please initiate the chat so we could discuss it in detail. Thanks! Awais
$750 USD за 28 дні(-в)
5,0 (5 відгуки(-ів))
5,8
5,8
Аватарка користувача
Alert: I will give you 20% discount on my bid rate also give on my All Services. So grabs this special offer is limited. Let’s get to the point. I am an highly experienced freelancer . I am offering services in Arduino Embedded system Machine Learning, Deep Leaning, Arduino programming , Raspberry pi Android app development, Data Science, Natural Language Processing, Computer Vision, OpenCV, script and utility. I offer original quality services to clients throughout the world. I have deep rich knowledge and skills as I have been in this field in the past five years. Skills: - Programming(Python/PHP/Java/C/C++/C#/HTML/XML/R,JavaScript) - AI(Text Processing/OpenCV/Image Processing/Machine Learning/Data Mining) - Android development - React native development - Big Data(Hadoop ecosystem tools like Hive/Spark/HDFS/...) - AWS / Heroku
$250 USD за 7 дні(-в)
4,6 (11 відгуки(-ів))
4,3
4,3
Аватарка користувача
Hello. My name is Mohsen and thanks for posting the job. I have read all your requirements for 'Embedded Offline ASR Engine Development For English and Chinese Languages' and I fully understood it. I've already done this kind of project before. I am confident and I am sure that I am able to finish this project. Please come in contact with me, so that we can discuss any details via chat:) Skills: Tensorflow, Python
$555 USD за 3 дні(-в)
5,0 (2 відгуки(-ів))
4,1
4,1
Аватарка користувача
Hello! I would love to help you reach your goals on-time and on-budget. I have extensive experience creating web platforms from simple informational sites, to high-performance Single Page Applications for consumers and enterprises. I especially love to work in Python Django and/or React.js. I speak English as my first language and live in the USA. Please review my profile to see my work. Chat with me to discuss more when you are ready. Thank you!
$748 USD за 7 дні(-в)
4,8 (3 відгуки(-ів))
4,2
4,2
Аватарка користувача
Hi, dear Bei I have read your description carefully. Very interesting job but it may take long time, and high cost. Can we contact and discuss over chat? If you can proceed this project in hourly mode, I can work for you. Looking forward to hearing from you. Best regards.
$5 000 USD за 30 дні(-в)
5,0 (5 відгуки(-ів))
3,0
3,0
Аватарка користувача
Hi, I am an Electrical, Electronics and Embedded Engineer. A PCB Designer, Arduino/Raspberry Pi, ESP32, ESP8266 and internet of things expert. I read through the job description very carefully and I am absolutely sure that I can do the project very well. I have worked on similar projects to what you are looking for, and I am confident I can exceed your expectations. I can achieve the results that you are asking for.I can complete your project on time and within your budget. Thanks
$250 USD за 7 дні(-в)
0,0 (0 відгуки(-ів))
0,0
0,0

Про клієнта

Прапор UNITED STATES
Flushing, United States
5,0
3
Спосіб оплати верифіковано
На сайті з січ. 31, 2020

Верифікація клієнта

Дякуємо! Ми надіслали на вашу електронну пошту посилання для отримання безкоштовного кредиту.
Під час надсилання електронного листа сталася помилка. Будь ласка, спробуйте ще раз.
Зареєстрованих користувачів Загальна кількість опублікованих робіт
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Завантажуємо для перегляду
Дозвіл на визначення геолокації надано.
Ваш сеанс входу закінчився, і сеанс було закрито. Будь ласка, увійдіть знову.