Making the TTS

Time:10-19

I want to build TTS software. Some people online say you record every Chinese character into its own WAV file, and then...
What comes after that is vague to me. Does anyone know the concrete process for implementing TTS, and what work is involved?
People online say the workload is huge. I'm not afraid of that. Right now I just hope someone can point me in the right direction so I don't go around in circles. Any help would be greatly appreciated!

CodePudding user response:

Wow, impressive.
Building a TTS that merely produces sound is probably not complicated. Building one that sounds acceptable is not easy.
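
The naive approach from the question (one recording per character, joined end to end) can be sketched roughly as below. This is only an illustration using Python's standard `wave` module. The helper names `make_silence` and `concat_wavs` are made up for this example, and the "recordings" are silent placeholders rather than real voice data. It gives the "produces sound" level of TTS, with no prosody at all.

```python
import io
import wave


def make_silence(n_frames, rate=16000):
    """Stand-in for a real recording: n_frames of 16-bit mono silence as WAV bytes."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)
        w.setframerate(rate)
        w.writeframes(b"\x00\x00" * n_frames)
    return buf.getvalue()


def concat_wavs(wav_blobs):
    """Join WAV recordings end to end. All inputs must share one audio format."""
    clips = []
    fmt = None
    for blob in wav_blobs:
        with wave.open(io.BytesIO(blob), "rb") as r:
            this_fmt = (r.getnchannels(), r.getsampwidth(), r.getframerate())
            if fmt is None:
                fmt = this_fmt
            elif this_fmt != fmt:
                raise ValueError("recordings use different audio formats")
            clips.append(r.readframes(r.getnframes()))
    if fmt is None:
        raise ValueError("no recordings given")
    out = io.BytesIO()
    with wave.open(out, "wb") as w:
        w.setnchannels(fmt[0])
        w.setsampwidth(fmt[1])
        w.setframerate(fmt[2])
        w.writeframes(b"".join(clips))
    return out.getvalue()


if __name__ == "__main__":
    # Hypothetical per-character library; real entries would be recorded speech.
    library = {"你": make_silence(100), "好": make_silence(200)}
    speech = concat_wavs([library[ch] for ch in "你好"])
    print(len(speech), "bytes of WAV data")
```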

CodePudding user response:

iFlytek and Jietong Huasheng are both working on TTS, so you could consult them. But it isn't as simple as you might think: building a TTS product is fairly complex.

CodePudding user response:

Quoting liuhengwinner's reply (3rd floor):
iFlytek and Jietong Huasheng are both working on TTS, so you could consult them. But it isn't as simple as you might think: building a TTS product is fairly complex.

I'd like to consult them, but the catch is that they charge for it.
I want to implement something simple first and then improve it gradually.

CodePudding user response:

I just found something useful:

An introduction to Jietong Huasheng's Chinese speech synthesis (TTS) technology and solutions
Source: CTI BBS. Retrieved: 2002-09-01 00:00:00


I. Introduction to Jietong Huasheng TTS (jTTS)

1. Basic principles of text-to-speech conversion in Jietong Huasheng TTS

The basic structure of text-to-speech conversion in Jietong Huasheng TTS consists of three parts:

Linguistic processing

Linguistic processing plays an important role in a text-to-speech system. It mainly simulates how people understand natural language: text normalization, word segmentation, syntactic analysis, and semantic analysis. The goal is for the computer to fully understand the input text and to supply the pronunciation cues that the following two parts need.

Prosody processing

The purpose of prosody processing is to plan the prosodic features of each segment of the synthetic speech, such as pitch, intensity, and duration, so that the synthetic speech expresses the meaning correctly and sounds more natural.

Acoustic processing

The main function of acoustic processing is to output speech, that is, the synthetic speech, according to the results of the previous two parts.
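
To make the three stages above more concrete, here is a toy pipeline in Python. Everything in it is an assumption made for illustration, not how jTTS works: the `Unit` class, the function names, whitespace splitting as a stand-in for real word segmentation, fixed pitch and duration values, and a sine tone in place of recorded speech. It only shows how the output of each stage feeds the next.

```python
import math
from dataclasses import dataclass


@dataclass
class Unit:
    text: str
    pitch_hz: float = 0.0
    duration_s: float = 0.0


def linguistic_processing(text):
    """Toy stand-in: split on whitespace instead of real segmentation and parsing."""
    return [Unit(token) for token in text.split()]


def prosody_planning(units):
    """Assign pitch and duration per unit; lower and lengthen the final unit."""
    for unit in units:
        unit.pitch_hz = 220.0
        unit.duration_s = 0.2
    if units:
        units[-1].pitch_hz = 180.0
        units[-1].duration_s = 0.3
    return units


def acoustic_processing(units, rate=8000):
    """Render each unit as a sine tone; a real system would output speech here."""
    samples = []
    for unit in units:
        n = int(round(unit.duration_s * rate))
        samples.extend(
            math.sin(2 * math.pi * unit.pitch_hz * t / rate) for t in range(n)
        )
    return samples


def synthesize(text, rate=8000):
    return acoustic_processing(prosody_planning(linguistic_processing(text)), rate)


if __name__ == "__main__":
    audio = synthesize("ni hao shijie")
    print(len(audio), "samples")
```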

2. Features of Jietong Huasheng TTS

Jietong Huasheng TTS (jTTS) is a TTS technology with independently owned copyright. It is based on a large-scale library of real recordings and uses waveform concatenation of variable-length speech units. Together with a voice-library compression algorithm and a timbre conversion algorithm, these form the core technology. Pronunciation is fluent and clear, with a high degree of naturalness.

In Jietong Huasheng TTS, converting text to speech is not a simple mapping from text to sound. It also includes understanding the text and processing the prosody of the speech.

By studying the acoustic characteristics of Chinese tone, stress, and intonation, Jietong Huasheng designed its simulation of tone, stress, and intonation.

For prosodic rules, Jietong Huasheng TTS combines statistics with rules: it summarizes the rules of Chinese prosody, simulates certain prosodic patterns, and uses hierarchical prosody matching as the basic principle for selecting segments.

Jietong Huasheng TTS supports mixed Chinese and English reading. Common English words embedded in Chinese text are spoken in a voice consistent with the Chinese, which sounds more natural and fluent.

The voice library size of Jietong Huasheng TTS is adjustable, so it can provide a full range of TTS solutions from PCs with sound cards down to PDAs. The voice library and program together can be compressed to around 1.5 MB, which makes use in embedded systems possible.

A Jietong Huasheng TTS development kit (jTTS SDK) is provided. It can synthesize to the sound card, synthesize to a file, or give direct access to the voice stream. It supports multithreaded operation and reading text in the GBK and BIG5 character sets. The adjustable voice library size is the key feature that allows Jietong Huasheng TTS to be ported to embedded devices such as PDAs, and it is a unique advantage of Jietong Huasheng TTS.

Multiple operating systems are supported, such as Windows NT/2000/XP and various embedded Linux systems.

In July 2001, Jietong Huasheng completed the latest version of its TTS core. The new version's library is based on large-scale real recordings. The voice library provides more samples, so there is a wider range of units to choose from and a better-matching pronunciation can be selected. In addition, unit selection and concatenation can go beyond the syllable level to the word or phrase level. As a result, the new Jietong Huasheng TTS improves considerably in naturalness and intelligibility. Combined with English words spoken in a voice consistent with the Chinese and support for pronouncing all characters in the GBK character set, the synthetic speech of the new technology comes close to natural speech.
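
The idea of concatenating a whole word or phrase when the library has one, and falling back to single syllables otherwise, can be illustrated with a greedy longest-match lookup. This is a simplification chosen for the example: the text above says jTTS selects segments by hierarchical prosody matching, which this sketch does not attempt. The function name `select_units` and the tiny inventory are hypothetical.

```python
def select_units(text, inventory):
    """Cover text with recorded units, preferring the longest unit at each position."""
    max_len = max(map(len, inventory), default=1)
    units = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink toward a single character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if candidate in inventory:
                units.append(candidate)
                i += size
                break
        else:
            raise KeyError(f"no recorded unit covers {text[i]!r}")
    return units


if __name__ == "__main__":
    # Hypothetical inventory: single characters plus one word-level recording.
    inventory = {"你", "好", "世", "界", "世界"}
    print(select_units("你好世界", inventory))
```

With more word-level and phrase-level recordings in the inventory, fewer joins are needed, which is one reason a larger library tends to sound more natural.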

3. Directions for further development of Jietong Huasheng TTS

Jietong Huasheng TTS will develop in the following directions:

Improve the quality of speech synthesis to reach a more fluent and natural level.

Further study timbre conversion so that the TTS technology can output a variety of voices (different genders, different ages, and different pronunciation characteristics).

Provide TTS core technology and solutions to various industries, especially the CTI industry and embedded systems.

Port the TTS technology to other operating systems such as Unix, and to other embedded operating systems such as Palm OS and HOPEN. Consider hardware implementations of the TTS technology.

Combine TTS with other new technologies and extend its application more widely. Jietong Huasheng TTS technology can be used in many areas of computing and communications. Jietong Huasheng will strive to become a supplier of core TTS technology and to promote the application of its TTS technology in the fields described below.

II. Applications of Jietong Huasheng TTS technology (jTTS) in the CTI field

For the CTI industry, Jietong Huasheng TTS technology provides three solutions: local interface calls, a voice server solution, and an offline synthesis solution.

1. Jietong Huasheng TTS local interface call solution (jTTS SDK)

Solution overview:

The local interface call solution provides the Jietong Huasheng TTS development kit (jTTS SDK), which lets users add speech synthesis to the systems they develop. The jTTS SDK is a development kit for the Win32 platform (32-bit Windows environments, including Windows 95/98/2000/NT). This solution therefore requires that the user's voice service system run in a 32-bit Windows server environment and that part of the system's programs be modified. It suits users who have integration and secondary development capability.

[Figure: architecture of the local interface call solution]

2. Jietong Huasheng TTS voice server solution (jTTS Service)

Solution overview:

The voice server solution directly provides a high-performance server loaded with the Jietong Huasheng TTS voice service system. It runs in parallel with the existing voice service, accepts instructions and text data streams from it, and returns the synthesized speech data stream to the original system. It is suitable for renovating and upgrading telecom systems, CTI systems, and large enterprise call centers.

[Figure: architecture of the voice server solution]

Technical advantages:

1. Lowers the configuration requirements for client machines.

2. Provides the speech synthesis service across operating systems over TCP/IP, meeting the needs of Win32 and other platforms.

3. For large-scale systems, voice servers can work in a distributed fashion, automatically scheduling and load-balancing client requests to achieve higher performance.
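
The text-in, audio-out exchange behind a voice server can be sketched with plain sockets. The wire format here (a 4-byte big-endian length followed by the payload) is an assumption invented for this example and is not the jTTS Service protocol, which the article does not describe. `fake_synthesize` is a placeholder that returns silence.

```python
import socket
import struct
import threading


def recv_exact(sock, n):
    """Read exactly n bytes or raise if the peer closes early."""
    buf = b""
    while len(buf) < n:
        chunk = sock.recv(n - len(buf))
        if not chunk:
            raise ConnectionError("peer closed the connection early")
        buf += chunk
    return buf


def send_frame(sock, payload):
    """Send one length-prefixed frame (assumed framing, see lead-in)."""
    sock.sendall(struct.pack("!I", len(payload)) + payload)


def recv_frame(sock):
    (size,) = struct.unpack("!I", recv_exact(sock, 4))
    return recv_exact(sock, size)


def fake_synthesize(text):
    """Placeholder engine: 100 bytes of silence per character."""
    return b"\x00" * (100 * len(text))


def serve_one(conn):
    """Server side: read one text request, reply with synthesized audio."""
    text = recv_frame(conn).decode("utf-8")
    send_frame(conn, fake_synthesize(text))


def request_speech(conn, text):
    """Client side: send text, wait for the audio stream."""
    send_frame(conn, text.encode("utf-8"))
    return recv_frame(conn)


if __name__ == "__main__":
    # A connected socket pair stands in for a real TCP connection.
    client, server = socket.socketpair()
    worker = threading.Thread(target=serve_one, args=(server,))
    worker.start()
    audio = request_speech(client, "你好")
    worker.join()
    client.close()
    server.close()
    print(len(audio), "bytes of audio")
```

In a real deployment the client socket would come from `socket.create_connection` and the server socket from `accept`. Because the client only sends text and receives bytes, it needs no synthesis engine of its own, which is where the lower client requirements and cross-platform support come from.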

Areas of application:

Without changing the existing service system, a Jietong Huasheng TTS voice server can be set up in parallel to extend the original system with text-to-speech services safely and efficiently. It is a large-scale, professional voice service solution that works across platforms. The Jietong Huasheng TTS voice server solution can serve 160/168 call centers, UMS unified messaging systems, call centers, voice mail, WAP websites, online broadcasting websites, and so on.

3. Jietong Huasheng TTS offline synthesis solution (jTTS Builder)

Solution overview:

The Jietong Huasheng TTS offline synthesis solution provides a synthesis tool, a standalone application for Microsoft Windows NT/2000/95/98. It offers offline speech synthesis and batch-converts text into speech data files. It suits voice services whose information volume is relatively stable or that do not require real-time conversion. It can directly replace the traditional recording method with no modification to the original system, saving manpower and improving efficiency.
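
Offline batch conversion amounts to walking a folder of text files and writing one audio file per input. The sketch below shows that loop with Python's standard library. `fake_synthesize` is a placeholder that returns silence, and `batch_convert` and the `.txt` to `.wav` naming are assumptions for illustration, not how jTTS Builder behaves.

```python
import wave
from pathlib import Path


def fake_synthesize(text, rate=16000):
    """Placeholder engine: 0.05 s of 16-bit mono silence per character."""
    return b"\x00\x00" * ((rate // 20) * len(text))


def batch_convert(src_dir, dst_dir, rate=16000):
    """Convert every .txt file in src_dir into a .wav file in dst_dir."""
    src, dst = Path(src_dir), Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    written = []
    for txt in sorted(src.glob("*.txt")):
        text = txt.read_text(encoding="utf-8").strip()
        out = dst / (txt.stem + ".wav")
        with wave.open(str(out), "wb") as w:
            w.setnchannels(1)
            w.setsampwidth(2)
            w.setframerate(rate)
            w.writeframes(fake_synthesize(text, rate))
        written.append(out)
    return written


if __name__ == "__main__":
    # Hypothetical folder names; adjust to your own layout.
    for path in batch_convert("prompts", "prompts_wav"):
        print("wrote", path)
```

The existing playback system can then play the generated files exactly as it played studio recordings, which is why the original system needs no changes.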


CodePudding user response:

I did this for my graduation project.
But I used the system's built-in engine with Chinese and English voice libraries downloaded online. It could read text aloud, but because the libraries were poor the result was not very satisfying. So one direction is to focus on building the voice library.

Another direction is to use prosody synthesis technology. I have no experience with that, so I won't say more, ha ha.

CodePudding user response:

I have personally worked on this. If you are interested in business cooperation, you can contact me.

QQ: 79627128

CodePudding user response:

I'm working on this using MFC plus a TTS interface. It builds and debugs without errors, but it produces no sound.

CodePudding user response:

iFlytek is too expensive. Is there a cracked version? Does anyone have instructions on how to call the service?

CodePudding user response:

Impressive. I just want to call someone else's speech engine.