000780776 000__ 05277cam\a2200505Ii\4500
000780776 001__ 780776
000780776 005__ 20230306143158.0
000780776 006__ m\\\\\o\\d\\\\\\\\
000780776 007__ cr\nn\nnnunnun
000780776 008__ 170411s2017\\\\si\\\\\\ob\\\\000\0\eng\d
000780776 020__ $$a9789811037344$$q(electronic book)
000780776 020__ $$a9811037345$$q(electronic book)
000780776 020__ $$z9789811037337
000780776 035__ $$aSP(OCoLC)ocn982121294
000780776 035__ $$aSP(OCoLC)982121294
000780776 040__ $$aN$T$$beng$$erda$$epn$$cN$T$$dN$T$$dGW5XE$$dEBLCP$$dOCLCF$$dYDX$$dUAB
000780776 049__ $$aISEA
000780776 050_4 $$aTK7882.S65
000780776 08204 $$a006.4/54$$223
000780776 1001_ $$aHinterleitner, Florian.
000780776 24510 $$aQuality of synthetic speech :$$bperceptual dimensions, influencing factors, and instrumental assessment /$$cFlorian Hinterleitner.
000780776 264_1 $$aSingapore :$$bSpringer,$$c[2017]
000780776 300__ $$a1 online resource.
000780776 336__ $$atext$$btxt$$2rdacontent
000780776 337__ $$acomputer$$bc$$2rdamedia
000780776 338__ $$aonline resource$$bcr$$2rdacarrier
000780776 4901_ $$aT-labs series in telecommunication services
000780776 504__ $$aIncludes bibliographical references.
000780776 5050_ $$aAcknowledgements; Contents; Acronyms; Abstract; 1 Introduction; 1.1 Motivation; 1.2 Outline; References; 2 Speech Synthesis; 2.1 Setup of a Speech Synthesizer; 2.1.1 Natural Language Processing (NLP); 2.1.2 Prosody Generation; 2.1.3 Concatenation and Generation of Speech-Signal Parameters; 2.1.4 Speech Signal Generation; 2.2 The Mary Text-to-Speech System (MaryTTS); References; 3 Auditory and Instrumental Quality Evaluation Metrics; 3.1 What Is Perceptual Quality?; 3.2 Taxonomy for the Quality Assessment of Synthetic Speech; 3.2.1 Glass Box Versus Black Box
000780776 5058_ $$a3.2.2 Laboratory Versus Field Studies; 3.2.3 Linguistic Versus Acoustic; 3.2.4 Auditory Versus Instrumental; 3.3 Auditory Quality Evaluation Metrics; 3.3.1 Functional Tests (the content of this section has previously been published in a slightly different version in [6]); 3.3.2 Judgment Tests (parts of the content of this section have previously been published in a slightly different version in [13] and [6]); 3.4 Instrumental Quality Evaluation Metrics; 3.4.1 Reference-Based Measures (parts of the content of this section have previously been published in a slightly different version in [21])
000780776 5058_ $$a3.4.2 Reference-Free Measures; References; 4 Perceptual Quality Dimensions; 4.1 State-of-the-Art Perceptual Quality Dimensions (parts of the content of this section have previously been published in a slightly different version in [1]); 4.1.1 Study: Kraft and Portele (Kraft1995); 4.1.2 Study: Mayo et al. I (Mayo2005); 4.1.3 Study: Viswanathan and Viswanathan (Vis2005); 4.1.4 Study: Seget (Seget2007); 4.1.5 Study: Hinterleitner (Hint2010); 4.1.6 Study: Mayo et al. II (Mayo2011); 4.1.7 Restrictions of Discussed Studies
000780776 5058_ $$a4.2 Semantic Differential and Factor Analysis (parts of the content of this section have previously been published in a slightly different version in [13]); 4.2.1 Experimental Setup; 4.2.2 Statistical Analysis; 4.3 Sorting Task and Multidimensional Scaling (parts of the content of this section have previously been published in a slightly different version in [16]); 4.3.1 Experimental Setup; 4.3.2 Statistical Analysis; 4.4 Summary of the SD/FA and ST/MDS Studies (parts of the content of this section have previously been published in a slightly different version in [16])
000780776 5058_ $$a4.5 Universal Perceptual Quality Dimensions; 4.5.1 Naturalness of Voice; 4.5.2 Prosodic Quality; 4.5.3 Fluency and Intelligibility; 4.5.4 Absence of Disturbances; 4.5.5 Calmness; 4.5.6 Instructions for TTS Quality Assessment; 4.6 Summary; References; 5 Influencing Factors on Perceptual Quality; 5.1 Influence of the Application (parts of the content of this section have previously been published in a slightly different version in [1]); 5.1.1 Pretest; 5.1.2 Main Test (the content of this section has previously been published in a slightly different version in [10]); 5.1.3 Conclusions
000780776 506__ $$aAccess limited to authorized users.
000780776 520__ $$aThis book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient identification of those dimensions in a listening test is introduced. Furthermore, several factors influencing these dimensions are examined. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed and tested. Finally, the requirements for the integration of an instrumental quality measure into a concatenative TTS system are examined.
000780776 588__ $$aVendor-supplied metadata.
000780776 650_0 $$aSpeech synthesis.
000780776 650_0 $$aSpeech processing systems.
000780776 650_0 $$aText-to-speech software.
000780776 650_0 $$aTelecommunication.
000780776 830_0 $$aT-labs series in telecommunication services.
000780776 852__ $$bebk
000780776 85640 $$3SpringerLink$$uhttps://univsouthin.idm.oclc.org/login?url=http://link.springer.com/10.1007/978-981-10-3734-4$$zOnline Access$$91397441.1
000780776 909CO $$ooai:library.usi.edu:780776$$pGLOBAL_SET
000780776 980__ $$aEBOOK
000780776 980__ $$aBIB
000780776 982__ $$aEbook
000780776 983__ $$aOnline
000780776 994__ $$a92$$bISE