Speech and computer :: 23rd international conference, SPECOM 2021, St. Petersburg, Russia, September 27-30, 2021 : proceedings /: Alexey Karpov, Rodmongo Potapova (eds.).

Karpov, Alexey,; Potapova, R. K.,

doi:10.1007/978-3-030-87802-3

Speech and computer : 23rd international conference, SPECOM 2021, St. Petersburg, Russia, September 27-30, 2021 : proceedings / Alexey Karpov, Rodmongo Potapova (eds.).

International Conference Speech and Computer (23rd : 2021 : St. Petersburg, Russia ; Online); Karpov, Alexey, editor.; Potapova, R. K., editor.

2021

QA76.9.N38 I58 2021

Available Online

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Add to Basket

Cite

Linked e-resources

Linked Resource

Online Access

Concurrent users

Unlimited

Authorized users

Document Delivery Supplied

Can lend chapters, not whole ebooks

Details

Title

Speech and computer : 23rd international conference, SPECOM 2021, St. Petersburg, Russia, September 27-30, 2021 : proceedings / Alexey Karpov, Rodmongo Potapova (eds.).

Meeting Name

International Conference Speech and Computer (23rd : 2021 : St. Petersburg, Russia ; Online)

ISBN

9783030878023 (electronic bk.)
3030878023 (electronic bk.)
9783030878016
3030878015

DOI

https://doi.org/10.1007/978-3-030-87802-3

Published

Cham : Springer, [2021]

Copyright

Language

English

Description

1 online resource : illustrations (some color).

Item Number

10.1007/978-3-030-87802-3 doi

Call Number

QA76.9.N38 I58 2021

Dewey Decimal Classification

006.3/5

Summary

This book constitutes the proceedings of the 23rd International Conference on Speech and Computer, SPECOM 2021, held in St. Petersburg, Russia, in September 2021.* The 74 papers presented were carefully reviewed and selected from 163 submissions. The papers present current research in the area of computer speech processing including audio signal processing, automatic speech recognition, speaker recognition, computational paralinguistics, speech synthesis, sign language and multimodal processing, and speech and language resources. *Due to the COVID-19 pandemic, SPECOM 2021 was held as a hybrid event.

Note

International conference proceedings.
Includes author index.

Access Note

Access limited to authorized users.

Source of Description

Online resource; title from PDF title page (SpringerLink, viewed October 1, 2021).

Added Author

Karpov, Alexey, editor.
Potapova, R. K., editor.

Series

Lecture notes in computer science ; 12997.
Lecture notes in computer science. Lecture notes in artificial intelligence.
LNCS sublibrary. SL 7, Artificial intelligence.

Linked Resources

Online Access

Record Appears in

Online Resources > Ebooks
All Resources

Text-Independent Speaker Verification Employing CNN-LSTM-TDNN Hybrid Networks
End-to-End Voice Spoofing Detection Employing Time Delay Neural Networks and Higher Order Statistics
Assessing Velar Gestures Timing in European Portuguese Nasal Vowels with RT-MRI Data
Designing and Deploying an Interaction Modality for Articulatory-Based Audiovisual Speech Synthesis
Kurdish Spoken Dialect Recognition Using X-vector Speaker Embedding
An ASR-based Tutor for Learning to Read: How to Optimize Feedback to First Graders
Velocity Differences Between Velum Raising and Lowering Movements
Pragmatic Markers of Russian Everyday Speech: Invariants in Dialogue and Monologue
Language Adaptation for Speaker Recognition Systems using Contrastive Learning
Evaluating X-vector-based Speaker Anonymization Under White-box Assessment
Improved Prosodic Clustering for Multispeaker and Speaker-Independent Phoneme-Level Prosody Control
Initial Experiments on Question Answering from the Intrinsic Structure of Oral History Archives
Imagined, Intended, and Spoken Speech Envelope Synthesis from Neuromagnetic Signals
What Causes Phonetic Reduction in Russian Speech: New Evidence from Machine Learning Algorithms
Toxic Comment Classification Service in Social Network
Deep Learning based Engagement Recognition in Highly Imbalanced Data
Intraspeaker Variability of a Professional Lecturer: Ageing, Genre, Pragmatics vs. Voice Acting (Case Study)
An Ensemble Approach for the Diagnosis of COVID-19 from Speech and Cough Sounds
Where are We in Semantic Concept Extraction for Spoken Language Understanding?
Learning Mizo Tones from F0 Contours using 1D-CNN
OCR Improvements for Images of Multi-Page Historical Documents
X-Bridge: Image-to-Image Translation with Reconstruction Capabilities
Who is Selling to Whom
Feature Evaluation for Multi-block Classification in Invoice Information Extraction
Multimodal Corpus Analysis of Autoblog 2020: Lecture Videos in Machine Learning
Text and Synthetic Data for Domain Adaptation in End-to-End Speech Recognition
Speaker-invariant Speech-To-Intent Classification for Low-Resource Languages
Speaker-Dependent Visual Command Recognition in Vehicle Cabin: Methodology and Evaluation
Optimised Code-Switched Language Model Data Augmentation in Four Under-Resourced South African Languages
Synthesis Speech based Data Augmentation for Low Resource Children ASR
End-to-End Russian Speech Recognition Models with Multi-Head Attention
Word-level Style Control for Expressive, Non-attentive Speech Synthesis
Perceiving Speech Aggression with and without Textual Context on Twitter Social Network Site
Assessing Speaker Interpolation in Neural Text-to-Speech
A Mobile Application for Detection of Amyotrophic Lateral Sclerosis via Voice Analysis
Child's Emotional Speech Classification by Human across Two Languages: Russian & Tamil
Analysis of Dialogues of Typically Developing Children, Children with Down Syndrome and ASD using Machine Learning Methods
Speaker Adaptation with Continuous Vocoder-based DNN-TTS
Automatic Recognition of the Psychoneurological State of Children: Autism Spectrum Disorders, Down Syndrome, Typical Development
Study on Acoustic Model Personalization in a Context of Collaborative Learning Constrained by Privacy Preservation
USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English
Dialog Speech Sentiment Classification for Imbalanced Datasets
Explicit Control of the Level of Expressiveness in DNN-based Speech Synthesis by Embedding Interpolation
Experimental Analysis of Expert and Quantitative Estimates of Syllable Recordings in the Process of Speech Rehabilitation
Methods for Using Class Based N-gram Language Models in the Kaldi Toolkit
Spectral Root Features for Replay Spoof Detection in Voice Assistants
Influence of the Aggressive Internet Environment on Cognitive Personality Disorders (in Relation to the Russian Young Generation of Users)
Media Content vs Nature Stimuli Influence on Human Brain Activity
Can Your Eyes Tell Us Why You Hesitate? Comparing Reading Aloud in Russian as L1 and Japanese as L2
Recognition of Heavily Accented and Emotional Speech of English and Czech Holocaust Survivors using Various DNN Architectures
Assessing Speaker-Independent Character Information for Acted Voices
Influence of Speaker Pre-training on Character Voice Representation
Opinion Classification via Word and Emoji Embedding Models with LSTM
An Equal Data Setting for Attention-based Encoder-Decoder and HMM/DNN Models: a Case Study in Finnish ASR
Speaker-aware Training of Speech Emotion Classifier with Speaker Recognition
Neural Network Recognition of Russian Noun and Adjective Cases in the Google Books Ngram Corpus
Is it a Filler or a Pause? A Quantitative Analysis of Filled Pauses in Hebrew
Modified Group Delay Function using Different Spectral Smoothing Techniques for Voice Liveness Detection
Complex Rhythm Adjustments in Multilingual Code-Switching across Mandarin, English and Russian
Increasing the Precision of Dysarthric Speech Intelligibility and Severity Level Estimate
Articulation During Voice Disguise: a Pilot Study
Improvement of Speaker Number Estimation by Applying an Overlapped Speech Detector
Mind Your Tweet: Abusive Tweet Detection
Speaker Authorization for Air Traffic Control Security
Prosodic Changes with Age: a Longitudinal Study on a Famous European Portuguese Native Speaker
Automatic Selection of the Most Characterizing Features for Detecting COPD in Speech
Multilingual Training Set Selection for ASR in Under-Resourced Malian Languages
Human and Transformer-Based Prosodic Phrasing in Two Speech Genres
Learning Efficient Representations for Keyword Spotting with Triplet Loss
Regularized Forward-Backward Decoder for Attention Models
Induced Local Attention for Transformer Models in Speech Recognition
Applying EEND Diarization to Telephone Recordings from a Call Center
Acoustic Characteristics of Speech Entrainment in Dialogues in Similar Phonetic Sequences
Predicting Biometric Error Behaviour from Speaker Embeddings and a Fast Score Normalization Scheme.

Browse Subjects

Show more subjects...

Speech and computer : 23rd international conference, SPECOM 2021, St. Petersburg, Russia, September 27-30, 2021 : proceedings / Alexey Karpov, Rodmongo Potapova (eds.).

Linked e-resources

Details

Table of Contents

Browse Subjects

Statistics