001434274 000__ 07978cam\a2200889\i\4500 001434274 001__ 1434274 001434274 003__ OCoLC 001434274 005__ 20230309003720.0 001434274 006__ m\\\\\o\\d\\\\\\\\ 001434274 007__ cr\nn\nnnunnun 001434274 008__ 210122s2021\\\\sz\a\\\\o\\\\\101\0\eng\d 001434274 019__ $$a1235596355$$a1241065920$$a1244118510$$a1249945120 001434274 020__ $$a9783030678326$$q(electronic bk.) 001434274 020__ $$a3030678326$$q(electronic bk.) 001434274 020__ $$z9783030678319 001434274 0247_ $$a10.1007/978-3-030-67832-6$$2doi 001434274 035__ $$aSP(OCoLC)1238924263 001434274 040__ $$aSNU$$beng$$erda$$epn$$cSNU$$dOCLCO$$dGW5XE$$dEBLCP$$dGZM$$dOCLCO$$dDKU$$dDCT$$dOCLCF$$dLEATE$$dOCLCQ$$dOCLCO$$dCOM$$dOCLCQ 001434274 049__ $$aISEA 001434274 050_4 $$aQA76.575 001434274 08204 $$a006.7$$223 001434274 1112_ $$aInternational Conference on Multi-Media Modeling$$n(27th :$$d2021 :$$cPrague, Czech Republic) 001434274 24510 $$aMultiMedia Modeling :$$b27th International Conference, MMM 2021, Prague, Czech Republic, June 22-24, 2021 : proceedings.$$nPart I /$$cJakub Lokoč, Tomáš Skopal, Klaus Schoeffmann, Vasileios Mezaris, Xirong Li, Stefanos Vrochidis, Ioannis Patras (eds.). 001434274 24630 $$aMMM 2021 001434274 264_1 $$aCham :$$bSpringer,$$c[2021] 001434274 300__ $$a1 online resource (xxv, 733 pages) :$$billustrations (chiefly color) 001434274 336__ $$atext$$btxt$$2rdacontent 001434274 337__ $$acomputer$$bc$$2rdamedia 001434274 338__ $$aonline resource$$bcr$$2rdacarrier 001434274 347__ $$atext file 001434274 347__ $$bPDF 001434274 4901_ $$aLecture notes in computer science ;$$v12572 001434274 4901_ $$aLNCS sublibrary: SL3 - Information systems and applications, incl. Internet/Web, and HCI 001434274 500__ $$aInternational conference proceedings. 001434274 500__ $$aIncludes author index. 
001434274 5050_ $$aIntro -- Preface -- Organization -- Contents -- Part I -- Contents -- Part II -- Crossed-Time Delay Neural Network for Speaker Recognition -- 1 Introduction -- 2 Baseline Models -- 3 Crossed-Time Delay Neural Network -- 3.1 Crossed-Time Delay Layer -- 3.2 Statistical Concatenation -- 4 Experiments -- 4.1 Preprocessing -- 4.2 Model Configuration -- 4.3 Training Parameters Settings -- 4.4 Embedding Extraction and Verification -- 5 Results -- 5.1 VoxCeleb1 -- 5.2 Vcc2016 -- 6 Conclusion -- References -- An Asymmetric Two-Sided Penalty Term for CT-GAN -- 1 Introduction -- 2 Background -- 2.1 WGAN 001434274 5058_ $$a2.2 WGAN-GP -- 2.3 CT-GAN -- 3 Our Approach -- 3.1 Asymmetric Two-Sided Penalty -- 3.2 WGAN with Asymmetric Two-Sided Penalty -- 4 Experiments -- 4.1 Datasets and Evaluation -- 4.2 Results -- 5 Conclusion -- References -- Fast Discrete Matrix Factorization Hashing for Large-Scale Cross-Modal Retrieval -- 1 Introduction -- 2 Proposed Method -- 2.1 Problem Formulation -- 2.2 Fast Discrete Matrix Factorization Hashing -- 2.3 Optimization Algorithm -- 2.4 Out-of-Sample Extension -- 3 Experiment -- 3.1 Experiment Settings -- 3.2 Experimental Results -- 3.3 Parameter Sensitivity Analysis 001434274 5058_ $$a3.4 Time Cost Analysis -- 4 Conclusion -- References -- Fast Optimal Transport Artistic Style Transfer -- 1 Introduction -- 2 Related Work -- 3 Methodology -- 3.1 Fast Style Transfer Framework -- 3.2 Learn to Style Transfer via Optimal Transport -- 3.3 Optimization Objectives -- 4 Experiments -- 4.1 Implementation Details -- 4.2 Qualitative Analysis -- 4.3 Quantitative Analysis -- 4.4 Ablation Study -- 5 Conclusion -- References -- Stacked Sparse Autoencoder for Audio Object Coding -- 1 Introduction -- 2 Related Work -- 3 Proposed Approach -- 3.1 Structure of SSAE-SAOC 001434274 5058_ $$a3.2 Architecture of Stacked Sparse Autoencoder -- 4 Experimental Evaluation -- 4.1 Experiments Conditions -- 4.2 SSAE Model Training -- 4.3 Test Results and 
Data Analysis -- 5 Conclusions -- References -- A Collaborative Multi-modal Fusion Method Based on Random Variational Information Bottleneck for Gesture Recognition -- 1 Introduction -- 2 Related Work -- 3 Methodology -- 3.1 Variational Information Bottleneck -- 3.2 Random Variational Information Bottleneck -- 4 Experiment -- 4.1 Data Processing -- 4.2 Experimental Analysis -- 5 Conclusion -- References 001434274 5058_ $$aFrame Aggregation and Multi-modal Fusion Framework for Video-Based Person Recognition -- 1 Introduction -- 2 Related Work -- 3 Our Framework -- 3.1 Overview -- 3.2 AttentionVLAD for Frame Aggregation -- 3.3 MLMA for Multi-modal Fusion -- 4 Experiments -- 4.1 Dataset -- 4.2 Results -- 4.3 Implementation Details -- 4.4 Ablation Study -- 5 Conclusion -- References -- An Adaptive Face-Iris Multimodal Identification System Based on Quality Assessment Network -- 1 Introduction -- 2 Proposed System -- 2.1 Preprocessing -- 2.2 Feature Extraction -- 2.3 Matching -- 2.4 FaceIrisQANet -- 2.5 Fusion and Decision. 001434274 506__ $$aAccess limited to authorized users. 001434274 520__ $$aThe two-volume set LNCS 12572 and 12573 constitutes the thoroughly refereed proceedings of the 27th International Conference on MultiMedia Modeling, MMM 2021, held in Prague, Czech Republic, in June 2021. Of the 211 submitted regular papers, 40 papers were selected for oral presentation and 33 for poster presentation; 16 special session papers were accepted as well as 2 papers for a demo presentation and 17 papers for participation at the Video Browser Showdown 2021. 
The papers cover topics such as: multimedia indexing; multimedia mining; multimedia abstraction and summarization; multimedia annotation, tagging and recommendation; multimodal analysis for retrieval applications; semantic analysis of multimedia and contextual data; multimedia fusion methods; multimedia hyperlinking; media content browsing and retrieval tools; media representation and algorithms; audio, image, video processing, coding and compression; multimedia sensors and interaction modes; multimedia privacy, security and content protection; multimedia standards and related issues; advances in multimedia networking and streaming; multimedia databases, content delivery and transport; wireless and mobile multimedia networking; multi-camera and multi-view systems; augmented and virtual reality, virtual environments; real-time and interactive multimedia applications; mobile multimedia applications; multimedia web applications; multimedia authoring and personalization; interactive multimedia and interfaces; sensor networks; social and educational multimedia applications; and emerging trends. 001434274 588__ $$aOnline resource; title from PDF title page (SpringerLink, viewed March 10, 2021). 001434274 650_0 $$aMultimedia systems$$vCongresses. 001434274 650_0 $$aComputer simulation$$vCongresses. 001434274 650_0 $$aOptical data processing. 001434274 650_0 $$aApplication software. 001434274 650_0 $$aEducation$$xData processing. 001434274 650_0 $$aArtificial intelligence. 001434274 650_0 $$aDatabase management. 001434274 650_6 $$aMultimédia$$vCongrès. 001434274 650_6 $$aSimulation par ordinateur$$vCongrès. 001434274 650_6 $$aTraitement optique de l'information. 001434274 650_6 $$aLogiciels d'application. 001434274 650_6 $$aÉducation$$xInformatique. 001434274 650_6 $$aIntelligence artificielle. 001434274 650_6 $$aBases de données$$xGestion. 
001434274 655_7 $$aConference papers and proceedings.$$2fast$$0(OCoLC)fst01423772 001434274 655_7 $$aConference papers and proceedings.$$2lcgft 001434274 655_7 $$aActes de congrès.$$2rvmgf 001434274 655_0 $$aElectronic books. 001434274 7001_ $$aLokoč, Jakub,$$eeditor. 001434274 7001_ $$aSkopal, Tomas,$$eeditor. 001434274 7001_ $$aSchoeffmann, Klaus,$$eeditor. 001434274 7001_ $$aMezaris, Vasileios,$$eeditor. 001434274 7001_ $$aLi, Xirong,$$eeditor. 001434274 7001_ $$aVrochidis, Stefanos,$$d1975-$$eeditor. 001434274 7001_ $$aPatras, Ioannis,$$eeditor. 001434274 77608 $$z3030678318 001434274 830_0 $$aLecture notes in computer science ;$$v12572. 001434274 830_0 $$aLNCS sublibrary.$$nSL 3,$$pInformation systems and applications, incl. Internet/Web, and HCI. 001434274 852__ $$bebk 001434274 85640 $$3Springer Nature$$uhttps://univsouthin.idm.oclc.org/login?url=https://link.springer.com/10.1007/978-3-030-67832-6$$zOnline Access$$91397441.1 001434274 909CO $$ooai:library.usi.edu:1434274$$pGLOBAL_SET 001434274 980__ $$aBIB 001434274 980__ $$aEBOOK 001434274 982__ $$aEbook 001434274 983__ $$aOnline 001434274 994__ $$a92$$bISE