Table of Contents
W12
Commands 4 Autonomous Vehicles
Commands 4 Autonomous Vehicles (C4AV) Workshop Summary
Commands for Autonomous Vehicles by Progressively Stacking Visual-Linguistic Representations
C4AV: Learning Cross-Modal Representations from Transformers
Cosine meets Softmax: A tough-to-beat baseline for visual grounding
Attention Enhanced Single Stage Multimodal Reasoner
AttnGrounder: Talking to Cars with Attention
W13
Computer VISion for ART Analysis
Detecting Faces, Visual Medium Types, and Gender in Historical Advertisements, 1950-1995
A Dataset and Baselines for Visual Question Answering on Art
Understanding Compositional Structures in Art Historical Images using Pose and Gaze Priors
Demographic Influences on Contemporary Art with Unsupervised Style Embeddings
Geolocating Time: Digitisation and Reverse Engineering of a Roman Sundial
Object Retrieval and Localization in Large Art Collections using Deep Multi-Style Feature Fusion and Iterative Voting
W15
Sign Language Recognition, Translation and Production
SLRTP 2020: The Sign Language Recognition, Translation & Production Workshop
Automatic Segmentation of Sign Language into Subtitle-Units
Phonologically-meaningful Subunits for Deep Learning-based Sign Language Recognition
Recognition of Affective and Grammatical Facial Expressions: a study for Brazilian sign language
Real-Time Sign Language Detection using Human Pose Estimation
Exploiting 3D Hand Pose Estimation in Deep Learning-Based Sign Language Recognition from RGB Videos
A Plan for Developing an Auslan Communication Technologies Pipeline
A Multi-modal Machine Learning Approach and Toolkit to Automate Recognition of Early Stages of Dementia among British Sign Language Users
Score-level Multi Cue Fusion for Sign Language Recognition
Unsupervised Discovery of Sign Terms by K-Nearest Neighbours Approach
Improving Keyword Search Performance in Sign Language with Hand Shape Features
W16
Visual Inductive Priors for Data-Efficient Deep Learning
Lightweight Action Recognition in Compressed Videos
On sparse connectivity, adversarial robustness, and a novel model of the artificial neuron
Injecting Prior Knowledge into Image Caption Generation
Learning Temporally Invariant and Localizable Features via Data Augmentation for Video Recognition
Unsupervised Learning of Video Representations via Dense Trajectory Clustering
Distilling Visual Priors from Self-Supervised Learning
Unsupervised Image Classification for Deep Representation Learning
TDMPNet: Prototype Network with Recurrent Top-Down Modulation for Robust Object Classification under Partial Occlusion
What leads to generalization of object proposals
A Self-Supervised Framework for Human Instance Segmentation
Multiple interaction learning with question-type prior knowledge for constraining answer search space in visual question answering
A visual inductive priors framework for data-efficient image classification
W18
3D Poses In the Wild Challenge
Predicting Camera Viewpoint Improves Cross-dataset Generalization for 3D Human Pose Estimation
Beyond Weak Perspective for Monocular 3D Human Pose Estimation
W20
Map-based Localization for Autonomous Driving
Geographically Local Representation Learning with a Spatial Prior for Visual Localization
W22
Recovering 6D Object Pose
BOP Challenge 2020 on 6D Object Localization
StructureFromGAN: Single Image 3D Model Reconstruction and Photorealistic Texturing
6 DoF Pose Estimation of Textureless Objects from Multiple RGB Frames
Semi-supervised Viewpoint Estimation with Geometry-aware Conditional Generation
Physical Plausibility of 6D Pose Estimates in Scenes of Static Rigid Objects
DronePose: Photorealistic UAV-Assistant Dataset Synthesis for 3D Pose Estimation via a Smooth Silhouette Loss
How to track your dragon: A Multi-Attentional Framework for real-time RGB-D 6-DOF Object Pose Tracking
Hybrid Approach for 6DoF Pose Estimation
Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation
W23
SHApe Recovery from Partial Textured 3D Scans
Implicit Feature Networks for Texture Completion from Partial 3D Data
3DBooSTeR: 3D Body Shape and Texture Recovery
SHARP 2020: The 1st Shape Recovery from Partial Textured 3D Scans Challenge Results