1, TITLE: EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs AUTHORS: Guohao Ying ; Xin He ; Bin Gao ; Bo Han ; Xiaowen Chu CATEGORY: cs.CV [cs.CV, cs.LG, cs.NE] HIGHLIGHT: To alleviate the instability issue, we propose an efficient two-stage evolutionary algorithm (EA) based NAS framework to discover GANs, dubbed EAGAN.
2, TITLE: In-Bed Human Pose Estimation from Unseen and Privacy-Preserving Image Domains AUTHORS: Ting Cao ; Mohammad Ali Armin ; Simon Denman ; Lars Petersson ; David Ahmedt-Aristizabal CATEGORY: cs.CV [cs.CV, cs.LG] HIGHLIGHT: Motivated by the effectiveness of self-supervised methods in learning features directly from data, we propose a multi-modal conditional variational autoencoder (MC-VAE) capable of reconstructing features from missing modalities seen during training.
3, TITLE: Anonymization for Skeleton Action Recognition AUTHORS: Myeonghyeon Kim ; Zhenyue Qin ; Yang Liu ; Dongwoo Kim CATEGORY: cs.CV [cs.CV, cs.LG] HIGHLIGHT: We propose two variants of anonymization algorithms to protect the potential privacy leakage from the skeleton dataset.
4, TITLE: A Unified Pruning Framework for Vision Transformers AUTHORS: Hao Yu ; Jianxin Wu CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we design a unified framework for structural pruning of both ViTs and their variants, namely UP-ViTs.
5, TITLE: Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding AUTHORS: Abdullah Hamdi ; Silvio Giancola ; Bernard Ghanem CATEGORY: cs.CV [cs.CV, cs.LG, 68T45] HIGHLIGHT: To this end, we introduce the concept of the multi-view point cloud (Voint cloud), representing each 3D point as a set of features extracted from several view-points.
6, TITLE: Pyramid Adversarial Training Improves ViT Performance AUTHORS: CHARLES HERRMANN et al. CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this work, we present Pyramid Adversarial Training, a simple and effective technique to improve ViT's overall performance.
7, TITLE: Seeking Salient Facial Regions for Cross-Database Micro-Expression Recognition AUTHORS: Xingxun Jiang ; Yuan Zong ; Wenming Zheng CATEGORY: cs.CV [cs.CV, cs.AI] HIGHLIGHT: To deal with cross-database micro-expression recognition, we propose a novel domain adaption method called Transfer Group Sparse Regression (TGSR).
8, TITLE: A Dataset-Dispersion Perspective on Reconstruction Versus Recognition in Single-View 3D Reconstruction Networks AUTHORS: Yefan Zhou ; Yiru Shen ; Yujun Yan ; Chen Feng ; Yaoqing Yang CATEGORY: cs.CV [cs.CV] HIGHLIGHT: Thus, we introduce the dispersion score, a new data-driven metric, to quantify this leading factor and study its effect on NNs.
9, TITLE: MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking Benchmark AUTHORS: XIAOTIAN HAN et al. CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we provide a large-scale densely-labeled multi-camera tracking dataset in five different environments with the help of an auto-annotation system.
10, TITLE: AirObject: A Temporally Evolving Graph Embedding for Object Identification AUTHORS: Nikhil Varma Keetha ; Chen Wang ; Yuheng Qiu ; Kuan Xu ; Sebastian Scherer CATEGORY: cs.CV [cs.CV, cs.RO] HIGHLIGHT: In this context, we propose a novel temporal 3D object encoding approach, dubbed AirObject, to obtain global keypoint graph-based embeddings of objects.
11, TITLE: NeuSample: Neural Sample Field for Efficient View Synthesis AUTHORS: JIEMIN FANG et al. CATEGORY: cs.CV [cs.CV, cs.GR] HIGHLIGHT: We perform experiments on Realistic Synthetic 360° and Real Forward-Facing, two popular 3D scene sets, and show that NeuSample achieves better rendering quality than NeRF while enjoying a faster inference speed.
12, TITLE: Robust 3D Garment Digitization from Monocular 2D Images for 3D Virtual Try-On Systems AUTHORS: SAHIB MAJITHIA et al.
CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we develop a robust 3D garment digitization solution that can generalize well on real-world fashion catalog images with cloth texture occlusions and large body pose variations.
13, TITLE: HEAT: Holistic Edge Attention Transformer for Structured Reconstruction AUTHORS: Jiacheng Chen ; Yiming Qian ; Yasutaka Furukawa CATEGORY: cs.CV [cs.CV] HIGHLIGHT: This paper presents a novel attention-based neural network for structured reconstruction, which takes a 2D raster image as an input and reconstructs a planar graph depicting an underlying geometric structure.
14, TITLE: Low-light Image Enhancement Via Breaking Down The Darkness AUTHORS: Qiming Hu ; Xiaojie Guo CATEGORY: cs.CV [cs.CV] HIGHLIGHT: To seek results with satisfied lighting, cleanliness, and realism from degraded inputs, this paper presents a novel framework inspired by the divide-and-rule principle, greatly alleviating the degradation entanglement.
15, TITLE: TridentAdapt: Learning Domain-invariance Via Source-Target Confrontation and Self-induced Cross-domain Augmentation AUTHORS: Fengyi Shen ; Akhil Gurram ; Ahmet Faruk Tuna ; Onay Urfalioglu ; Alois Knoll CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we propose a novel trident-like architecture that enforces a shared feature encoder to satisfy confrontational source and target constraints simultaneously, thus learning a domain-invariant feature space.
16, TITLE: DiffSDFSim: Differentiable Rigid-Body Dynamics With Implicit Shapes AUTHORS: Michael Strecke ; Joerg Stueckler CATEGORY: cs.CV [cs.CV, cs.GR, cs.LG, cs.RO] HIGHLIGHT: In this paper, we propose a novel approach to differentiable physics with frictional contacts which represents object shapes implicitly using signed distance fields (SDFs).
17, TITLE: Generative Convolution Layer for Image Generation AUTHORS: Seung Park ; Yong-Goo Shin CATEGORY: cs.CV [cs.CV] HIGHLIGHT: This paper introduces a novel convolution method, called generative convolution (GConv), which is simple yet effective for improving the generative adversarial network (GAN) performance.
18, TITLE: CRIS: CLIP-Driven Referring Image Segmentation AUTHORS: ZHAOQING WANG et al. CATEGORY: cs.CV [cs.CV] HIGHLIGHT: Inspired by the recent advance in Contrastive Language-Image Pretraining (CLIP), in this paper, we propose an end-to-end CLIP-Driven Referring Image Segmentation framework (CRIS).
19, TITLE: Morph Detection Enhanced By Structured Group Sparsity AUTHORS: Poorya Aghdaie ; Baaria Chaudhary ; Sobhan Soleymani ; Jeremy Dawson ; Nasser M. Nasrabadi CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we consider the challenge of face morphing attacks, which substantially undermine the integrity of face recognition systems such as those adopted for use in border protection agencies.
20, TITLE: CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning AUTHORS: Bang Yang ; Yuexian Zou CATEGORY: cs.CV [cs.CV] HIGHLIGHT: Specifically, our empirical study on INP vs. CLIP shows that INP makes video caption models tricky to capture attributes' semantics and sensitive to irrelevant background information.
21, TITLE: Equitable Modelling of Brain Imaging By Counterfactual Augmentation with Morphologically Constrained 3D Deep Generative Models AUTHORS: GUILHERME POMBO et al. CATEGORY: cs.CV [cs.CV, cs.LG] HIGHLIGHT: We describe Countersynth, a conditional generative model of diffeomorphic deformations that induce label-driven, biologically plausible changes in volumetric brain images.
22, TITLE: EPose: Let's Make EfficientPose More Generally Applicable AUTHORS: Austin Lally ; Robert Bain ; Mazen Alotaibi CATEGORY: cs.CV [cs.CV, cs.LG] HIGHLIGHT: In this paper we try to improve on EfficientPose by giving it the ability to infer an object's size, and by simplifying both the data collection and loss calculations.
23, TITLE: FENeRF: Face Editing in Neural Radiance Fields AUTHORS: JINGXIANG SUN et al. CATEGORY: cs.CV [cs.CV] HIGHLIGHT: To overcome these limitations, we propose FENeRF, a 3D-aware generator that can produce view-consistent and locally-editable portrait images.
24, TITLE: How Facial Features Convey Attention in Stationary Environments AUTHORS: Janelle Domantay CATEGORY: cs.CV [cs.CV] HIGHLIGHT: This paper aims to extend previous research on distraction detection by analyzing which visual features contribute most to predicting awareness and fatigue.
25, TITLE: Robust Partial-to-Partial Point Cloud Registration in A Full Range AUTHORS: Liang Pan ; Zhongang Cai ; Ziwei Liu CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this work, we propose Graph Matching Consensus Network (GMCNet), which estimates pose-invariant correspondences for full-range Partial-to-Partial point cloud Registration (PPR).
26, TITLE: PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction AUTHORS: QINGYU WANG et al. CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we aim to address the issue between datasets and models and propose a large scale stereo dataset with high accuracy disparity ground truth named PlantStereo.
27, TITLE: Shunted Self-Attention Via Multi-Scale Token Aggregation AUTHORS: Sucheng Ren ; Daquan Zhou ; Shengfeng He ; Jiashi Feng ; Xinchao Wang CATEGORY: cs.CV [cs.CV] HIGHLIGHT: To address this issue, we propose a novel and generic strategy, termed shunted self-attention (SSA), that allows ViTs to model the attentions at hybrid scales per attention layer.
28, TITLE: LatentHuman: Shape-and-Pose Disentangled Latent Representation for Human Bodies AUTHORS: SANDRO LOMBARDI et al. CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this work, we propose a novel neural implicit representation for the human body, which is fully differentiable and optimizable with disentangled shape and pose latent spaces.
29, TITLE: Semi-Supervised 3D Hand Shape and Pose Estimation with Label Propagation AUTHORS: Samira Kaviani ; Amir Rahimi ; Richard Hartley CATEGORY: cs.CV [cs.CV] HIGHLIGHT: To tackle this issue in the context of semi-supervised 3D hand shape and pose estimation, we propose the Pose Alignment network to propagate 3D annotations from labelled frames to nearby unlabelled frames in sparsely annotated videos.
30, TITLE: Aerial Images Meet Crowdsourced Trajectories: A New Approach to Robust Road Extraction AUTHORS: LINGBO LIU et al. CATEGORY: cs.CV [cs.CV, cs.AI] HIGHLIGHT: In this work, we focus on a challenging task of land analysis, i.e., automatic extraction of traffic roads from remote sensing data, which has widespread applications in urban development and expansion estimation.
31, TITLE: The MIS Check-Dam Dataset for Object Detection and Instance Segmentation Tasks AUTHORS: Chintan Tundia ; Rajiv Kumar ; Om Damani ; G. Sivakumar CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we introduce MIS Check-Dam, a new dataset of check-dams from satellite imagery for building an automated system for the detection and mapping of check-dams, focusing on the importance of irrigation structures used for agriculture.
32, TITLE: Zero-Shot Semantic Segmentation Via Spatial and Multi-Scale Aware Visual Class Embedding AUTHORS: Sungguk Cha ; Yooseung Wang CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we address the limitation of L-ZSSS in generalization, which is a virtue of zero-shot learning.
33, TITLE: Hallucinated Neural Radiance Fields in The Wild AUTHORS: XINGYU CHEN et al.
CATEGORY: cs.CV [cs.CV, cs.AI] HIGHLIGHT: To solve this problem, we present an end-to-end framework for constructing a hallucinated NeRF, dubbed as H-NeRF.
34, TITLE: Automatic Tracing of Mandibular Canal Pathways Using Deep Learning AUTHORS: Mrinal Kanti Dhar ; Zeyun Yu CATEGORY: cs.CV [cs.CV] HIGHLIGHT: Here, we propose a deep learning-based framework to detect mandibular canals from CBCT data.
35, TITLE: SamplingAug: On The Importance of Patch Sampling Augmentation for Single Image Super-Resolution AUTHORS: SHIZUN WANG et al. CATEGORY: cs.CV [cs.CV, cs.AI] HIGHLIGHT: In this paper, we present a simple yet effective data augmentation method.
36, TITLE: MapReader: A Computer Vision Pipeline for The Semantic Exploration of Maps at Scale AUTHORS: Kasra Hosseini ; Daniel C. S. Wilson ; Kaspar Beelen ; Katherine McDonough CATEGORY: cs.CV [cs.CV, cs.LG, cs.SE] HIGHLIGHT: We present MapReader, a free, open-source software library written in Python for analyzing large map collections (scanned or born-digital).
37, TITLE: The Devil Is in The Margin: Margin-based Label Smoothing for Network Calibration AUTHORS: Bingyuan Liu ; Ismail Ben Ayed ; Adrian Galdran ; Jose Dolz CATEGORY: cs.CV [cs.CV, cs.LG] HIGHLIGHT: Following our observations, we propose a simple and flexible generalization based on inequality constraints, which imposes a controllable margin on logit distances.
38, TITLE: Human Imperceptible Attacks and Applications to Improve Fairness AUTHORS: Xinru Hua ; Huanzhong Xu ; Jose Blanchet ; Viet Nguyen CATEGORY: cs.CV [cs.CV] HIGHLIGHT: We provide a Distributionally Robust Optimization (DRO) framework which integrates human-based image quality assessment methods to design optimal attacks that are imperceptible to humans but significantly damaging to deep neural networks.
39, TITLE: Reconstruction Student with Attention for Student-Teacher Pyramid Matching AUTHORS: Shinji Yamada ; Kazuhiro Hotta CATEGORY: cs.CV [cs.CV, eess.IV] HIGHLIGHT: Here we propose a powerful method which compensates for the shortcomings of STPM.
40, TITLE: Semi-Local Convolutions for LiDAR Scan Processing AUTHORS: Larissa T. Triess ; David Peter ; J. Marius Zöllner CATEGORY: cs.CV [cs.CV, cs.LG] HIGHLIGHT: Therefore, we propose semi local convolution (SLC), a convolution layer with reduced amount of weight-sharing along the vertical dimension.
41, TITLE: ConDA: Unsupervised Domain Adaptation for LiDAR Segmentation Via Regularized Domain Concatenation AUTHORS: Lingdong Kong ; Niamul Quader ; Venice Erin Liong CATEGORY: cs.CV [cs.CV, cs.LG, cs.RO] HIGHLIGHT: In this work, we improve and extend on this aspect.
42, TITLE: Deformable ProtoPNet: An Interpretable Image Classifier Using Deformable Prototypes AUTHORS: Jon Donnelly ; Alina Jade Barnett ; Chaofan Chen CATEGORY: cs.CV [cs.CV, cs.AI, cs.LG] HIGHLIGHT: In this paper, we address this shortcoming by proposing a case-based interpretable neural network that provides spatially flexible prototypes, called a deformable prototypical part network (Deformable ProtoPNet).
43, TITLE: AdaViT: Adaptive Vision Transformers for Efficient Image Recognition AUTHORS: LINGCHEN MENG et al. CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we argue that due to the large variations among images, their need for modeling long-range dependencies between patches differs.
44, TITLE: 360MonoDepth: High-Resolution 360° Monocular Depth Estimation AUTHORS: Manuel Rey-Area ; Mingze Yuan ; Christian Richardt CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this work, we propose a flexible framework for monocular depth estimation from high-resolution 360° images using tangent images.
45, TITLE: HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing AUTHORS: Yuval Alaluf ; Omer Tov ; Ron Mokady ; Rinon Gal ; Amit H.
Bermano CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this work, we introduce this approach into the realm of encoder-based inversion.
46, TITLE: ZZ-Net: A Universal Rotation Equivariant Architecture for 2D Point Clouds AUTHORS: Georg Bökman ; Fredrik Kahl ; Axel Flinth CATEGORY: cs.CV [cs.CV, cs.LG] HIGHLIGHT: In this paper, we are concerned with rotation equivariance on 2D point cloud data.
47, TITLE: ATS: Adaptive Token Sampling For Efficient Vision Transformers AUTHORS: MOHSEN FAYYAZ et al. CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this work, we, therefore, introduce a differentiable parameter-free Adaptive Token Sampling (ATS) module, which can be plugged into any existing vision transformer architecture.
48, TITLE: Spatio-Temporal Multi-Flow Network for Video Frame Interpolation AUTHORS: Duolikun Danier ; Fan Zhang ; David Bull CATEGORY: cs.CV [cs.CV, eess.IV] HIGHLIGHT: In this context, we present a novel deep learning based VFI method, ST-MFNet, based on a Spatio-Temporal Multi-Flow architecture.
49, TITLE: Automated Damage Inspection of Power Transmission Towers from UAV Images AUTHORS: Aleixo Cambeiro Barreiro ; Clemens Seibold ; Anna Hilsmann ; Peter Eisert CATEGORY: cs.CV [cs.CV] HIGHLIGHT: Our main contributions are the development of a system for damage detection on remotely acquired drone images, applying techniques to overcome the issue of data scarcity and ambiguity, as well as the evaluation of the viability of such an approach to solve this particular problem.
50, TITLE: MC-SSL0.0: Towards Multi-Concept Self-Supervised Learning AUTHORS: Sara Atito ; Muhammad Awais ; Ammarah Farooq ; Zhenhua Feng ; Josef Kittler CATEGORY: cs.CV [cs.CV, cs.LG] HIGHLIGHT: This study aims to investigate the possibility of modelling all the concepts present in an image without using labels.
51, TITLE: Large-Scale Video Analytics Through Object-Level Consolidation AUTHORS: Daniel Rivas ; Francesc Guim ; Jordà Polo ; David Carrera CATEGORY: cs.CV [cs.CV, cs.NI] HIGHLIGHT: In this paper, we present FoMO (Focus on Moving Objects).
52, TITLE: Point Cloud Instance Segmentation with Semi-supervised Bounding-Box Mining AUTHORS: YONGBIN LIAO et al. CATEGORY: cs.CV [cs.CV, cs.AI] HIGHLIGHT: In this paper, we introduce the first semi-supervised point cloud instance segmentation framework (SPIB) using both labeled and unlabelled bounding boxes as supervision.
53, TITLE: Affect-DML: Context-Aware One-Shot Recognition of Human Affect Using Deep Metric Learning AUTHORS: KUNYU PENG et al. CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we conceptualize one-shot recognition of emotions in context -- a new problem aimed at recognizing human affect states in finer particle level from a single support sample.
54, TITLE: Image Denoising By Super Neurons: Why Go Deep? AUTHORS: Junaid Malik ; Serkan Kiranyaz ; Moncef Gabbouj CATEGORY: cs.CV [cs.CV, cs.LG] HIGHLIGHT: As the integration of non-local information is known to benefit denoising, in this work we investigate the use of super neurons for both synthetic and real-world image denoising.
55, TITLE: AssistSR: Affordance-centric Question-driven Video Segment Retrieval AUTHORS: Stan Weixian Lei ; Yuxuan Wang ; Dongxing Mao ; Difei Gao ; Mike Zheng Shou CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In contrast, we present a new task called Affordance-centric Question-driven Video Segment Retrieval (AQVSR).
56, TITLE: Camera Distortion-aware 3D Human Pose Estimation in Video with Optimization-based Meta-Learning AUTHORS: Hanbyel Cho ; Yooshin Cho ; Jaemyung Yu ; Junmo Kim CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we propose a simple yet effective model for 3D human pose estimation in video that can quickly adapt to any distortion environment by utilizing MAML, a representative optimization-based meta-learning algorithm.
57, TITLE: NeeDrop: Self-supervised Shape Representation from Sparse Point Clouds Using Needle Dropping AUTHORS: Alexandre Boulch ; Pierre-Alain Langlois ; Gilles Puy ; Renaud Marlet CATEGORY: cs.CV [cs.CV, cs.CG, cs.LG] HIGHLIGHT: In contrast, we introduce NeeDrop, a self-supervised method for learning shape representations from possibly extremely sparse point clouds.
58, TITLE: Revisiting Temporal Alignment for Video Restoration AUTHORS: Kun Zhou ; Wenbo Li ; Liying Lu ; Xiaoguang Han ; Jiangbo Lu CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this work, we present a novel, generic iterative alignment module which employs a gradual refinement scheme for sub-alignments, yielding more accurate motion compensation.
59, TITLE: Two-stage Temporal Modelling Framework for Video-based Depression Recognition Using Graph Representation AUTHORS: Jiaqi Xu ; Siyang Song ; Keerthy Kusumam ; Hatice Gunes ; Michel Valstar CATEGORY: cs.CV [cs.CV, 68T40, I.2.1] HIGHLIGHT: In this sense, we propose a two-stage framework that models depression severity from multi-scale short-term and video-level facial behaviours.
60, TITLE: Adaptive Gating for Single-Photon 3D Imaging AUTHORS: Ryan Po ; Adithya Pediredla ; Ioannis Gkioulekas CATEGORY: cs.CV [cs.CV] HIGHLIGHT: We propose an adaptive gating scheme built upon Thompson sampling.
61, TITLE: NeRFReN: Neural Radiance Fields with Reflections AUTHORS: Yuan-Chen Guo ; Di Kang ; Linchao Bao ; Yu He ; Song-Hai Zhang CATEGORY: cs.CV [cs.CV, cs.GR] HIGHLIGHT: Specifically, we propose to split a scene into transmitted and reflected components, and model the two components with separate neural radiance fields.
62, TITLE: DAFormer: Improving Network Architectures and Training Strategies for Domain-Adaptive Semantic Segmentation AUTHORS: Lukas Hoyer ; Dengxin Dai ; Luc Van Gool CATEGORY: cs.CV [cs.CV] HIGHLIGHT: As acquiring pixel-wise annotations of real-world images for semantic segmentation is a costly process, a model can instead be trained with more accessible synthetic data and adapted to real images without requiring their annotations.
63, TITLE: EdiBERT, A Generative Model for Image Editing AUTHORS: Thibaut Issenhuth ; Ugo Tanielian ; Jérémie Mary ; David Picard CATEGORY: cs.CV [cs.CV, cs.LG] HIGHLIGHT: In this paper, we aim at making a step towards a unified approach for image editing.
64, TITLE: MultiPath++: Efficient Information Fusion and Trajectory Aggregation for Behavior Prediction AUTHORS: BALAKRISHNAN VARADARAJAN et al. CATEGORY: cs.CV [cs.CV, cs.AI, cs.LG, cs.RO] HIGHLIGHT: In this paper, we present MultiPath++, a future prediction model that achieves state-of-the-art performance on popular benchmarks.
65, TITLE: Multi-modal Text Recognition Networks: Interactive Enhancements Between Visual and Semantic Features AUTHORS: Byeonghu Na ; Yoonsik Kim ; Sungrae Park CATEGORY: cs.CV [cs.CV] HIGHLIGHT: This paper introduces a novel method, called Multi-modAl Text Recognition Network (MATRN), that enables interactions between visual and semantic features for better recognition performances.
66, TITLE: Unsupervised Domain Generalization for Person Re-identification: A Domain-specific Adaptive Framework AUTHORS: Lei Qi ; Lei Wang ; Yinghuan Shi ; Xin Geng CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we turn to investigate unsupervised domain generalization for ReID, by assuming that no label is available for any source domains.
67, TITLE: Regularized Directional Representations for Medical Image Registration AUTHORS: Vincent Jaouen ; Pierre-Henri Conze ; Guillaume Dardenne ; Julien Bert ; Dimitris Visvikis CATEGORY: cs.CV [cs.CV] HIGHLIGHT: Following this research path, we propose a new method for mono- and multimodal image registration based on the alignment of regularized vector fields derived from structural information such as gradient vector flow fields, a technique we call vector field similarity.
68, TITLE: ESL: Event-based Structured Light AUTHORS: Manasi Muglikar ; Guillermo Gallego ; Davide Scaramuzza CATEGORY: cs.CV [cs.CV] HIGHLIGHT: We propose a novel structured-light system using an event camera to tackle the problem of accurate and high-speed depth sensing.
69, TITLE: SketchEdit: Mask-Free Local Image Manipulation with Partial Sketches AUTHORS: Yu Zeng ; Zhe Lin ; Vishal M. Patel CATEGORY: cs.CV [cs.CV, cs.MM] HIGHLIGHT: To this end, we investigate a new paradigm of sketch-based image manipulation: mask-free local image manipulation, which only requires sketch inputs from users and utilizes the entire original image.
70, TITLE: RADU: Ray-Aligned Depth Update Convolutions for ToF Data Denoising AUTHORS: Michael Schelling ; Pedro Hermosilla ; Timo Ropinski CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we propose an iterative denoising approach operating in 3D space, that is designed to learn on 2.5D data by enabling 3D point convolutions to correct the points' positions along the view direction.
71, TITLE: Learning Multiple Dense Prediction Tasks from Partially Annotated Data AUTHORS: Wei-Hong Li ; Xialei Liu ; Hakan Bilen CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we present a label efficient approach and look at jointly learning of multiple dense prediction tasks on partially annotated data, which we call multi-task partially-supervised learning.
72, TITLE: Hole-robust Wireframe Detection AUTHORS: Naejin Kong ; Kiwoong Park ; Harshith Goka CATEGORY: cs.CV [cs.CV] HIGHLIGHT: We show qualitatively and quantitatively that our approach significantly outperforms previous works unable to handle holes, as well as improves ordinary detection without holes given.
73, TITLE: Leveraging The Topological Consistencies of Learning in Deep Neural Networks AUTHORS: Stuart Synakowski ; Fabian Benitez-Quiroz ; Aleix M. Martinez CATEGORY: cs.CV [cs.CV, cs.LG] HIGHLIGHT: In this work, we define a new class of topological features that accurately characterize the progress of learning while being quick to compute during running time.
74, TITLE: ARTSeg: Employing Attention for Thermal Images Semantic Segmentation AUTHORS: Farzeen Munir ; Shoaib Azam ; Unse Fatima ; Moongu Jeon CATEGORY: cs.CV [cs.CV, cs.AI] HIGHLIGHT: In this work, we have employed the thermal camera for semantic segmentation.
75, TITLE: Neural Attention for Image Captioning: Review of Outstanding Methods AUTHORS: Zanyar Zohourianshahzadi ; Jugal K. Kalita CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this survey, we provide a review of literature related to attentive deep learning models for image captioning.
76, TITLE: A Face Recognition System's Worst Morph Nightmare, Theoretically AUTHORS: Una M. Kelly ; Raymond Veldhuis ; Luuk Spreeuwers CATEGORY: cs.CV [cs.CV] HIGHLIGHT: We propose a method to create a third, different type of morph, that has the advantage of being easier to train.
77, TITLE: Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection AUTHORS: Deepti Hegde ; Vishal Patel CATEGORY: cs.CV [cs.CV] HIGHLIGHT: We propose a single-frame approach for source-free, unsupervised domain adaptation of lidar-based 3D object detectors that uses class prototypes to mitigate the effect of pseudo-label noise.
78, TITLE: CT-block: A Novel Local and Global Features Extractor for Point Cloud AUTHORS: Shangwei Guo ; Jun Li ; Zhengchao Lai ; Xiantong Meng ; Shaokun Han CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we propose a novel module, named CT-block, that can simultaneously extract and fuse local and global features.
79, TITLE: Hyperspectral Image Segmentation Based on Graph Processing Over Multilayer Networks AUTHORS: Songyang Zhang ; Qinwen Deng ; Zhi Ding CATEGORY: cs.CV [cs.CV, eess.SP] HIGHLIGHT: Leveraging the recently developed graph signal processing over multilayer networks (M-GSP), this work proposes several approaches to HSI segmentation based on M-GSP feature extraction.
80, TITLE: PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite Images AUTHORS: Stefano Zorzi ; Shabab Bazrafkan ; Stefan Habenschuss ; Friedrich Fraundorfer CATEGORY: cs.CV [cs.CV] HIGHLIGHT: This paper introduces PolyWorld, a neural network that directly extracts building vertices from an image and connects them correctly to create precise polygons.
81, TITLE: Diffusion Autoencoders: Toward A Meaningful and Decodable Representation AUTHORS: Konpat Preechakul ; Nattanat Chatthee ; Suttisak Wizadwongsa ; Supasorn Suwajanakorn CATEGORY: cs.CV [cs.CV] HIGHLIGHT: Our key idea is to use a learnable encoder for discovering the high-level semantics, and a DPM as the decoder for modeling the remaining stochastic variations.
82, TITLE: Probabilistic Estimation of 3D Human Shape and Pose with A Semantic Local Parametric Model AUTHORS: Akash Sengupta ; Ignas Budvytis ; Roberto Cipolla CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In contrast, we present a method that (i) predicts distributions over local body shape in the form of semantic body measurements and (ii) uses a linear mapping to transform a local distribution over body measurements to a global distribution over SMPL shape parameters.
83, TITLE: HRNET: AI on Edge for Mask Detection and Social Distancing AUTHORS: Kinshuk Sengupta ; Praveen Ranjan Srivastava CATEGORY: cs.CV [cs.CV, cs.AI] HIGHLIGHT: The purpose of the paper is to provide an innovative emerging technology framework for the community to combat epidemic situations.
84, TITLE: FMD-cGAN: Fast Motion Deblurring Using Conditional Generative Adversarial Networks AUTHORS: Jatin Kumar ; Indra Deep Mastan ; Shanmuganathan Raman CATEGORY: cs.CV [cs.CV, eess.IV, I.4.3; I.4.4] HIGHLIGHT: In this paper, we present a Fast Motion Deblurring-Conditional Generative Adversarial Network (FMD-cGAN) that helps in blind motion deblurring of a single image.
85, TITLE: Unsupervised Domain Adaptation: A Reality Check AUTHORS: Kevin Musgrave ; Serge Belongie ; Ser-Nam Lim CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this paper, we show via large-scale experimentation that 1) in the oracle setting, the difference in accuracy between UDA algorithms is smaller than previously thought, 2) state-of-the-art validation methods are not well-correlated with accuracy, and 3) differences between UDA algorithms are dwarfed by the drop in accuracy caused by validation methods.
86, TITLE: Using A GAN to Generate Adversarial Examples to Facial Image Recognition AUTHORS: Andrew Merrigan ; Alan F. Smeaton CATEGORY: cs.CV [cs.CV] HIGHLIGHT: In this

