Socially Pertinent Robots in Gerontological Healthcare

CORDIS provides links to public deliverables and publications of HORIZON projects.

Links to deliverables and publications of FP7 projects, as well as links to some specific result types such as datasets and software, are dynamically retrieved from OpenAIRE.

Deliverables

Audio-visual speaker separation/tracking with a moving robot

Result of combining T3.1 and T3.3. This deliverable will discuss the software package to perform audio-visual speaker separation and tracking on a moving robotic platform.

Human description in relevant environments

Result of T4.1. Fully functional framework for human face and body analysis from visual data. It includes tools for multi-target body pose estimation and face analysis, tested in relevant environments. The framework incorporates information from audio-based speaker recognition.

Robot non-verbal behaviour system in target environments

Will deliver the final software design and implementation of the robot non-verbal behaviour manager for the target environment. This includes the interface between the non-verbal behaviour manager, the task planner, and the conversational system.

Multi-party conversational system in target environments

Will deliver the final software components for multi-party interaction including the conversational system, NLU, dialogue management, and NLG geared towards dealing with realistic scenarios (T5.3).

Audio-visual speaker separation/tracking with a static robot

Results of T3.2 and T3.3. A software package incorporating video localization information to assist the audio diarisation and separation tasks for use by the robot’s dialogue system in both static and dynamic scenarios.

Audio speaker diarisation and extraction with a moving robot

Result of T3.3. A software package, implemented for a moving robot, that accounts for the dynamics of the acoustic scenario to diarise and extract the desired speaker.

Semantics-based localisation in realistic environments

Research results and implementation of building geometric and semantic maps for visual localization, developed in T2.1.

Neural network architecture specification and design

This deliverable will provide the software basis for the development of the neural architectures used in the project. After an initial phase of collecting requirements for these networks, we will provide some base architectures from which partners can build their systems.

Software for generating multi-party situated interactions

Result of T6.2. This deliverable will present the software for generating multi-party situated interactions.

Human description in realistic environments

Result of T4.1. Research results and implementation of multi-target body pose and state estimation based on deep and transfer learning and domain adaptation techniques.

Robot non-verbal behaviour system in realistic environments

Will deliver the initial software design and implementation of the robot non-verbal behaviour manager for a realistic environment. This includes the interface between the non-verbal behaviour manager, the task planner, and the conversational system.

Initial software architecture for SPRING-REEM

This Deliverable contains the initial SW architecture (modules and applications integration), after the preliminary Software Integration Cycle.

Visual-based localisation in realistic environments

Result of T2.1. Research results of visual localization of a robot in realistic indoor environments, based on viewpoint visual similarity and on geometric matching with careful camera pose verification, using camera pose computation in 3D or similar weaker geometrical verification techniques; interaction with the low-level robot behaviour and the robot's odometry.

Semantics-based localisation in relevant environments

Will deliver software with a Natural Language vocabulary (categories) for object recognition that interfaces with the high-level task scheduler for semantic scene understanding (T2.3) and builds towards interfacing the Interaction Manager to incorporate object affordances (T2.4).

Mature software architecture for SPRING-REEM

This Deliverable contains the mature SW architecture (modules and applications integration), after the Intermediate Software Integration Cycle.

Visual-based localisation in relevant environments

Result of T2.4. Implementation of visual localization of a robot in realistic indoor environments, based on viewpoint visual similarity and on geometric matching with careful camera pose verification, using camera pose computation in 3D or similar weaker geometrical verification techniques; interaction with the low-level robot behaviour and the robot's odometry.

Learning scene representations in realistic environments

Result of T2.4. It includes a first prototype implementation of tools for modeling scenes (e.g. objects, affordances, people, behaviors) in order to support the semantic localization module from T2.1.

Audio-visual speaker tracking in realistic environments

Midway result of T3.1 (Audio-Visual Speaker Detection and Tracking). This deliverable will present the software to detect and track speakers in realistic environments.

Complete software architecture for SPRING-REEM

This Deliverable contains the complete SW architecture (modules and applications integration), after the final Software Integration Cycle.

Audio-visual speaker tracking in relevant environments

Final result of T3.1 (Audio-Visual Speaker Detection and Tracking). This deliverable will present the software to detect and track speakers in relevant environments.

High-Level task planner in relevant environments

Software package interleaving the robot application (direct interaction with users) and the low level modules in relevant environments.

Audio-visual data simulator

Result of task T2.2 (Audio-Visual data simulator). A synthetic data simulator will be developed that is able to generate realistic auditory and visual data. This simulator will build upon the interactive data collected in T1.2.

Multi-modal behaviour recognition in realistic environments

Result of T4.2. First prototype implementation of tools for single-target behaviour recognition and for group-level behaviour analysis, integrating information about face and body motions derived from audio-visual data (T4.1), objects in the scene (T2.3), and proxemics features.

Multi-modal behaviour recognition in relevant environments

Result of T4.2. Fully functional human behaviour recognition framework, including advanced components for individual- and group-level analysis, validated in relevant environments.

Learning scene representations in relevant environments

Result of T2.3. Implementation of learning a scene representation to facilitate localization of a robot in realistic indoor environments.

Initial high-level task planner and conversational system prototype for realistic environments

Will deliver software serving as the initial conversational system and high-level task planner prototype for realistic environments (T5.1).

Audio speaker diarisation and extraction with a static robot

Result of T3.2. A software package, implemented for a static robot, for audio speaker diarisation and extraction.

Multi-party ASR and conversational system in realistic environments

Result of T5.3. A software package for multi-party ASR and conversational system in realistic environments, using a preliminary speaker separation algorithm.

Final neural network architectures

Result of T6.1. This deliverable will present the software of the final neural architectures used in SPRING.

End-project workshop report

Result of T8.4. This document will contain the report on the two-day workshop that will take place in Paris at the end of the project. It will include the feedback from scientists, stakeholders and potential end-users after presentation of the scientific and technological achievements related to SPRING.

User feedback from the intermediate validation (realistic/relevant environments)

Result of T1.4. Report on the user feedback obtained in the intermediate experiments in both realistic and relevant environments.

User feedback from the final validation (relevant environments)

Result of T1.5. The document will contain the SPRING qualitative and quantitative analysis of end-user feedback (usability, acceptance, usefulness) during interactions with the robot in the relevant environments (day care hospital, hospitalization wards).

Initial Advisory Board recommendations

Document reporting the recommendations of the Advisory Board after its first meeting.

Intermediate Advisory Board recommendations

Document reporting the recommendations of the Advisory Board after its second meeting.

Specifications of the generator of situated interactions

Initial report of T6.2 (Generation of Multi-party Situated Interactions). In this report we will provide the specifications for the generator of interactive data.

Mid-point report on organised scientific events and future plans

We will summarize the dissemination and communication actions taken in scientific events and describe the plan for the rest of the project duration.

User feedback from the preliminary validation (realistic environments)

Result of T1.3. Report on the user feedback from the various experiments run in realistic environments (laboratory).

Emotional and robot acceptance analysis in relevant environments

Result of T4.3. This document will contain SPRING robotic platform evaluation results related to the analysis of the affective state of the user(s) interacting with the robot and specifically to the level of acceptance of the robot.

Dissemination and communication strategy

Report on the communication plan and on the various actions to be taken to promote the project, as well as its results, in the targeted sectors.

Report on design and evaluation of the multi-user social conversational and planning system

Will deliver the final report on the design of the delivered multi-party conversational system software and its evaluation in the application area.

Final Advisory Board recommendations for the future platform

Document reporting the recommendations of the Advisory Board after its third and last meeting.

Privacy and Ethics guidelines for experimental validation and data collection

The deliverable will contain the SPRING documents necessary for handling Ethics and Privacy issues, for instance the approval of the local Ethical committees and the GDPR compliance report for all experimental validation and data collection in the project.

Website and online presence

Result of T8.1. We will present the various actions taken to ensure the online presence of SPRING, the basis for the dissemination and communication of the associated results.

Publications

Intérêt de la robotique sociale et d’assistance auprès des sujets âgés (Interest in social and assistive robotics for elderly subjects)

Authors: Maribel Pino, Sébastien Dacunha, Étienne Berger, Anna Goncalves, Anne-Sophie Rigaud
Published in: Actualités Pharmaceutiques, Issue 60, 611, 2021, Page(s) 36-39, ISSN 0515-3700
Publisher: Elsevier BV
DOI: 10.1016/j.actpha.2021.10.010

Globally Optimal Solution to Inverse Kinematics of 7DOF Serial Manipulator

Authors: Pavel Trutman; Mohab Safey El Din; Didier Henrion; Tomas Pajdla
Published in: IEEE Robotics and Automation Letters, 2022, ISSN 2377-3766
Publisher: IEEE
DOI: 10.48550/arxiv.2007.12550

SocialInteractionGAN: Multi-person Interaction Sequence Generation

Authors: Airale, Louis; Vaufreydaz, Dominique; Alameda-Pineda, Xavier
Published in: Transactions on Automatic Control, 2021, ISSN 1558-2523
Publisher: IEEE

Audio source separation by activity probability detection with maximum correlation and simplex geometry

Authors: Bracha Laufer-Goldshtein; Ronen Talmon; Sharon Gannot
Published in: EURASIP Journal on Audio, Speech, and Music Processing, Issue 2021, 5, 2021, ISSN 1687-4722
Publisher: Springer
DOI: 10.1186/s13636-021-00195-7

Variational Meta Reinforcement Learning for Social Robotics

Authors: Ballou, Anand; Alameda-Pineda, Xavier; Reinke, Chris
Published in: Applied Intelligence, 2023, ISSN 0924-669X
Publisher: Kluwer Academic Publishers
DOI: 10.48550/arxiv.2206.03211

Galois/monodromy groups for decomposing minimal problems in 3D reconstruction

Authors: Duff, Timothy; Korotynskiy, Viktor; Pajdla, Tomas; Regan, Margaret H.
Published in: SIAM Journal on Applied Algebra and Geometry, 2022, ISSN 2470-6566
Publisher: Society for Industrial and Applied Mathematics
DOI: 10.1137/21m142287

The hybrid Cramér-Rao lower bound for simultaneous self-localization and room geometry estimation

Authors: Maya Veisman; Yair Noam; Sharon Gannot
Published in: EURASIP Journal on Advances in Signal Processing, Issue Vol 2021, Iss 1, Pp 1-22 (2021), 2021, ISSN 1687-6180
Publisher: Springer
DOI: 10.1186/s13634-020-00702-6

Variational Inference and Learning of Piecewise linear Dynamical Systems.

Authors: Xavier Alameda-Pineda; Vincent Drouard; Radu Horaud
Published in: IEEE Transactions on Neural Networks and Learning Systems, Issue 2020, 2020, ISSN 2162-2388
Publisher: IEEE
DOI: 10.1109/tnnls.2021.3054407

Orthogonal SVD Covariance Conditioning and Latent Disentanglement

Authors: Yue Song, Nicu Sebe, Wei Wang
Published in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, ISSN 0162-8828
Publisher: Institute of Electrical and Electronics Engineers
DOI: 10.1109/tpami.2022.3228979

Unsupervised High-Resolution Portrait Gaze Correction and Animation

Authors: Jichao Zhang, Jingjing Chen, Hao Tang, Enver Sangineto, Peng Wu, Yan Yan, Nicu Sebe, Wei Wang
Published in: IEEE Transactions on Image Processing, 2022, ISSN 1057-7149
Publisher: Institute of Electrical and Electronics Engineers
DOI: 10.1109/tip.2022.3191852

A Robot-Mediated Activity Using the Nao Robot to Promote COVID-19 Precautionary Measures among Older Adults in Geriatric Facilities

Authors: Lauriane Blavette; Anne-Sophie Rigaud; Salvatore Maria Anzalone; Clément Kergueris; Baptiste Isabet; Sébastien Dacunha; Maribel Pino
Published in: International Journal of Environmental Research and Public Health; Volume 19; Issue 9; Pages: 5222, 2022, ISSN 1661-7827
Publisher: Multidisciplinary Digital Publishing Institute (MDPI)
DOI: 10.3390/ijerph19095222

Unleashing the Transferability Power of Unsupervised Pre-Training for Emotion Recognition in Masked and Unmasked Facial Images

Authors: Moreno Dinca; Cigdem Beyan; Radoslaw Niewiadomski; Simone Barattin; Nicu Sebe
Published in: IEEE Access, Vol 11, Pp 90876-90890 (2023), 2023, ISSN 2169-3536
Publisher: Institute of Electrical and Electronics Engineers Inc.
DOI: 10.1109/access.2023.3308047

Learning and controlling the source-filter representation of speech with a variational autoencoder

Authors: Samir Sadok; Simon Leglaive; Laurent Girin; Xavier Alameda-Pineda; Renaud Séguier
Published in: Speech Communication, Issue 2, 2023, ISSN 0167-6393
Publisher: Elsevier BV
DOI: 10.48550/arxiv.2204.07075

Semi-Supervised Source Localization in Reverberant Environments With Deep Generative Modeling

Authors: Michael J. Bianco; Sharon Gannot; Efren Fernandez-Grande; Peter Gerstoft
Published in: IEEE Access, Issue Vol 9, Pp 84956-84970 (2021), 2021, ISSN 2169-3536
Publisher: Institute of Electrical and Electronics Engineers Inc.
DOI: 10.1109/access.2021.3087697

Multi-frame Motion Segmentation by Combining Two-Frame Results

Authors: Federica Arrigoni; Elisa Ricci; Tomas Pajdla
Published in: International Journal of Computer Vision, 2022, ISSN 0920-5691
Publisher: Kluwer Academic Publishers
DOI: 10.1007/s11263-021-01544-x

Multi-frame Motion Segmentation by Combining Two-Frame Results

Authors: Federica Arrigoni, Elisa Ricci & Tomas Pajdla
Published in: International Journal of Computer Vision, 2023, ISSN 0920-5691
Publisher: Kluwer Academic Publishers

Forward-backward recursive expectation-maximization for concurrent speaker tracking

Authors: Dorfan, Y., Schwartz, B. & Gannot, S.
Published in: EURASIP Journal on Audio, Speech, and Music Processing, Issue 2021, 2 (2021), 2021, ISSN 1687-4722
Publisher: Springer
DOI: 10.1186/s13636-020-00189-x

Research directions at the Interaction Lab

Authors: Oliver Lemon
Published in: AI Communications, vol. 35, no. 4, pp. 295-308, 2022, Page(s) Conversational AI for multi-agent communication in Natural Language, ISSN 0921-7126
Publisher: IOS Press
DOI: 10.3233/aic-220147

TransCenter: Transformers With Dense Representations for Multiple-Object Tracking

Authors: Yihong Xu; Yutong Ban; Guillaume Delorme; Chuang Gan; Daniela Rus; Xavier Alameda-Pineda
Published in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, ISSN 0162-8828
Publisher: Institute of Electrical and Electronics Engineers
DOI: 10.1109/tpami.2022.3225078

SocialInteractionGAN: Multi-person Interaction Sequence Generation

Authors: Louis Airale; Dominique Vaufreydaz; Xavier Alameda-Pineda
Published in: IEEE Transactions on Affective Computing, 2021, ISSN 1949-3045
Publisher: Institute of Electrical and Electronics Engineers
DOI: 10.48550/arxiv.2103.05916

Successor Feature Representations

Authors: Reinke, Chris; Alameda-Pineda, Xavier
Published in: Transactions on Machine Learning Research, 2023, ISSN 2835-8856
Publisher: OpenReview
DOI: 10.48550/arxiv.2110.15701

Dynamical Variational Autoencoders: A Comprehensive Review

Authors: Laurent Girin, Simon Leglaive, Xiaoyu Bie, Julien Diard, Thomas Hueber, Xavier Alameda-Pineda
Published in: Foundations and Trends in Machine Learning, Issue 15, 1-2, 2021, ISSN 1935-8237
Publisher: Now Publishers Inc.
DOI: 10.1561/2200000089

Dynamically localizing multiple speakers based on the time-frequency domain

Authors: Hodaya Hammer; Shlomo E. Chazan; Jacob Goldberger; Sharon Gannot
Published in: EURASIP Journal on Audio, Speech, and Music Processing, Issue Vol 2021, Iss 1, Pp 1-10 (2021), 2021, ISSN 1687-4722
Publisher: Springer
DOI: 10.1186/s13636-021-00203-w

Uncertainty-aware Contrastive Distillation for Incremental Semantic Segmentation

Authors: Guanglei Yang; Enrico Fini; Dan Xu; Paolo Rota; Mingli Ding; Moin Nabi; Xavier Alameda-Pineda; Elisa Ricci
Published in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, ISSN 0162-8828
Publisher: Institute of Electrical and Electronics Engineers
DOI: 10.48550/arxiv.2203.14098

Semi-Supervised Multiple Source Localization Using Relative Harmonic Coefficients Under Noisy and Reverberant Environments

Authors: Y. Hu, P. N. Samarasinghe, S. Gannot and T. D. Abhayapala
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, 2020, pp. 3108-3123, ISSN 2329-9304
Publisher: IEEE
DOI: 10.1109/taslp.2020.3037521

Variational Structured Attention Networks for Deep Visual Representation Learning

Authors: Guanglei Yang; Paolo Rota; Xavier Alameda-Pineda; Dan Xu; Mingli Ding; Elisa Ricci
Published in: IEEE Transactions on Image Processing, 2022, ISSN 1057-7149
Publisher: Institute of Electrical and Electronics Engineers
DOI: 10.1109/tip.2021.3137647

Continual Attentive Fusion for Incremental Learning in Semantic Segmentation

Authors: Guanglei Yang; Enrico Fini; Dan Xu; Paolo Rota; Mingli Ding; Tang Hao; Xavier Alameda-Pineda; Elisa Ricci
Published in: IEEE Transactions on Multimedia, 2022, ISSN 1520-9210
Publisher: Institute of Electrical and Electronics Engineers
DOI: 10.48550/arxiv.2202.00432

Training-Based Multiple Source Tracking Using Manifold-Learning and Recursive Expectation-Maximization

Authors: Avital Bross; Sharon Gannot
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, ISSN 2329-9290
Publisher: IEEE Advancing Technology for Humanity
DOI: 10.1109/taslp.2023.3245414

Expression-preserving face frontalization improves visually assisted speech processing

Authors: Zhiqi Kang; Mostafa Sadeghi; Radu Horaud; Xavier Alameda-Pineda
Published in: International Journal of Computer Vision, 2023, ISSN 0920-5691
Publisher: Kluwer Academic Publishers
DOI: 10.48550/arxiv.2204.02810

Simultaneous Tracking and Separation of Multiple Sources Using Factor Graph Model

Authors: Koby Weisberg; Bracha Laufer-Goldshtein; Sharon Gannot
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, Issue 28, 2020, Page(s) 2848-2864, ISSN 2329-9290
Publisher: IEEE Advancing Technology for Humanity
DOI: 10.1109/taslp.2020.3028650

Estimation of acoustic echoes using expectation-maximization methods

Authors: Saqib, U., Gannot, S. & Jensen, J.
Published in: EURASIP Journal on Audio, Speech, and Music Processing, Issue 2020, 12 (2020), 2020, ISSN 1687-4722
Publisher: Springer
DOI: 10.1186/s13636-020-00179-z

Unsupervised Speech Enhancement using Dynamical Variational Autoencoders

Authors: Xiaoyu Bie; Simon Leglaive; Xavier Alameda-Pineda; Laurent Girin
Published in: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022, ISSN 2329-9290
Publisher: IEEE Advancing Technology for Humanity
DOI: 10.1109/taslp.2022.3207349

A recursive expectation-maximization algorithm for speaker tracking and separation

Authors: Schwartz, O., Gannot, S.
Published in: EURASIP Journal on Audio, Speech, and Music Processing, Issue 2021, 43, 2021, ISSN 1687-4722
Publisher: Springer
DOI: 10.1186/s13636-021-00228-1

An online algorithm for echo cancellation, dereverberation and noise reduction based on a Kalman-EM Method

Authors: Nili Cohen; Gershon Hazan; Boaz Schwartz; Sharon Gannot
Published in: EURASIP Journal on Audio, Speech, and Music Processing, Issue 2021, 33, 2021, ISSN 1687-4722
Publisher: Springer
DOI: 10.1186/s13636-021-00219-2

Mixture of Inference Networks for VAE-based Audio-visual Speech Enhancement

Authors: M. Sadeghi and X. Alameda-Pineda
Published in: IEEE Transactions on Signal Processing, vol. 69, 2021, pp. 1899-1909, ISSN 1941-0476
Publisher: IEEE
DOI: 10.1109/tsp.2021.3066038

Viewing Graph Solvability via Cycle Consistency

Authors: Federica Arrigoni, Andrea Fusiello, Elisa Ricci, Tomas Pajdla
Published in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Issue 2021, 2021, Page(s) 5540-5549
Publisher: IEEE

A Multimodal Dynamical Variational Autoencoder for Audiovisual Speech Representation Learning

Authors: Sadok, Samir; Leglaive, Simon; Girin, Laurent; Alameda-Pineda, Xavier; Séguier, Renaud
Published in: M4MM '22: Proceedings of the 1st International Workshop on Methodologies for Multimedia, 2022
Publisher: ACM
DOI: 10.48550/arxiv.2305.03582

Array Configuration Mismatch in Deep DOA Estimation: Towards Robust Training

Authors: Ayal Schwartz; Elior Hadad; Sharon Gannot; Shlomo E. Chazan
Published in: 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023
Publisher: IEEE
DOI: 10.1109/waspaa58266.2023.10248048

Vocabulary-free Image Classification

Authors: Conti, Alessandro; Fini, Enrico; Mancini, Massimiliano; Rota, Paolo; Wang, Yiming; Ricci, Elisa
Published in: NeurIPS2023, Issue 1, 2024
Publisher: NeurIPS
DOI: 10.48550/arxiv.2306.00917

Uncertainty-Guided Source-Free Domain Adaptation

Authors: Subhankar Roy, Martin Trapp, Andrea Pilzer, Juho Kannala, Nicu Sebe, Elisa Ricci, Arno Solin
Published in: ECCV 2022, 2022
Publisher: ECVA
DOI: 10.48550/arxiv.2208.07591

Multi-Person Extreme Motion Prediction

Authors: Wen Guo; Xiaoyu Bie; Xavier Alameda-Pineda; Francesc Moreno-Noguer
Published in: IEEE/CVF Conference on Computer Vision and Pattern Recognition, Issue 2, 2022
Publisher: IEEE
DOI: 10.48550/arxiv.2105.08825

Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer

Authors: Haoyu Chen, Hao Tang, Henglin Shi, Wei Peng, Nicu Sebe, Guoying Zhao
Published in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Issue 2021, 2021, Page(s) 8630-8639
Publisher: IEEE

Developing a Social Conversational Robot for the Hospital waiting room

Authors: Nancie Gunson, Daniel Hernandez Garcia, Weronika Sieinska, Christian Dondrup, Oliver Lemon
Published in: RO-MAN 2022, 2022
Publisher: IEEE
DOI: 10.1109/ro-man53752.2022.9900827

Novel Class Discovery in Semantic Segmentation

Authors: Yuyang Zhao, Zhun Zhong, Nicu Sebe, Gim Hee Lee
Published in: 2022
Publisher: CVF
DOI: 10.48550/arxiv.2112.01900

Neighborhood Contrastive Learning for Novel Class Discovery

Authors: Zhong, Zhun; Fini, Enrico; Subhankar Roy; Zhiming Luo; Ricci, Elisa; Sebe, Nicu
Published in: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Issue 2021, 2021, Page(s) 10867-10875
Publisher: IEEE
DOI: 10.5281/zenodo.5014108

Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction

Authors: Guanglei Yang, Hao Tang, Mingli Ding, Nicu Sebe, Elisa Ricci
Published in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Issue 2021, 2021, Page(s) 16269-16279
Publisher: IEEE

A Unified Objective for Novel Class Discovery

Authors: Enrico Fini, Enver Sangineto, Stéphane Lathuilière, Zhun Zhong, Moin Nabi, Elisa Ricci
Published in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Issue 2021, 2021, Page(s) 9284-9292
Publisher: IEEE

Combining Visual and Social Dialogue for Human-Robot Interaction

Authors: Nancie Gunson; Daniel Hernandez Garcia; Jose L. Part; Yanchao Yu; Weronika Sieińska; Christian Dondrup; Oliver Lemon
Published in: ICMI '21: Proceedings of the 2021 International Conference on Multimodal Interaction, Issue 2021, 2021, Page(s) 841-842
Publisher: ICMI
DOI: 10.1145/3462244.3481303

Cluster-level pseudo-labelling for source-free cross-domain facial expression recognition

Authors: Alessandro Conti; Paolo Rota; Yiming Wang; Elisa Ricci
Published in: BMVC 2022, 2022
Publisher: The British Machine Vision Association and Society for Pattern Recognition
DOI: 10.5281/zenodo.7296310

RankFeat: Rank-1 Feature Removal for Out-of-distribution Detection

Authors: Yue Song, Nicu Sebe, Wei Wang
Published in: NeurIPS 2022, 2022
Publisher: OpenReview
DOI: 10.48550/arxiv.2209.08590

Multiple Speaker Localization using Mixture of Gaussian Model with Manifold-based Centroids

Authors: A. Bross, B. Laufer-Goldshtein and S. Gannot
Published in: 2020 28th European Signal Processing Conference (EUSIPCO), Issue 2020, 2021, Page(s) 895-899
Publisher: IEEE
DOI: 10.23919/eusipco47968.2020.9287796

OpenMix: Reviving Known Knowledge for Discovering Novel Visual Categories in an Open World

Authors: Zhun Zhong, Linchao Zhu, Zhiming Luo, Shaozi Li, Yi Yang, Nicu Sebe
Published in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Publisher: IEEE

Understanding and Answering Incomplete Questions

Authors: Angus Addlesee; Marco Damonte
Published in: CUI '23: Proceedings of the 5th International Conference on Conversational User Interfaces, 2023
Publisher: ACM
DOI: 10.1145/3571884.3597133

Misalignment Recognition in Acoustic Sensor Networks Using a Semi-Supervised Source Estimation Method and Markov Random Fields

Authors: Miller, Gabriel F; Brendel, Andreas; Kellermann, Walter; Gannot, Sharon
Published in: ICASSP, Issue 2021, 2021
Publisher: ICASSP
DOI: 10.1109/icassp39728.2021.9413765

Self-Supervised Models are Continual Learners

Authors: Fini, Enrico; da Costa, Victor G. Turrisi; Alameda-Pineda, Xavier; Ricci, Elisa; Alahari, Karteek; Mairal, Julien
Published in: CVPR 2022 - IEEE/CVF Conference on Computer Vision and Pattern Recognition, Issue 6, 2022
Publisher: CVF
DOI: 10.48550/arxiv.2112.04215

A Bayesian Hierarchical Mixture of Gaussian Model for Multi-Speaker DOA Estimation and Separation

Authors: Y. Laufer and S. Gannot
Published in: 2020 IEEE 30th International Workshop on Machine Learning for Signal Processing (MLSP), Issue 2020, 2020, Page(s) 1-6
Publisher: IEEE
DOI: 10.1109/mlsp49062.2020.9231852

Curriculum Graph Co-Teaching for Multi-Target Domain Adaptation

Authors: Subhankar Roy, Evgeny Krivosheev, Zhun Zhong, Nicu Sebe, Elisa Ricci
Published in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Issue 2021, 2021, Page(s) 5351-5360
Publisher: IEEE

Probabilistic Fusion of Persons' Body Features: The Mr. Potato Algorithm

Authors: Séverin Lemaignan, Lorenzo Ferrini
Published in: HRI '24: Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024
Publisher: ACM
DOI: 10.1145/3610977.3637479

OpenMix: Reviving Known Knowledge for Discovering Novel Visual Categories in an Open World

Authors: Zhun Zhong; Linchao Zhu; Zhiming Luo; Shaozi Li; Yi Yang; Nicu Sebe
Published in: CVPR 2021, 2021
Publisher: CVF
DOI: 10.1109/cvpr46437.2021.00934

ROS4HRI: Standardising an Interface for Human-Robot Interaction

Authors: Raquel Ros, Séverin Lemaignan, Lorenzo Ferrini, Antonio Andriella, Aina Irisarri
Published in: HRI 2023 Workshop, 2023
Publisher: IEEE

From two rolling shutters to one global shutter

Authors: Albl, Cenek; Kukelova, Zuzana; Larsson, Viktor; Pajdla, Tomas; Schindler, Konrad
Published in: 2020
Publisher: CVF
DOI: 10.48550/arxiv.2006.01964

The Unreasonable Effectiveness of Large Language-Vision Models for Source-free Video Domain Adaptation

Authors: Zara, Giacomo; Conti, Alessandro; Roy, Subhankar; Lathuilière, Stéphane; Rota, Paolo; Ricci, Elisa
Published in: 2023 IEEE/CVF International Conference on Computer Vision (ICCV), 2023
Publisher: IEEE
DOI: 10.1109/iccv51070.2023.00946

FurChat: An Embodied Conversational Agent using LLMs, Combining Open and Closed-Domain Dialogue with Facial Expressions

Authors: Cherakara, Neeraj; Varghese, Finny; Shabana, Sheena; Nelson, Nivan; Karukayil, Abhiram; Kulothungan, Rohith; Farhan, Mohammed Afil; Nesset, Birthe; Moujahid, Meriam; Dinkar, Tanvi; Rieser, Verena; Lemon, Oliver
Published in: Proceedings of the 24th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2023
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/2023.sigdial-1.55

Viewing Graph Solvability via Cycle Consistency

Authors: Federica Arrigoni, Andrea Fusiello, Elisa Ricci, Tomas Pajdla
Published in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2022
Publisher: IEEE
DOI: 10.1109/iccv48922.2021.00549

Open-source Natural Language Processing on the PAL Robotics ARI Social Robot

Authors: Séverin Lemaignan, Sara Cooper, Raquel Ros, Lorenzo Ferrini, Antonio Andriella, Aina Irisarri
Published in: HRI '23: Companion of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, 2023
Publisher: ACM

3D-Aware Semantic-Guided Generative Model for Human Synthesis

Authors: Jichao Zhang, Enver Sangineto, Hao Tang, Aliaksandr Siarohin, Zhun Zhong, Nicu Sebe, Wei Wang
Published in: ECCV 2022, 2022
Publisher: ECVA
DOI: 10.48550/arxiv.2112.01422

Unifying Bottom-Up and Top-Down Attention Models for Social Robots

Authors: S. Lemaignan; S. Cooper; R. Ros; L. Ferrini; A. Andriella; A. Irisarri
Published in: HRI 2023, 2023
Publisher: ACM
DOI: 10.1145/3568294.3580041

Semi-Supervised Source Localization with Deep Generative Modeling

Authors: Michael J. Bianco; Sharon Gannot; Peter Gerstoft
Published in: 2020 IEEE 30th International Workshop on Machine Learning for Signal Processing (MLSP), 2021
Publisher: IEEE
DOI: 10.1109/mlsp49062.2020.9231825

Multi-party Goal Tracking with LLMs: Comparing Pre-training, Fine-tuning, and Prompt Engineering

Authors: Addlesee, Angus; Sieińska, Weronika; Gunson, Nancie; Garcia, Daniel Hernández; Dondrup, Christian; Lemon, Oliver
Published in: Proceedings of the 24th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2023
Publisher: ACL
DOI: 10.18653/v1/2023.sigdial-1.22

Dataset and Evaluation of Automatic Speech Recognition for Multi-lingual Intent Recognition on Social Robots

Authors: Antonio Andriella, Raquel Ros, Yoav Ellinson, Sharon Gannot, Séverin Lemaignan
Published in: HRI '24: Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024
Publisher: ACM
DOI: 10.1145/3610977.3637473

Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer

Authors: Haoyu Chen; Hao Tang; Henglin Shi; Wei Peng; Nicu Sebe; Guoying Zhao
Published in: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2021
Publisher: CVF
DOI: 10.48550/arxiv.2108.07520

Learning to Generalize Unseen Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification

Authors: Yuyang Zhao, Zhun Zhong, Fengxiang Yang, Zhiming Luo, Yaojin Lin, Shaozi Li, Nicu Sebe
Published in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Issue 2021, 2021, Page(s) 6277-6286
Publisher: IEEE

Semi-supervised learning made simple with self-supervised clustering

Authors: Fini, Enrico; Astolfi, Pietro; Alahari, Karteek; Alameda-Pineda, Xavier; Mairal, Julien; Nabi, Moin; Ricci, Elisa
Published in: 2023
Publisher: CVF
DOI: 10.48550/arxiv.2306.07483

Minimal Rolling Shutter Absolute Pose with Unknown Focal Length and Radial Distortion

Authors: Kukelova, Zuzana; Albl, Cenek; Sugimoto, Akihiro; Schindler, Konrad; Pajdla, Tomas
Published in: European Conference on Computer Vision, Issue 2020, 2020, Page(s) 698-714
Publisher: Springer
DOI: 10.5281/zenodo.4335228

Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?

Authors: Yue Song, Nicu Sebe, Wei Wang
Published in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Issue 2021, 2021, Page(s) 1115-1123
Publisher: IEEE

Deep Ranking-Based DOA Tracking Algorithm

Authors: R. Opochinsky, G. Chechik and S. Gannot
Published in: 2021 29th European Signal Processing Conference (EUSIPCO), Issue 2021, 2021, Page(s) 1020-1024
Publisher: IEEE
DOI: 10.23919/eusipco54536.2021.9616297

Optimizing Elimination Templates by Greedy Parameter Search

Authors: Evgeniy Martyushev, Jana Vrablikova, Tomas Pajdla
Published in: 2022
Publisher: CVF
DOI: 10.48550/arxiv.2203.14901

Uncertainty Based Camera Model Selection

Authors: Michal Polic, Stanislav Steidl, Cenek Albl, Zuzana Kukelova, Tomas Pajdla
Published in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Issue 2020, 2020, Page(s) 5991-6000
Publisher: IEEE

A Bayesian Hierarchical Model for Blind Audio Source Separation

Authors: Y. Laufer and S. Gannot
Published in: 2020 28th European Signal Processing Conference (EUSIPCO), Issue 2020, 2021, Page(s) 276-280
Publisher: IEEE
DOI: 10.23919/eusipco47968.2020.9287348

Speech enhancement with mixture-of-deep-experts with clean clustering pre-training

Authors: Chazan, Shlomo E.; Goldberger, Jacob; Gannot, Sharon
Published in: ICASSP, Issue 2021, 2021
Publisher: ICASSP

Single microphone speaker extraction using unified time-frequency Siamese-Unet

Authors: Aviad Eisenberg; Sharon Gannot; Shlomo E. Chazan
Published in: 2022 30th European Signal Processing Conference (EUSIPCO), 2022
Publisher: IEEE
DOI: 10.23919/eusipco55093.2022.9909545

Novel Class Discovery in Semantic Segmentation

Authors: Yuyang Zhao; Zhun Zhong; Nicu Sebe; Gim Hee Lee
Published in: CVPR 2022, 2022
Publisher: Computer Vision Foundation
DOI: 10.5281/zenodo.7100314

Multimodal Emotion Recognition with Modality-Pairwise Unsupervised Contrastive Loss

Authors: Riccardo Franceschini, Enrico Fini, Cigdem Beyan, Alessandro Conti, Federica Arrigoni, Elisa Ricci
Published in: ICPR 2022, 2022
Publisher: IEEE
DOI: 10.1109/icpr56361.2022.9956589

The impact of removing head movements on audio-visual speech enhancement

Authors: Zhiqi Kang, Mostafa Sadeghi, Radu Horaud, Xavier Alameda-Pineda, Jacob Donley, Anurag Kumar
Published in: ICASSP 2022, 2022
Publisher: IEEE
DOI: 10.1109/icassp43922.2022.9746401

ARI: the Social Assistive Robot and Companion

Authors: Sara Cooper; Alessandro Di Fava; Carlos Vivas; Luca Marchionni; Francesco Ferro
Published in: 2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), 2020
Publisher: IEEE
DOI: 10.1109/ro-man47096.2020.9223470

Blind Audio Source Separation Using Two Expectation-Maximization Algorithms

Authors: A. Eisenberg, B. Schwartz and S. Gannot
Published in: 2020 IEEE 30th International Workshop on Machine Learning for Signal Processing (MLSP), Issue 2020, 2020, Page(s) 1-6
Publisher: IEEE
DOI: 10.1109/mlsp49062.2020.9231931

Evaluation and Design Guidelines for Combining Open-Domain Social Conversation with Task-Based Dialogue in Intelligent Buildings

Authors: Nancie Gunson; Weronika Sieinska; Christopher Walsh; Christian Dondrup; Oliver Lemon
Published in: IVA '20: Proceedings of the 20th ACM International Conference on Intelligent Virtual Agents, Issue 2020, 2020
Publisher: Association for Computing Machinery
DOI: 10.1145/3383652.3423889

Online Blind Audio Source Separation using Recursive Expectation-Maximization

Authors: Aviad Eisenberg, Boaz Schwartz and Sharon Gannot
Published in: INTERSPEECH 2021, 2021
Publisher: ISCA
DOI: 10.21437/interspeech.2021-662

Using monodromy to recover symmetries of polynomial systems

Authors: Timothy Duff; Viktor Korotynskiy; Tomas Pajdla; Margaret Regan
Published in: ISSAC '23: Proceedings of the 2023 International Symposium on Symbolic and Algebraic Computation, 2023
Publisher: ACM
DOI: 10.1145/3597066.3597106

A Visually-Aware Conversational Robot Receptionist

Authors: Nancie Gunson, Daniel Hernandez Garcia, Weronika Sieińska, Angus Addlesee, Christian Dondrup, Oliver Lemon, Jose L. Part, Yanchao Yu
Published in: SIGdial 2022, 2022
Publisher: ACL

Scene-Agnostic Multi-Microphone Speech Dereverberation

Authors: Yochai Yemini, Ethan Fetaya, Haggai Maron, Sharon Gannot
Published in: INTERSPEECH 2021, Issue 2021, 2021
Publisher: ISCA
DOI: 10.21437/interspeech.2021-889

Class-incremental Novel Class Discovery

Authors: Subhankar Roy, Mingxuan Liu, Zhun Zhong, Nicu Sebe, Elisa Ricci
Published in: ECCV 2022, 2022
Publisher: ECVA
DOI: 10.48550/arxiv.2207.08605

Explainable Representations of the Social State: A Model for Social Human-Robot Interactions

Authors: García, Daniel Hernández; Yu, Yanchao; Sieińska, Weronika; Part, Jose L.; Gunson, Nancie; Lemon, Oliver; Dondrup, Christian
Published in: AAAI FSS-20 AI-HRI, 2020
Publisher: Association for the Advancement of Artificial Intelligence

Multimodal Across Domains Gaze Target Detection

Authors: Francesco Tonini; Cigdem Beyan; Elisa Ricci
Published in: ICMI '22: Proceedings of the 2022 International Conference on Multimodal Interaction, 2022
Publisher: ACM
DOI: 10.48550/arxiv.2208.10822

Unsupervised Domain Adaptation for Video Transformers in Action Recognition

Authors: Victor G. Turrisi da Costa, Giacomo Zara, Paolo Rota, Thiago Oliveira-Santos, Nicu Sebe, Vittorio Murino, Elisa Ricci
Published in: ICPR 2022, 2022
Publisher: IEEE
DOI: 10.48550/arxiv.2207.12842

On The Importance of Acoustic Reflections in Beamforming

Authors: Oren Shmaryahu; Sharon Gannot
Published in: IWAENC 2022, 2022
Publisher: IEEE
DOI: 10.1109/iwaenc53105.2022.9914749

Learning to Generalize Unseen Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification

Authors: Yuyang Zhao; Zhun Zhong; Fengxiang Yang; Zhiming Luo; Yaojin Lin; Shaozi Li; Nicu Sebe
Published in: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, 2021
Publisher: CVF
DOI: 10.5281/zenodo.5014450

It's Good to Chat?

Authors: Gunson, Nancie; Sieińska, Weronika; Walsh, Christopher; Dondrup, Christian; Lemon, Oliver
Published in: ISBN 9781450375863, Issue 1, 2020
Publisher: Association for Computing Machinery

SimpleMTOD: A Simple Language Model for Multimodal Task-Oriented Dialogue with Symbolic Scene Representation

Authors: Hemanthage, Bhathiya; Dondrup, Christian; Bartie, Phil; Lemon, Oliver
Published in: Proceedings of the 15th International Conference on Computational Semantics, 2023
Publisher: ACL
DOI: 10.48550/arxiv.2307.04907

Synchronization of Group-Labelled Multi-Graphs

Authors: Andrea Porfiri Dal Cin, Luca Magri, Federica Arrigoni, Andrea Fusiello, Giacomo Boracchi
Published in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Issue 2021, 2021, Page(s) 6453-6463
Publisher: IEEE

The Impact of Removing Head Movements on Audio-visual Speech Enhancement

Authors: Zhiqi Kang; Mostafa Sadeghi; Radu Horaud; Xavier Alameda-Pineda; Jacob Donley; Anurag Kumar
Published in: 2022
Publisher: ICASSP
DOI: 10.48550/arxiv.2202.00538

Making Affine Correspondences Work in Camera Geometry Computation

Authors: Barath, Daniel; Polic, Michal; Förstner, Wolfgang; Sattler, Torsten; Pajdla, Tomas; Kukelova, Zuzana
Published in: European Conference on Computer Vision, Issue 2020, 2020, Page(s) 723-740
Publisher: Springer
DOI: 10.1007/978-3-030-58621-8_42

Fast Differentiable Matrix Square Root

Authors: Yue Song, Nicu Sebe, Wei Wang
Published in: ICLR 2022, 2023
Publisher: OpenReview
DOI: 10.48550/arxiv.2201.08663

MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization

Authors: Jiahui Huang, He Wang, Tolga Birdal, Minhyuk Sung, Federica Arrigoni, Shi-Min Hu, Leonidas J. Guibas
Published in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Issue 2021, 2021, Page(s) 7108-7118
Publisher: IEEE

A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling

Authors: Bie, Xiaoyu; Girin, Laurent; Leglaive, Simon; Hueber, Thomas; Alameda-Pineda, Xavier
Published in: Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.1-5, Issue 2021, 2021, Page(s) 46-50
Publisher: International Speech Communication Association (ISCA)
DOI: 10.21437/interspeech.2021-256

Motion Segmentation with Pairwise Matches and Unknown Number of Motions

Authors: Federica Arrigoni, Luca Magri, and Tomas Pajdla
Published in: 2020 25th International Conference on Pattern Recognition (ICPR), Issue 2020, 2020
Publisher: IEEE
DOI: 10.1109/icpr48806.2021.9413142

On the Usage of the Trifocal Tensor in Motion Segmentation

Authors: Federica Arrigoni, Luca Magri, Tomas Pajdla
Published in: ECCV’20 Online, 2020
Publisher: European Computer Vision Association (ECVA)

Learning Visual Voice Activity Detection with an Automatically Annotated Dataset

Authors: Guy, Sylvain; Lathuilière, Stéphane; Mesejo, Pablo; Horaud, Radu
Published in: ICPR 2020 - 25th International Conference on Pattern Recognition, Jan 2021, Milano / Virtual, Italy, Issue 2020, 2021, Page(s) 4851-4856
Publisher: IEEE
DOI: 10.1109/icpr48806.2021.9412884

Conversational Agents for Intelligent Buildings

Authors: Weronika Sieinska, Nancie Gunson, Christopher Walsh, Christian Dondrup, and Oliver Lemon
Published in: Proceedings of the SIGdial 2020 Conference, Issue 2020, 2020, Page(s) 45-48
Publisher: Association for Computational Linguistics

Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?

Authors: Yue Song; Nicu Sebe; Wei Wang
Published in: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2021
Publisher: CVF
DOI: 10.48550/arxiv.2105.02498

Speech Modeling with a Hierarchical Transformer Dynamical VAE

Authors: Xiaoyu Lin; Xiaoyu Bie; Simon Leglaive; Laurent Girin; Xavier Alameda-Pineda
Published in: ICASSP 2023 - IEEE International Conference on Audio, Speech and Signal Processing, 2023
Publisher: IEEE
DOI: 10.48550/arxiv.2303.09404

ODANet: Online Deep Appearance Network for Identity-Consistent Multi-person Tracking

Authors: Guillaume Delorme, Yutong Ban, Guillaume Sarrazin, Xavier Alameda-Pineda
Published in: ICPR 2021: Pattern Recognition. ICPR International Workshops and Challenges, 2021
Publisher: Springer
DOI: 10.1007/978-3-030-68780-9_60

From two rolling shutters to one global shutter

Authors: Albl, Cenek; Kukelova, Zuzana; Larsson, Viktor; Pajdla, Tomas; Schindler, Konrad
Published in: CVPR, Issue 2020, 2020
Publisher: IEEE

Robust Relative Transfer Function Identification on Manifolds for Speech Enhancement

Authors: A. Sofer, T. Kounovský, J. Čmejla, Z. Koldovský and S. Gannot
Published in: 2021 29th European Signal Processing Conference (EUSIPCO), Issue 2021, 2021, Page(s) 401-405
Publisher: IEEE
DOI: 10.23919/eusipco54536.2021.9616175

Curriculum Graph Co-Teaching for Multi-Target Domain Adaptation

Authors: Subhankar Roy; Evgeny Krivosheev; Zhun Zhong; Nicu Sebe; Elisa Ricci
Published in: CVPR 2021, 2021, ISBN 978-1-6654-4509-2
Publisher: Computer Vision Foundation
DOI: 10.5281/zenodo.5014029

A Two-Stage Speaker Extraction Algorithm Under Adverse Acoustic Conditions Using a Single-Microphone

Authors: Eisenberg, Aviad; Gannot, Sharon; Chazan, Shlomo E.
Published in: 2023 31st European Signal Processing Conference (EUSIPCO), 2023
Publisher: IEEE
DOI: 10.23919/eusipco58844.2023.10289764

Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction

Authors: Guanglei Yang; Hao Tang; Mingli Ding; Nicu Sebe; Elisa Ricci
Published in: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Issue 6, 2021
Publisher: CVF
DOI: 10.1109/iccv48922.2021.01596

Robot control and navigation: ARI’s autonomous system

Authors: Francesco Ferro, Federico Nardi, Sara Cooper, Luca Marchionni
Published in: 29th IEEE International Conference on Robot & Human Interactive Communication ROMAN-2020, 2020
Publisher: IEEE

Democratizing Fine-grained Visual Recognition with Large Language Models

Authors: Liu, Mingxuan; Roy, Subhankar; Li, Wenjing; Zhong, Zhun; Sebe, Nicu; Ricci, Elisa
Published in: ICLR 2024, 2024
Publisher: ICLR
DOI: 10.48550/arxiv.2401.13837

Decoupled Direction-of-Arrival Estimations Using Relative Harmonic Coefficients

Authors: Y. Hu, T. D. Abhayapala, P. N. Samarasinghe and S. Gannot
Published in: 2020 28th European Signal Processing Conference (EUSIPCO), Issue 2020, 2021, Page(s) 246-250
Publisher: IEEE
DOI: 10.23919/eusipco47968.2020.9287611

Rotation Synchronization via Deep Matrix Factorization

Authors: Gk Tejus; Giacomo Zara; Paolo Rota; Andrea Fusiello; Elisa Ricci; Federica Arrigoni
Published in: IEEE International Conference on Robotics and Automation (ICRA), 2023
Publisher: IEEE
DOI: 10.1109/icra48891.2023.10160548

A Large-Scale Homography Benchmark

Authors: Barath, Daniel; Mishkin, Dmytro; Polic, Michal; Förstner, Wolfgang; Matas, Jiri
Published in: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
Publisher: IEEE
DOI: 10.1109/cvpr52729.2023.02046

Object-aware Gaze Target Detection

Authors: Francesco Tonini; Nicola Dall'Asen; Cigdem Beyan; Elisa Ricci
Published in: 2023 IEEE/CVF International Conference on Computer Vision (ICCV), 2023
Publisher: IEEE
DOI: 10.1109/iccv51070.2023.01998

A Conversational AI System for Tackling Misinformation

Authors: Nancie Gunson; Weronika Sieińska; Yanchao Yu; Daniel Hernandez Garcia; Jose L. Part; Christian Dondrup; Oliver Lemon
Published in: GoodIT '21: Proceedings of the Conference on Information Technology for Social Good, Issue 2021, 2021, Page(s) 265-270
Publisher: Association for Computing Machinery
DOI: 10.1145/3462203.3475874

Towards Visual Dialogue for Human-Robot Interaction

Authors: Jose L. Part; Daniel Hernández García; Yanchao Yu; Nancie Gunson; Christian Dondrup; Oliver Lemon
Published in: HRI '21 Companion: Companion of the 2021 ACM/IEEE International Conference on Human-Robot Interaction, Issue 2021, 2021, Page(s) 670-672 (video submission)
Publisher: IEEE
DOI: 10.1145/3434074.3447278

Smoothing the Disentangled Latent Style Space for Unsupervised Image-to-Image Translation

Authors: Yahui Liu; Sangineto, Enver; Yajing Chen; Linchao Bao; Haoxian Zhang; Sebe, Nicu; Lepri, Bruno; Wang, Wei; De Nadai, Marco
Published in: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Issue 2021, 2021
Publisher: IEEE
DOI: 10.5281/zenodo.5014014

D-InLoc++: Indoor Localization in Dynamic Environments

Authors: Martina Dubenova, Anna Zderadickova, Ondrej Kafka, Tomas Pajdla, Michal Polic
Published in: Lecture Notes in Computer Science book series (LNCS, volume 13485), 2022
Publisher: Springer
DOI: 10.1007/978-3-031-16788-1_16
