2022
2021
Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering
ICCV (2021)
Multi-Target Adversarial Frameworks for Domain Adaptation in Semantic Segmentation
ICCV (2021)
Training data-efficient image transformers & distillation through attention
ICML (2021)
Semantic Palette: Guiding Scene Generation with Class Proportions
CVPR (2021)
Online Bag-of-Visual-Words Generation for Unsupervised Representation Learning
CVPR (2021)
DICE: Diversity in Deep Ensembles via Conditional Redundancy Adversarial Estimation
ICLR (2021)
2020
The Missing Data Encoder: Cross-Channel Image Completion with Hide-And-Seek Adversarial Network
AAAI (2020)
Deep Entwined Learning Head Pose and Face Alignment Inside an Attentional Cascade with Doubly-Conditional fusion
FG (2020)
SEMEDA: Enhancing Segmentation Precision with Semantic Edge Aware Loss
Pattern Recognition (2020)
2019
ADVENT: Adversarial Entropy Minimization for Domain Adaptation in Semantic Segmentation
CVPR (oral) (2019)
BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection
AAAI (2019)
Exploiting Negative Evidence for Deep Latent Structured Models
PAMI IEEE Transactions on Pattern Analysis and Machine Intelligence (2019)
2018
Revisiting Multi-Task Learning with ROCK: a Deep Residual Auxiliary Block for Visual Detection
NIPS - NeurIPS (2018)
HybridNet: Classication and Reconstruction Cooperation for Semi-Supervised Learning
ECCV (2018)
Cross-modal retrieval in the cooking context: Learning semantic text-image embeddings
SIGIR (2018)
Finding beans in burgers: Deep semantic-visual embedding with localization
CVPR (2018)
End-to-End Learning of Latent Deformable Part-Based Representations for Object Detection
IJCV International Journal of Computer Vision (2018)
SyMIL MinMax Latent SVM for Weakly Labeled Data
IEEE Transactions on Neural Networks and Learning Systems (2018)
Classifying low-resolution images by integrating privileged information in deep CNNs
Pattern Recognition letters (2018)
2017
Deformable Part-based Fully Convolutional Network for Object Detection
BMVC, Best Paper (on ArXiv) (2017)
MUTAN: Multimodal Tucker Fusion for Visual Question Answering
ICCV, (on ArXiv) (2017)
WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation
CVPR (2017)
Paper Project page Supplementary
author = {Durand, Thibaut and Mordan, Taylor and Thome, Nicolas and Cord, Matthieu},
title = {{WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation}},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year = {2017}
}
Learning a Distance Metric from Relative Comparisons between Quadruplets of Images
IJCV International Journal of Computer Vision (2017)
Gaze Latent Support Vector Machine for Image Classification Improved by Weakly Supervised Region Selection
Pattern Recognition (PR) (2017)
2016
Gaze latent SVM for image classification
IEEE ICIP (2016)
Max-Min convolutional neural networks for image classification
IEEE ICIP (2016)
2015
MANTRA: Minimum Maximum Latent Structural SVM for Image Classification and Ranking
ICCV (2015)
Recipe Recognition with Large Multimodal Food Dataset
Cooking and Eating Activities in IEEE ICME (2015)
2014
Incremental Learning of Latent Structural SVM for Weakly Supervised Image Classification
IEEE ICIP (2014)
Model-Based Analysis-Synthesis for Realistic Tree Reconstruction and Growth Simulation
TGARS (2014)
Perceptual principles for video classification with Slow Feature Analysis
IEEE Journal of Selected Topics in Signal Processing (2014)
Learning Deep Hierarchical Visual Feature Coding
IEEE Transactions on Neural Networks and Learning Systems (2014)
2013
Dynamic Scene Classification: Learning Motion Descriptors with Slow Features Analysis
CVPR (2013)
JKernelMachines: A simple framework for Kernel Machines
JMLR (May 2013) in JMLR Track for Machine Learning Open Source Software (2013)
Extended coding and pooling in the HMAX model
IEEE Transactions on Image Processing (Vol 22, Issue 2, Feb 2013) (2013)
T-HOG: an Effective Gradient-Based Descriptor for Single Line Text Regions
Pattern Recognition (Vol 46, Issue 3, March 2013) (2013)
2012
Visual Indexing and Retrieval
Series: SpringerBriefs in Computer Science, ISBN 978-1-4614-3587-7 (2012)
Structural and Visual Comparisons for Web Page Archiving
ACM Symposium on Document Engineering (12th DocEng) (2012)
Learning geometric combinations of Gaussian kernels with alternating Quasi-Newton algorithm
ESANN 2012 (2012)
An Application of Swarm Intelligence to Distributed Image Retrieval
Information Sciences (IF 3.6) (Vol 192, June 2012) (2012)
BossaNova at ImageCLEF 2012 Flickr Photo Annotation Task.
Working Notes of the Conference and Labs of the Evaluation Forum (CLEF) (2012)
2011
Learning invariant color features with sparse topographic restricted boltzmann machines
IEEE ICIP (2011)
Hmax-s: deep scale representation for biologically inspired image categorization
IEEE ICIP (2011)
Using LIDO to handle 3D Cultural Heritage Documentation Data Provenance
CBMI (2011)
Text Detection and Recognition in Urban Scenes
ICCV on Computer Vision for Remote Sensing of the Environment (2011)
Combining complementary kernels in complex visual categorization
ICCV on Kernels and Distances for Computer Vision (2011)
IRIM at TRECVID 2011: Semantic Indexing and Instance Search
TREC Video Retrieval Evaluation 2011 (2011)
2010
An efficient System for combining complementary kernels in complex visual categorization tasks
IEEE ICIP (2010)
SnooperText: A Multiresolution System for Text Detection in Complex Visual Scenes
IEEE ICIP (2010)
SALSAS: Sub-linear Active Learning Strategy with Approximate k-NN Search
Pattern Recognition (2010)
Spatio-Temporal Tube data representation and Kernel design for SVM-based video object retrieval system
MTAP (2010)
Indexing personal image collections: A flexible, scalable solution
IEEE trans on Consumer Electronics (2010)
Biasing Restricted Boltzmann Machines to Manipulate Latent Selectivity and Sparsity
NIPS workshop (2010)
2009
Advanced Techniques in CBIR: Local Descriptors, Visual Dictionaries and Bags of Features
ACM Symposium on Document Engineering (2009)
Similarity Search and Indexing for High-Dimensional Data
ACM Symposium on Document Engineering (2009)
Geometric Consistency Checking for Local-Descriptor Based Document Retrieval
ACM Sympo DocEng (2009)
Study of sift descriptors for image matching based localization in urban street view context
CMRT (2009)
2008
Machine Learning Techniques for Multimedia
Case Studies on Organization and Retrieval. Series: Cognitive Technologies Processing multimedia content has emerged as a key area for the application of machine learning techniques, where the objectives are to provide insight into the domain from which the data is drawn, and to organize that data and improve the performance of the processes manipulating it.
An interactive video content-based retrieval system
IWSSIP (2008)
Detection, segmentation and characterisation of vegetation in high-resolution aerial images for 3d city modelling
ISPRS (2008)
Image Retrieval over Networks : Active Learning using Ant Algorithm
IEEE Transactions on Multimedia (2008)
Combining visual dictionary, kernel-based similarity and learning strategy for image category retrieval
CVIU (2008)
Detection, Characterization and Modeling Vegetation in Urban Areas from High Resolution Aerial Imagery
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (2008)
Active learning methods for Interactive Image Retrieval
IEEE Transactions on Image Processing (2008)
Summarization scheme based on near-duplicate analysis
TRECVID Video Sum (ACM multimedia) (2008)
2007
Automatic Extraction of urban vegetation structures from High Resolution Imagery and digital elevation models
In Urban remote sensing joint event, Paris, 2007 (2007)
Kernel on Bags for multi-object database retrieval
ACM International Conference on Image and Video Retrieval (2007)
Descriptor matching for image identificationon cultural databases
IEEE International Conference on Image Processing (2007)
Matching High-Dimensional Local Descriptors for Image Identification on Cultural Databases
ICDAR (2007)
3D Content-based retrieval in artwork databases
3DTV Conference (2007)
Automatic Extraction and Classification of Vegetation Areas from High Resolution Images in Urban Areas
In SCIA conference, 2007 (2007)
Stochastic exploration and active learning for image retrieval
Image and Vision Computing (2007)
2006
Robust Scene Cut Detection by Supervised Learning
EUSIPCO (2006)
2005
Semantic kernel learning for interactive image retrieval
IEEE International Conference on Image Processing (2005)
Active learning techniques for user interactive systems: application to image retrieval
ICML Workshop on Machine Learning techniques for processing (2005)
2004
RETIN AL: An Active Learning Strategy for Image Category Retrieval
IEEE International Conference on Image Processing (ICIP), Singapore, October 2004 (2004)
Discriminative Classification vs Modeling Methods in CBIR
In IEEE Advanced Concepts for Intelligent Vision Systems (ACIVS), Brussel, Belgium, September 2004 (2004)
In International Workshop on Computer Vision meets Databases (CVDB), ACM Sigmod, Paris, France, June 2004 (2004)
Approche interactive de la recherche d'images par le contenu
TSI (2004)
Smooth surface reconstruction using tensor field as structuring elements
International Journal of the Eurographics Association, Computer Graphics Forum, p. 813-823, 2004 (2004)
2002
Long-term similarity learning in content-based image retrieval
In IEEE International Conference on Image Processing (ICIP), Rochester, USA, Sept. 2002 (2002)
Terrain surface modeling from altimetric data
In IEEE International Conference on Image Processing (ICIP) (2002)
Building Detection from High Resolution Digital Elevation Models in Urban Areas
In ISPRS Commission III, Symposium, p. B-96, Graz, Austria, 2002 (2002)
Analyse d'images aeriennes haute resolution pour la reconstruction de scenes urbaines
Bulletin de la SFPT, n. 166, p. 34-43, 2002 (2002)
2001
Three-dimentional building and modeling using a statistical approach
IEEE Trans. on Image Proessing, Vol. 10(5), p.715-723, Ma 2001 (2001)
Accurate building structure recovery from high resolution aerial imagery
Computer Vision and Image Understanding, 82, p.138-173, 2001 (2001)
Back-propagation algorithm for relevance feedback in image retrieval
In ICIP 2001, Thessaloniki, Greece, Octobre 2001 (2001)
J. Fournier, M. Cord, S. Philipp-Foliguet
Pattern Analysis and Applications Journal, Vol. 4 (2/3), p 153-173, 2001 (2001)
1999
Bayesian model identification: application to building reconstruction in aerial imagery
In IEEE International Conference on Image Processing (ICIP), Kobe, Japan, Oct. (1999)
1998
Building Detection and Restitution from High Resolution Aerial Imagery
Computer Vision and Image Understanding, 72(2) p.122-142 (1998)
1997
Optimal Adjusting of Edge Detectors to Extract Close Contours
In SCIA 1997, Lappeenranta, Finland (1997)