Loading [MathJax]/extensions/tex2jax.js

On the spatial extents of SIFT descriptors for visual concept detection

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Markus Mühling
  • Ralph Ewerth
  • Bernd Freisleben

External Research Organisations

  • Philipps-Universität Marburg

Details

Original languageEnglish
Title of host publicationComputer Vision Systems
Subtitle of host publication 8th International Conference, ICVS 2011, Proceedings
Pages71-80
Number of pages10
Publication statusPublished - 2011
Externally publishedYes
Event8th International Conference on Computer Vision Systems, ICVS 2011 - Sophia Antipolis, France
Duration: 20 Sept 201122 Sept 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6962 LNCS
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Abstract

State-of-the-art systems for visual concept detection typically rely on the Bag-of-Visual-Words representation. While several aspects of this representation have , such as keypoint sampling strategy, vocabulary size, projection method, weighting scheme or the integration of color, the impact of the spatial extents of local SIFT descriptors has not been studied in previous work. In this paper, the effect of different spatial extents in a state-of-the-art system for visual concept detection is investigated. Based on the observation that SIFT descriptors with different spatial extents yield large performance differences, we propose a concept detection system that combines feature representations for different spatial extents using multiple kernel learning. It is shown experimentally on a large set of 101 concepts from the Mediamill Challenge and on the PASCAL Visual Object Classes Challenge that these feature representations are complementary: Superior performance can be achieved on both test sets using the proposed system.

Keywords

    Bag-of-Words, Magnification Factor, SIFT, Spatial Bin Size, Video Retrieval, Visual Concept Detection

ASJC Scopus subject areas

Cite this

On the spatial extents of SIFT descriptors for visual concept detection. / Mühling, Markus; Ewerth, Ralph; Freisleben, Bernd.
Computer Vision Systems : 8th International Conference, ICVS 2011, Proceedings. 2011. p. 71-80 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6962 LNCS).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Mühling, M, Ewerth, R & Freisleben, B 2011, On the spatial extents of SIFT descriptors for visual concept detection. in Computer Vision Systems : 8th International Conference, ICVS 2011, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 6962 LNCS, pp. 71-80, 8th International Conference on Computer Vision Systems, ICVS 2011, Sophia Antipolis, France, 20 Sept 2011. https://doi.org/10.1007/978-3-642-23968-7_8
Mühling, M., Ewerth, R., & Freisleben, B. (2011). On the spatial extents of SIFT descriptors for visual concept detection. In Computer Vision Systems : 8th International Conference, ICVS 2011, Proceedings (pp. 71-80). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6962 LNCS). https://doi.org/10.1007/978-3-642-23968-7_8
Mühling M, Ewerth R, Freisleben B. On the spatial extents of SIFT descriptors for visual concept detection. In Computer Vision Systems : 8th International Conference, ICVS 2011, Proceedings. 2011. p. 71-80. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-642-23968-7_8
Mühling, Markus ; Ewerth, Ralph ; Freisleben, Bernd. / On the spatial extents of SIFT descriptors for visual concept detection. Computer Vision Systems : 8th International Conference, ICVS 2011, Proceedings. 2011. pp. 71-80 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{f3b08112aaf24f89bd869e1b6e143bec,
title = "On the spatial extents of SIFT descriptors for visual concept detection",
abstract = "State-of-the-art systems for visual concept detection typically rely on the Bag-of-Visual-Words representation. While several aspects of this representation have , such as keypoint sampling strategy, vocabulary size, projection method, weighting scheme or the integration of color, the impact of the spatial extents of local SIFT descriptors has not been studied in previous work. In this paper, the effect of different spatial extents in a state-of-the-art system for visual concept detection is investigated. Based on the observation that SIFT descriptors with different spatial extents yield large performance differences, we propose a concept detection system that combines feature representations for different spatial extents using multiple kernel learning. It is shown experimentally on a large set of 101 concepts from the Mediamill Challenge and on the PASCAL Visual Object Classes Challenge that these feature representations are complementary: Superior performance can be achieved on both test sets using the proposed system.",
keywords = "Bag-of-Words, Magnification Factor, SIFT, Spatial Bin Size, Video Retrieval, Visual Concept Detection",
author = "Markus M{\"u}hling and Ralph Ewerth and Bernd Freisleben",
year = "2011",
doi = "10.1007/978-3-642-23968-7_8",
language = "English",
isbn = "9783642239670",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "71--80",
booktitle = "Computer Vision Systems",
note = "8th International Conference on Computer Vision Systems, ICVS 2011 ; Conference date: 20-09-2011 Through 22-09-2011",

}

Download

TY - GEN

T1 - On the spatial extents of SIFT descriptors for visual concept detection

AU - Mühling, Markus

AU - Ewerth, Ralph

AU - Freisleben, Bernd

PY - 2011

Y1 - 2011

N2 - State-of-the-art systems for visual concept detection typically rely on the Bag-of-Visual-Words representation. While several aspects of this representation have , such as keypoint sampling strategy, vocabulary size, projection method, weighting scheme or the integration of color, the impact of the spatial extents of local SIFT descriptors has not been studied in previous work. In this paper, the effect of different spatial extents in a state-of-the-art system for visual concept detection is investigated. Based on the observation that SIFT descriptors with different spatial extents yield large performance differences, we propose a concept detection system that combines feature representations for different spatial extents using multiple kernel learning. It is shown experimentally on a large set of 101 concepts from the Mediamill Challenge and on the PASCAL Visual Object Classes Challenge that these feature representations are complementary: Superior performance can be achieved on both test sets using the proposed system.

AB - State-of-the-art systems for visual concept detection typically rely on the Bag-of-Visual-Words representation. While several aspects of this representation have , such as keypoint sampling strategy, vocabulary size, projection method, weighting scheme or the integration of color, the impact of the spatial extents of local SIFT descriptors has not been studied in previous work. In this paper, the effect of different spatial extents in a state-of-the-art system for visual concept detection is investigated. Based on the observation that SIFT descriptors with different spatial extents yield large performance differences, we propose a concept detection system that combines feature representations for different spatial extents using multiple kernel learning. It is shown experimentally on a large set of 101 concepts from the Mediamill Challenge and on the PASCAL Visual Object Classes Challenge that these feature representations are complementary: Superior performance can be achieved on both test sets using the proposed system.

KW - Bag-of-Words

KW - Magnification Factor

KW - SIFT

KW - Spatial Bin Size

KW - Video Retrieval

KW - Visual Concept Detection

UR - http://www.scopus.com/inward/record.url?scp=80053457242&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-23968-7_8

DO - 10.1007/978-3-642-23968-7_8

M3 - Conference contribution

AN - SCOPUS:80053457242

SN - 9783642239670

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 71

EP - 80

BT - Computer Vision Systems

T2 - 8th International Conference on Computer Vision Systems, ICVS 2011

Y2 - 20 September 2011 through 22 September 2011

ER -