Reading Between the Lines: Object Localization Using Implicit Cues from Image Tags

Hwang, Sung Ju; Grauman, Kristen

doi:10.1109/TPAMI.2011.190

Scholarworks@UNIST

UNIST Library

File Download

There are no files associated with this item.

SFX Link

Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Related Researcher

황성주

Hwang, Sung Ju

Read More

Views & Downloads

Detailed Information

Cited time in webofscience

Cited time in scopus

Metadata Downloads

Full metadata record

DC Field	Value	Language
dc.citation.endPage	1158	-
dc.citation.number	6	-
dc.citation.startPage	1145	-
dc.citation.title	IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE	-
dc.citation.volume	34	-
dc.contributor.author	Hwang, Sung Ju	-
dc.contributor.author	Grauman, Kristen	-
dc.date.accessioned	2023-12-22T05:07:45Z	-
dc.date.available	2023-12-22T05:07:45Z	-
dc.date.created	2015-08-04	-
dc.date.issued	2012-06	-
dc.description.abstract	Current uses of tagged images typically exploit only the most explicit information: the link between the nouns named and the objects present somewhere in the image. We propose to leverage "unspoken" cues that rest within an ordered list of image tags so as to improve object localization. We define three novel implicit features from an image's tags-the relative prominence of each object as signified by its order of mention, the scale constraints implied by unnamed objects, and the loose spatial links hinted at by the proximity of names on the list. By learning a conditional density over the localization parameters (position and scale) given these cues, we show how to improve both accuracy and efficiency when detecting the tagged objects. Furthermore, we show how the localization density can be learned in a semantic space shared by the visual and tag-based features, which makes the technique applicable for detection in untagged input images. We validate our approach on the PASCAL VOC, LabelMe, and Flickr image data sets, and demonstrate its effectiveness relative to both traditional sliding windows as well as a visual context baseline. Our algorithm improves state-of-the-art methods, successfully translating insights about human viewing behavior (such as attention, perceived importance, or gaze) into enhanced object detection.	-
dc.identifier.bibliographicCitation	IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, v.34, no.6, pp.1145 - 1158	-
dc.identifier.doi	10.1109/TPAMI.2011.190	-
dc.identifier.issn	0162-8828	-
dc.identifier.scopusid	2-s2.0-84860244538	-
dc.identifier.uri	https://scholarworks.unist.ac.kr/handle/201301/18446	-
dc.identifier.url	http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6030877	-
dc.identifier.wosid	000302916600009	-
dc.language	영어	-
dc.publisher	IEEE COMPUTER SOC	-
dc.title	Reading Between the Lines: Object Localization Using Implicit Cues from Image Tags	-
dc.type	Article	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-

Show Simple Item Record

qrcode

RSS 1.0 RSS 2.0

UNIST | Library

Tel : 052-217-1403 / Email : scholarworks@unist.ac.kr

ScholarWorks@UNIST was established as an OAK Project for the National Library of Korea.