F. Guidi, C. Sacerdoti Coen
We present a short survey of the literature on indexing and retrieval of
mathematical knowledge, with pointers to 72 papers and tentative taxonomies of
both retrieval problems and recurring techniques.
Authors' comments: CICM 2015, 20 pages
Sheng-Jun Yang, Xu-Jie Wang, Jun Li, Jun Rui, Xiao-Hui Bao, Jian-Wei Pan
Entanglement between a single photon and a quantum memory forms the building
block for quantum repeaters and quantum networks. Previous entanglement sources
typically have low retrieval efficiency, which limits future larger-scale
applications. Here, we report a source of highly retrievable spinwave-photon
entanglement. Polarization entanglement is created through the interaction of a
single photon with an ensemble of atoms inside a low-finesse ring cavity. The
cavity is engineered to be resonant for dual spinwave modes, which thus enables
efficient retrieval of the spinwave qubit. An intrinsic retrieval efficiency up
to 76(4)% has been observed. Such a highly retrievable atom-photon entanglement
source will be very useful in future larger-scale quantum repeater and quantum
network applications.
Authors' comments: 5 pages, 3 figures
Jennifer Roldan-Carlos, Mathias Lux, Xavier Giró-i-Nieto, Pia Muñoz, Nektarios Anagnostopoulos
In endoscopic procedures, surgeons work with live video streams from the
inside of their subjects. A main source of documentation for procedures is
still frames from the video, identified and taken during the surgery. However,
with growing demands and technical means, the streams are saved to storage
servers and the surgeons need to retrieve parts of the videos on demand. In
this submission we present a demo application allowing for video retrieval
based on visual features and late fusion, which allows surgeons to re-find
shots taken during the procedure.
Authors' comments: Paper accepted at the IEEE/ACM 13th International Workshop on
Content-Based Multimedia Indexing (CBMI) in Prague (Czech Republic) between
10 and 12 June 2015
Mingsheng Long, Yue Cao, Jianmin Wang, Philip S. Yu
Efficient similarity retrieval from large-scale multimodal databases is pervasive in modern search engines and social networks. To support queries across content modalities, the system should enable cross-modal correlation and computation-efficient indexing. While hashing methods have shown great potential in achieving this goal, current attempts generally fail to learn isomorphic hash codes in a seamless scheme; that is, they embed multiple modalities in a continuous isomorphic space and separately threshold embeddings into binary codes, which incurs substantial loss of retrieval accuracy. In this paper, we approach seamless multimodal hashing by proposing a novel Composite Correlation Quantization (CCQ) model. Specifically, CCQ jointly finds correlation-maximal mappings that transform different modalities into an isomorphic latent space, and learns composite quantizers that convert the isomorphic latent features into compact binary codes. An optimization framework is devised to preserve both intra-modal similarity and inter-modal correlation through minimizing both reconstruction and quantization errors, which can be trained from both paired and partially paired data in linear time. A comprehensive set of experiments clearly shows the superior effectiveness and efficiency of CCQ against the state-of-the-art hashing methods for both unimodal and cross-modal retrieval.
Eva Mohedano, Amaia Salvador, Sergi Porta, Xavier Giró-i-Nieto, Graham Healy, Kevin McGuinness, Noel O'Connor, Alan F. Smeaton
This paper explores the potential for using Brain Computer Interfaces (BCI)
as a relevance feedback mechanism in content-based image retrieval. We
investigate if it is possible to capture useful EEG signals to detect if
relevant objects are present in a dataset of realistic and complex images. We
perform several experiments using a rapid serial visual presentation (RSVP) of
images at different rates (5Hz and 10Hz) on 8 users with different degrees of
familiarization with BCI and the dataset. We then use the feedback from the BCI
and mouse-based interfaces to retrieve localized objects in a subset of TRECVid
images. We show that it is indeed possible to detect such objects in complex
images and, also, that users with previous knowledge of the dataset or
experience with the RSVP outperform others. When the users have limited time to
annotate the images (100 seconds in our experiments) both interfaces are
comparable in performance. Comparing our best users in a retrieval task, we
found that EEG-based relevance feedback outperforms mouse-based feedback. The
realistic and complex image dataset differentiates our work from previous
studies on EEG for image retrieval.
Authors' comments: This preprint is the full version of a short paper accepted in the
ACM International Conference on Multimedia Retrieval (ICMR) 2015 (Shanghai,
China)
Alexander Sagel, Dominik Meyer, Hao Shen
This work studies the problem of content-based image retrieval, specifically, texture retrieval. It focuses on feature extraction and similarity measures for texture images. Our approach employs a recently developed method, the so-called Scattering transform, for the process of feature extraction in texture retrieval. It has the distinctive property of providing a robust representation, which is stable with respect to spatial deformations. Recent work has demonstrated its capability for texture classification, making it a promising candidate for the problem of texture retrieval. Moreover, we adopt a common approach of measuring the similarity of textures by comparing the subband histograms of a filterbank transform. To this end we derive a similarity measure based on the popular Bhattacharyya Kernel. Despite the popularity of describing histograms using parametrized probability density functions, such as the Generalized Gaussian Distribution, this approach is unfortunately not applicable for describing most of the Scattering transform subbands, due to the complex modulus performed on each one of them. In this work, we propose to use the Weibull distribution to model the Scattering subbands of descendant layers. Our numerical experiments demonstrate the effectiveness of the proposed approach, in comparison with several state-of-the-art methods.
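As a rough illustration of the histogram comparison mentioned above, the following minimal sketch computes the discrete Bhattacharyya coefficient between two subband histograms. It is not the authors' similarity measure, which the abstract says is derived for Weibull-modelled Scattering subbands; the function name `bhattacharyya_coefficient` and the plain-list histogram format are assumptions made for this example.

```python
import math


def bhattacharyya_coefficient(p, q):
    """Discrete Bhattacharyya coefficient of two histograms.

    Both histograms are normalized to sum to 1 internally, so raw bin
    counts can be passed in. The result lies in [0, 1]: 1 for identical
    distributions, 0 for histograms with no overlapping bins.
    """
    if len(p) != len(q):
        raise ValueError("histograms must have the same number of bins")
    sum_p, sum_q = sum(p), sum(q)
    if sum_p <= 0 or sum_q <= 0:
        raise ValueError("histograms must have positive total mass")
    # Sum over bins of sqrt(p_i * q_i) on the normalized histograms.
    return sum(math.sqrt((a / sum_p) * (b / sum_q)) for a, b in zip(p, q))
```

A texture descriptor with several subbands could, under the same assumptions, be compared by combining the per-subband coefficients, for example by averaging them.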
Mark Iwen, Aditya Viswanathan, Yang Wang
We develop a fast phase retrieval method which can utilize a large class of
local phaseless correlation-based measurements in order to recover a given
signal ${\bf x} \in \mathbb{C}^d$ (up to an unknown global phase) in
near-linear $\mathcal{O} \left( d \log^4 d \right)$-time. Accompanying
theoretical analysis proves that the proposed algorithm is guaranteed to
deterministically recover all signals ${\bf x}$ satisfying a natural flatness
(i.e., non-sparsity) condition for a particular choice of deterministic
correlation-based measurements. A randomized version of these same measurements
is then shown to provide nonuniform probabilistic recovery guarantees for
arbitrary signals ${\bf x} \in \mathbb{C}^d$. Numerical experiments demonstrate
the method's speed, accuracy, and robustness in practice -- all code is made
publicly available.
Finally, we conclude by developing an extension of the proposed method to the
sparse phase retrieval problem; specifically, we demonstrate a sublinear-time
compressive phase retrieval algorithm which is guaranteed to recover a given
$s$-sparse vector ${\bf x} \in \mathbb{C}^d$ with high probability in just
$\mathcal{O}(s \log^5 s \cdot \log d)$-time using only $\mathcal{O}(s \log^4 s
\cdot \log d)$ magnitude measurements. In doing so we demonstrate the existence
of compressive phase retrieval algorithms with near-optimal linear-in-sparsity
runtime complexities.
Authors' comments: added more empirical evaluations/performance comparisons,
clarifications/additions to introduction/abstract
Lanbo Zhang
This paper presents a new user feedback mechanism based on Wikipedia concepts for interactive retrieval. In this mechanism, the system presents to the user a group of Wikipedia concepts, and the user can choose those relevant to refine his/her query. To realize this mechanism, we propose methods to address two problems: 1) how to select a small number of possibly relevant Wikipedia concepts to show the user, and 2) how to re-rank retrieved documents given the user-identified Wikipedia concepts. Our methods are evaluated on three TREC data sets. The experimental results show that our methods can dramatically improve retrieval performance.
Ali Sharif Razavian, Josephine Sullivan, Stefan Carlsson, Atsuto Maki
This paper provides an extensive study on the availability of image representations based on convolutional networks (ConvNets) for the task of visual instance retrieval. Besides the choice of convolutional layers, we present an efficient pipeline exploiting multi-scale schemes to extract local features, in particular, by taking geometric invariance into explicit account, i.e. positions, scales and spatial consistency. In our experiments using five standard image retrieval datasets, we demonstrate that generic ConvNet image representations can outperform other state-of-the-art methods if they are extracted appropriately.
Tejal Bhamre, Teng Zhang, Amit Singer
In single particle reconstruction (SPR) from cryo-electron microscopy
(cryo-EM), the 3D structure of a molecule needs to be determined from its 2D
projection images taken at unknown viewing directions. Zvi Kam showed already
in 1980 that the autocorrelation function of the 3D molecule over the rotation
group SO(3) can be estimated from 2D projection images whose viewing directions
are uniformly distributed over the sphere. The autocorrelation function
determines the expansion coefficients of the 3D molecule in spherical harmonics
up to an orthogonal matrix of size $(2l+1)\times (2l+1)$ for each
$l=0,1,2,...$. In this paper we show how techniques for solving the phase
retrieval problem in X-ray crystallography can be modified for the cryo-EM
setup for retrieving the missing orthogonal matrices. Specifically, we present
two new approaches that we term Orthogonal Extension and Orthogonal
Replacement, in which the main algorithmic components are the singular value
decomposition and semidefinite programming. We demonstrate the utility of these
approaches through numerical experiments on simulated data.
Authors' comments: Modified introduction and summary. Accepted to the IEEE International
Symposium on Biomedical Imaging
Georgia T. Papadakis, Pochi Yeh, Harry A. Atwater
We present a general method for retrieving the effective tensorial
permittivity of any uniaxially anisotropic metamaterial. By relaxing the
usually imposed condition of non-magnetic metal/dielectric metamaterials, we
also retrieve the permeability tensor and show that hyperbolic metamaterials
exhibit a strong diamagnetic response in the visible regime. We obtain global
material parameters, directly measurable with spectroscopic ellipsometry and
distinguishable from mere wave parameters, by using the generalized dispersion
equation for uniaxial crystals along with existing homogenization methods. Our
method is analytically and experimentally verified for Ag/SiO2 planar
metamaterials with varying number of layers and compared to the effective
medium theory. We also propose an experimental method for retrieving material
parameters using methods other than ellipsometry.
Authors' comments: 17 pages, 9 figures
Kevin Shih, Wei Di, Vignesh Jagadeesh, Robinson Piramuthu
Text is ubiquitous in the artificial world and easily attainable when it
comes to book titles and author names. Using the images from the book cover set
from the Stanford Mobile Visual Search dataset and additional book covers and
metadata from openlibrary.org, we construct a large scale book cover retrieval
dataset, complete with 100K distractor covers and title and author strings for
each. Because our query images are poorly conditioned for clean text
extraction, we propose a method for extracting noisy and erroneous
OCR readings and matching them against clean author and book title strings in a
standard document look-up problem setup. Finally, we demonstrate how to use
this text-matching as a feature in conjunction with popular retrieval features
such as VLAD using a simple learning setup to achieve significant improvements
in retrieval accuracy over that of either VLAD or the text alone.
Authors' comments: 8 pages, 9 figures, 1 table
Philipp Mayr, Andrea Scharnhorst
This special issue brings together eight papers from experts in communities
that have often been perceived as different ones: bibliometrics,
scientometrics and informetrics on the one side and information retrieval on
the other. The idea of this special issue started at the workshop "Combining
Bibliometrics and Information Retrieval" held at the 14th International
Conference of Scientometrics and Informetrics, Vienna, July 14-19, 2013. Our
motivation as guest editors started from the observation that the main discourses
in both fields are different and that the communities overlap only partly, and
from the belief that a knowledge transfer would be profitable for both sides.
Authors' comments: 8 pages, 1 figure, editorial for a special issue to appear in
Scientometrics
Çağkan Yapar, Volker Pohl, Holger Boche
This paper considers the problem of recovering a $k$-sparse, $N$-dimensional
complex signal from Fourier magnitude measurements. It proposes a Fourier
optics setup such that signal recovery up to a global phase factor is possible
with very high probability whenever $M \gtrsim 4k\log_2(N/k)$ random Fourier
intensity measurements are available. The proposed algorithm comprises
two stages: an algebraic phase retrieval stage followed by a compressive
sensing step. Simulation results are provided to demonstrate the
applicability of the algorithm for noiseless and noisy scenarios.
Authors' comments: 8 pages, 4 figures, submitted to ICASSP 2015 on Oct 6th, 2014
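To give a feel for the measurement bound $M \gtrsim 4k\log_2(N/k)$ quoted in the abstract above, the sketch below evaluates $4k\log_2(N/k)$ for given $k$ and $N$. Treating the bound as an equality is an assumption made purely for illustration: the $\gtrsim$ relation hides constants and lower-order terms, so the returned value is an order-of-magnitude guide, not the number of measurements the authors' setup requires. The function name `approx_measurements` is hypothetical.

```python
import math


def approx_measurements(k, n):
    """Evaluate ceil(4 * k * log2(n / k)) for sparsity k and dimension n.

    Illustrative only: the abstract states M >~ 4k log2(N/k), and this
    function simply plugs numbers into the right-hand side.
    """
    if not 0 < k <= n:
        raise ValueError("require 0 < k <= n")
    return math.ceil(4 * k * math.log2(n / k))


# For an 8-sparse signal in dimension 1024: log2(1024 / 8) = 7, so 4 * 8 * 7.
print(approx_measurements(8, 1024))  # prints 224
```

Under this reading, the measurement count grows linearly in the sparsity and only logarithmically in the ambient dimension.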
Terence H. Chan, Siu-Wai Ho, Hirosuke Yamamoto
A private information retrieval scheme for coded data storage is considered in
this paper. We focus on the case where the size of each data record is large
and hence only the download cost (but not the upload cost for transmitting
retrieval queries) is of interest. We prove that the tradeoff between storage
cost and retrieval/download cost depends on the number of data records in the
system. We also propose a fairly general class of linear storage codes and
retrieval schemes and derive conditions under which our retrieval schemes are
error-free and private. Tradeoffs between the storage cost and retrieval costs
are also obtained. Finally, we consider special cases when the underlying
storage code is based on an MDS code. Using our proposed method, we show that a
randomly generated retrieval scheme is indeed very likely to be private and
error-free.
Authors' comments: submitted to IEEE Journal of Selected Topics in Signal Processing
Mark Iwen, Aditya Viswanathan, Yang Wang
In this short note we propose a simple two-stage sparse phase retrieval strategy that uses a near-optimal number of measurements, and is both computationally efficient and robust to measurement noise. In addition, the proposed strategy is fairly general, allowing for a large number of new measurement constructions and recovery algorithms to be designed with minimal effort.
Xirong Li
Due to the subjective nature of social tagging, measuring the relevance of social tags with respect to the visual content is crucial for retrieving the increasing amounts of social-networked images. Witnessing the limit of a single measurement of tag relevance, we introduce in this paper tag relevance fusion as an extension to methods for tag relevance estimation. We present a systematic study, covering tag relevance fusion in early and late stages, and in supervised and unsupervised settings. Experiments on a large present-day benchmark set show that tag relevance fusion leads to better image retrieval. Moreover, unsupervised tag relevance fusion is found to be practically as effective as supervised tag relevance fusion, but without the need for any training effort. This finding suggests the potential of tag relevance fusion for real-world deployment.
Djallel Bouneffouf
Context-Based Information Retrieval has recently been modelled as an exploration/
exploitation trade-off (exr/exp) problem, where the system has to choose
between maximizing its expected rewards given its current knowledge
(exploitation) and learning more about the user's unknown preferences to
improve its knowledge (exploration). This problem has been addressed by the
reinforcement learning community, but existing work does not consider the risk level of the
current user's situation, where it may be dangerous to explore the
non-top-ranked documents the user may not desire in his/her current situation
if the risk level is high. We introduce in this paper an algorithm named
CBIR-R-greedy that considers the risk level of the user's situation to
adaptively balance between exr and exp.
Authors' comments: arXiv admin note: substantial text overlap with arXiv:1408.2195
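The idea of letting the risk level steer the exploration/exploitation balance can be sketched with a plain epsilon-greedy selector whose exploration rate shrinks as risk grows. This is not the CBIR-R-greedy algorithm itself, whose details the abstract does not give; the linear mapping from risk to epsilon, the bounds `eps_min` and `eps_max`, and both function names are assumptions made for this example.

```python
import random


def risk_aware_epsilon(risk, eps_min=0.0, eps_max=0.3):
    """Map a risk level in [0, 1] to an exploration rate (assumed linear).

    Low risk gives eps_max (explore more); high risk gives eps_min.
    """
    if not 0.0 <= risk <= 1.0:
        raise ValueError("risk must lie in [0, 1]")
    return eps_max - risk * (eps_max - eps_min)


def choose_document(estimated_rewards, risk, rng=random):
    """Pick a document index by epsilon-greedy with a risk-dependent epsilon."""
    epsilon = risk_aware_epsilon(risk)
    if rng.random() < epsilon:
        # Exploration: try a uniformly random document.
        return rng.randrange(len(estimated_rewards))
    # Exploitation: return the document with the highest estimated reward.
    return max(range(len(estimated_rewards)), key=estimated_rewards.__getitem__)
```

With the default bounds, a maximally risky situation (`risk=1.0`) gives an exploration rate of zero, so the selector always returns the top-ranked document.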
Dilip K. Limbu, Andy M. Connor, Russel Pears, Stephen G. MacDonell
Contextual retrieval is a critical technique for today's search engines in terms of facilitating queries and returning relevant information. This paper reports on the development and evaluation of a system designed to tackle some of the challenges associated with contextual information retrieval from the World Wide Web (WWW). The developed system has been designed with a view to capturing both implicit and explicit user data, which is used to develop a personal contextual profile. Such profiles can be shared across multiple users to create a shared contextual knowledge base. These are used to refine search queries and improve both the search results for a user and their search experience. An empirical study has been undertaken to evaluate the system against a number of hypotheses. In this paper, results related to one hypothesis are presented that support the claim that users can find information more readily using the contextual search system.
Simon Gog, Matthias Petri
We engineer a self-index based retrieval system capable of rank-safe
evaluation of top-k queries. The framework generalizes the GREEDY approach of
Culpepper et al. (ESA 2010) to handle multi-term queries, including over
phrases. We propose two techniques which significantly reduce the ranking time
for a wide range of popular Information Retrieval (IR) relevance measures, such
as TFxIDF and BM25. First, we reorder elements in the document array according
to document weight. Second, we introduce the repetition array, which
generalizes Sadakane's (JDA 2007) document frequency structure to document
subsets. Combining the document and repetition arrays, we achieve attractive
functionality-space trade-offs. We provide an extensive evaluation of our
system on terabyte-sized IR collections.
Authors' comments: 14 pages, 9 figures
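For readers unfamiliar with BM25, one of the relevance measures named in the abstract above, the sketch below scores tokenized documents with a common textbook form of BM25 and returns the top-k document indices. It scans every document and does not implement the self-index, the document array reordering, or the repetition array described by the authors. The IDF variant `log(1 + (N - df + 0.5) / (df + 0.5))`, the defaults `k1=1.2` and `b=0.75`, and the function names are assumptions of this example.

```python
import heapq
import math
from collections import Counter


def bm25_scores(query_terms, docs, k1=1.2, b=0.75):
    """Score each tokenized document in `docs` against `query_terms`."""
    n_docs = len(docs)
    if n_docs == 0:
        return []
    avg_len = sum(len(d) for d in docs) / n_docs
    term_freqs = [Counter(d) for d in docs]
    scores = [0.0] * n_docs
    for term in query_terms:
        # Document frequency: number of documents containing the term.
        df = sum(1 for tf in term_freqs if term in tf)
        if df == 0:
            continue
        idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
        for i, tf in enumerate(term_freqs):
            f = tf[term]
            if f == 0:
                continue
            # Length normalization relative to the average document length.
            norm = k1 * (1 - b + b * len(docs[i]) / avg_len)
            scores[i] += idf * f * (k1 + 1) / (f + norm)
    return scores


def top_k(query_terms, docs, k):
    """Return indices of the k highest-scoring documents, best first."""
    scores = bm25_scores(query_terms, docs)
    return heapq.nlargest(k, range(len(docs)), key=scores.__getitem__)
```

An exhaustive scan like this costs time proportional to the collection size per query term, which is the cost that index structures are meant to avoid on large collections.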