Çağkan Yapar, Volker Pohl, Holger Boche
This contribution proposes a two stage strategy to allow for phase retrieval
in state of the art sub-Nyquist sampling schemes for sparse multiband signals.
The proposed strategy is based on data acquisition via modulated wideband
converters known from sub-Nyquist sampling. This paper describes how the
modulators have to be modified such that signal recovery from sub-Nyquist
amplitude samples becomes possible and a corresponding recovery algorithm is
given which is computational efficient. In addition, the proposed strategy is
fairly general, allowing for several constructions and recovery algorithms.
Authors' comments: Submitted to ICASSP 2016
Vidyadhar Rao, Prateek Jain, C. V Jawahar
Typical retrieval systems have three requirements: a) Accurate retrieval
i.e., the method should have high precision, b) Diverse retrieval, i.e., the
obtained set of points should be diverse, c) Retrieval time should be small.
However, most of the existing methods address only one or two of the above
mentioned requirements. In this work, we present a method based on randomized
locality sensitive hashing which tries to address all of the above requirements
simultaneously. While earlier hashing approaches considered approximate
retrieval to be acceptable only for the sake of efficiency, we argue that one
can further exploit approximate retrieval to provide impressive trade-offs
between accuracy and diversity. We extend our method to the problem of
multi-label prediction, where the goal is to output a diverse and accurate set
of labels for a given document in real-time. Moreover, we introduce a new
notion to simultaneously evaluate a method's performance for both the precision
and diversity measures. Finally, we present empirical results on several
different retrieval tasks and show that our method retrieves diverse and
accurate images/labels while ensuring $100x$-speed-up over the existing diverse
retrieval approaches.
Authors' comments: 10 pages
Gerard t Hooft
The mechanism by which black holes return the absorbed information to the
outside world is reconsidered, and described in terms of a set of mutually
non-interacting modes. Our mechanism is based on the mostly classical
gravitational back-reaction. The diagonalized formalism is particularly useful
for further studies of this process. Although no use is made of string theory,
our analysis appears to point towards an ensuing string-like interaction. It is
shown how black hole entropy can be traced down to classical gravitational
back-reaction.
Authors' comments: 10 pages, no figures
Jinma Guo, Jianmin Li
Along with data on the web increasing dramatically, hashing is becoming more
and more popular as a method of approximate nearest neighbor search. Previous
supervised hashing methods utilized similarity/dissimilarity matrix to get
semantic information. But the matrix is not easy to construct for a new
dataset. Rather than to reconstruct the matrix, we proposed a straightforward
CNN-based hashing method, i.e. binarilizing the activations of a fully
connected layer with threshold 0 and taking the binary result as hash codes.
This method achieved the best performance on CIFAR-10 and was comparable with
the state-of-the-art on MNIST. And our experiments on CIFAR-10 suggested that
the signs of activations may carry more information than the relative values of
activations between samples, and that the co-adaption between feature extractor
and hash functions is important for hashing.
Authors' comments: 16 pages, 6 figures
Ingo P. Waldmann, Marco Rocchetto, Giovanna Tinetti, Emma J. Barton, Sergey N. Yurchenko, Jonathan Tennyson
Tau-REx (Tau Retrieval of Exoplanets) is a novel, fully Bayesian atmospheric
retrieval code custom built for extrasolar atmospheres. In Waldmann et al.
(2015) the transmission spectroscopic case was introduced, here we present the
emission spectroscopy spectral retrieval for the Tau-REx framework. Compared to
transmission spectroscopy, the emission case is often significantly more
degenerate due to the need to retrieve the full atmospheric
temperature-pressure (TP) profile. This is particularly true in the case of
current measurements of exoplanetary atmospheres, which are either of low
signal-to-noise, low spectral resolution or both. Here we present a new way of
combining two existing approaches to the modelling of the said TP profile: 1)
the parametric profile, where the atmospheric TP structure is analytically
approximated by a few model parameters, 2) the Layer-by-Layer approach, where
individual atmospheric layers are modelled. Both these approaches have distinct
advantages and disadvantages in terms of convergence properties and potential
model biases. The Tau-REx hybrid model presented here is a new two-stage TP
profile retrieval, which combines the robustness of the analytic solution with
the accuracy of the Layer-by-Layer approach. The retrieval process is
demonstrated using simulations of the hot-Jupiter WASP-76b and the hot
SuperEarth 55 Cnc e, as well as on the secondary eclipse measurements of
HD189733b.
Authors' comments: ApJ accepted
Christina Lioma, Jakob Grue Simonsen, Birger Larsen, Niels Dalum Hansen
Modelling term dependence in IR aims to identify co-occurring terms that are too heavily dependent on each other to be treated as a bag of words, and to adapt the indexing and ranking accordingly. Dependent terms are predominantly identified using lexical frequency statistics, assuming that (a) if terms co-occur often enough in some corpus, they are semantically dependent; (b) the more often they co-occur, the more semantically dependent they are. This assumption is not always correct: the frequency of co-occurring terms can be separate from the strength of their semantic dependence. E.g. "red tape" might be overall less frequent than "tape measure" in some corpus, but this does not mean that "red"+"tape" are less dependent than "tape"+"measure". This is especially the case for non-compositional phrases, i.e. phrases whose meaning cannot be composed from the individual meanings of their terms (such as the phrase "red tape" meaning bureaucracy). Motivated by this lack of distinction between the frequency and strength of term dependence in IR, we present a principled approach for handling term dependence in queries, using both lexical frequency and semantic evidence. We focus on non-compositional phrases, extending a recent unsupervised model for their detection [21] to IR. Our approach, integrated into ranking using Markov Random Fields [31], yields e?ectiveness gains over competitive TREC baselines, showing that there is still room for improvement in the very well-studied area of term dependence in IR.
Zehra Camlica, H. R. Tizhoosh, Farzad Khalvati
Content-based image retrieval (CBIR) of medical images is a crucial task that
can contribute to a more reliable diagnosis if applied to big data. Recent
advances in feature extraction and classification have enormously improved CBIR
results for digital images. However, considering the increasing accessibility
of big data in medical imaging, we are still in need of reducing both memory
requirements and computational expenses of image retrieval systems. This work
proposes to exclude the features of image blocks that exhibit a low encoding
error when learned by a $n/p/n$ autoencoder ($p\!<\!n$). We examine the
histogram of autoendcoding errors of image blocks for each image class to
facilitate the decision which image regions, or roughly what percentage of an
image perhaps, shall be declared relevant for the retrieval task. This leads to
reduction of feature dimensionality and speeds up the retrieval process. To
validate the proposed scheme, we employ local binary patterns (LBP) and support
vector machines (SVM) which are both well-established approaches in CBIR
research community. As well, we use IRMA dataset with 14,410 x-ray images as
test data. The results show that the dimensionality of annotated feature
vectors can be reduced by up to 50% resulting in speedups greater than 27% at
expense of less than 1% decrease in the accuracy of retrieval when validating
the precision and recall of the top 20 hits.
Authors' comments: To appear in proceedings of The 5th International Conference on Image
Processing Theory, Tools and Applications (IPTA'15), Nov 10-13, 2015,
Orleans, France
Leonardo A. Duarte, Otávio A. B. Penatti, Jurandy Almeida
Often, videos are composed of multiple concepts or even genres. For instance, news videos may contain sports, action, nature, etc. Therefore, encoding the distribution of such concepts/genres in a compact and effective representation is a challenging task. In this sense, we propose the Bag of Genres representation, which is based on a visual dictionary defined by a genre classifier. Each visual word corresponds to a region in the classification space. The Bag of Genres video vector contains a summary of the activations of each genre in the video content. We evaluate the proposed method for video genre retrieval using the dataset of MediaEval Tagging Task of 2012 and for video event retrieval using the EVVE dataset. Results show that the proposed method achieves results comparable or superior to state-of-the-art methods, with the advantage of providing a much more compact representation than existing features.
F. Guidi, C. Sacerdoti Coen
We present a short survey of the literature on indexing and retrieval of
mathematical knowledge, with pointers to 72 papers and tentative taxonomies of
both retrieval problems and recurring techniques.
Authors' comments: CICM 2015, 20 pages
Sheng-Jun Yang, Xu-Jie Wang, Jun Li, Jun Rui, Xiao-Hui Bao, Jian-Wei Pan
Entanglement between a single photon and a quantum memory forms the building
blocks for quantum repeater and quantum network. Previous entanglement sources
are typically with low retrieval efficiency, which limits future larger-scale
applications. Here, we report a source of highly retrievable spinwave-photon
entanglement. Polarization entanglement is created through interaction of a
single photon with ensemble of atoms inside a low-finesse ring cavity. The
cavity is engineered to be resonant for dual spinwave modes, which thus enables
efficient retrieval of the spinwave qubit. An intrinsic retrieval efficiency up
to 76(4)% has been observed. Such a highly retrievable atom-photon entanglement
source will be very useful in future larger-scale quantum repeater and quantum
network applications.
Authors' comments: 5 pages, 3 figures
Jennifer Roldan-Carlos, Mathias Lux, Xavier Giró-i-Nieto, Pia Muñoz, Nektarios Anagnostopoulos
In endoscopic procedures, surgeons work with live video streams from the
inside of their subjects. A main source for documentation of procedures are
still frames from the video, identified and taken during the surgery. However,
with growing demands and technical means, the streams are saved to storage
servers and the surgeons need to retrieve parts of the videos on demand. In
this submission we present a demo application allowing for video retrieval
based on visual features and late fusion, which allows surgeons to re-find
shots taken during the procedure.
Authors' comments: Paper accepted at the IEEE/ACM 13th International Workshop on
Content-Based Multimedia Indexing (CBMI) in Prague (Czech Republic) between
10 and 12 June 2015
Mingsheng Long, Yue Cao, Jianmin Wang, Philip S. Yu
Efficient similarity retrieval from large-scale multimodal database is pervasive in modern search engines and social networks. To support queries across content modalities, the system should enable cross-modal correlation and computation-efficient indexing. While hashing methods have shown great potential in achieving this goal, current attempts generally fail to learn isomorphic hash codes in a seamless scheme, that is, they embed multiple modalities in a continuous isomorphic space and separately threshold embeddings into binary codes, which incurs substantial loss of retrieval accuracy. In this paper, we approach seamless multimodal hashing by proposing a novel Composite Correlation Quantization (CCQ) model. Specifically, CCQ jointly finds correlation-maximal mappings that transform different modalities into isomorphic latent space, and learns composite quantizers that convert the isomorphic latent features into compact binary codes. An optimization framework is devised to preserve both intra-modal similarity and inter-modal correlation through minimizing both reconstruction and quantization errors, which can be trained from both paired and partially paired data in linear time. A comprehensive set of experiments clearly show the superior effectiveness and efficiency of CCQ against the state of the art hashing methods for both unimodal and cross-modal retrieval.
Eva Mohedano, Amaia Salvador, Sergi Porta, Xavier Giró-i-Nieto, Graham Healy, Kevin McGuinness, Noel O'Connor, Alan F. Smeaton
This paper explores the potential for using Brain Computer Interfaces (BCI)
as a relevance feedback mechanism in content-based image retrieval. We
investigate if it is possible to capture useful EEG signals to detect if
relevant objects are present in a dataset of realistic and complex images. We
perform several experiments using a rapid serial visual presentation (RSVP) of
images at different rates (5Hz and 10Hz) on 8 users with different degrees of
familiarization with BCI and the dataset. We then use the feedback from the BCI
and mouse-based interfaces to retrieve localized objects in a subset of TRECVid
images. We show that it is indeed possible to detect such objects in complex
images and, also, that users with previous knowledge on the dataset or
experience with the RSVP outperform others. When the users have limited time to
annotate the images (100 seconds in our experiments) both interfaces are
comparable in performance. Comparing our best users in a retrieval task, we
found that EEG-based relevance feedback outperforms mouse-based feedback. The
realistic and complex image dataset differentiates our work from previous
studies on EEG for image retrieval.
Authors' comments: This preprint is the full version of a short paper accepted in the
ACM International Conference on Multimedia Retrieval (ICMR) 2015 (Shanghai,
China)
Alexander Sagel, Dominik Meyer, Hao Shen
This work studies the problem of content-based image retrieval, specifically, texture retrieval. It focuses on feature extraction and similarity measure for texture images. Our approach employs a recently developed method, the so-called Scattering transform, for the process of feature extraction in texture retrieval. It shares a distinctive property of providing a robust representation, which is stable with respect to spatial deformations. Recent work has demonstrated its capability for texture classification, and hence as a promising candidate for the problem of texture retrieval. Moreover, we adopt a common approach of measuring the similarity of textures by comparing the subband histograms of a filterbank transform. To this end we derive a similarity measure based on the popular Bhattacharyya Kernel. Despite the popularity of describing histograms using parametrized probability density functions, such as the Generalized Gaussian Distribution, it is unfortunately not applicable for describing most of the Scattering transform subbands, due to the complex modulus performed on each one of them. In this work, we propose to use the Weibull distribution to model the Scattering subbands of descendant layers. Our numerical experiments demonstrated the effectiveness of the proposed approach, in comparison with several state of the arts.
Mark Iwen, Aditya Viswanathan, Yang Wang
We develop a fast phase retrieval method which can utilize a large class of
local phaseless correlation-based measurements in order to recover a given
signal ${\bf x} \in \mathbb{C}^d$ (up to an unknown global phase) in
near-linear $\mathcal{O} \left( d \log^4 d \right)$-time. Accompanying
theoretical analysis proves that the proposed algorithm is guaranteed to
deterministically recover all signals ${\bf x}$ satisfying a natural flatness
(i.e., non-sparsity) condition for a particular choice of deterministic
correlation-based measurements. A randomized version of these same measurements
is then shown to provide nonuniform probabilistic recovery guarantees for
arbitrary signals ${\bf x} \in \mathbb{C}^d$. Numerical experiments demonstrate
the method's speed, accuracy, and robustness in practice -- all code is made
publicly available.
Finally, we conclude by developing an extension of the proposed method to the
sparse phase retrieval problem; specifically, we demonstrate a sublinear-time
compressive phase retrieval algorithm which is guaranteed to recover a given
$s$-sparse vector ${\bf x} \in \mathbb{C}^d$ with high probability in just
$\mathcal{O}(s \log^5 s \cdot \log d)$-time using only $\mathcal{O}(s \log^4 s
\cdot \log d)$ magnitude measurements. In doing so we demonstrate the existence
of compressive phase retrieval algorithms with near-optimal linear-in-sparsity
runtime complexities.
Authors' comments: added more empirical evaluations/performance comparisons,
clarifications/additions to introduction/abstract
Lanbo Zhang
This paper presents a new user feedback mechanism based on Wikipedia concepts for interactive retrieval. In this mechanism, the system presents to the user a group of Wikipedia concepts, and the user can choose those relevant to refine his/her query. To realize this mechanism, we propose methods to address two problems: 1) how to select a small number of possibly relevant Wikipedia concepts to show the user, and 2) how to re-rank retrieved documents given the user-identified Wikipedia concepts. Our methods are evaluated on three TREC data sets. The experiment results show that our methods can dramatically improve retrieval performances.
Ali Sharif Razavian, Josephine Sullivan, Stefan Carlsson, Atsuto Maki
This paper provides an extensive study on the availability of image representations based on convolutional networks (ConvNets) for the task of visual instance retrieval. Besides the choice of convolutional layers, we present an efficient pipeline exploiting multi-scale schemes to extract local features, in particular, by taking geometric invariance into explicit account, i.e. positions, scales and spatial consistency. In our experiments using five standard image retrieval datasets, we demonstrate that generic ConvNet image representations can outperform other state-of-the-art methods if they are extracted appropriately.
Tejal Bhamre, Teng Zhang, Amit Singer
In single particle reconstruction (SPR) from cryo-electron microscopy
(cryo-EM), the 3D structure of a molecule needs to be determined from its 2D
projection images taken at unknown viewing directions. Zvi Kam showed already
in 1980 that the autocorrelation function of the 3D molecule over the rotation
group SO(3) can be estimated from 2D projection images whose viewing directions
are uniformly distributed over the sphere. The autocorrelation function
determines the expansion coefficients of the 3D molecule in spherical harmonics
up to an orthogonal matrix of size $(2l+1)\times (2l+1)$ for each
$l=0,1,2,...$. In this paper we show how techniques for solving the phase
retrieval problem in X-ray crystallography can be modified for the cryo-EM
setup for retrieving the missing orthogonal matrices. Specifically, we present
two new approaches that we term Orthogonal Extension and Orthogonal
Replacement, in which the main algorithmic components are the singular value
decomposition and semidefinite programming. We demonstrate the utility of these
approaches through numerical experiments on simulated data.
Authors' comments: Modified introduction and summary. Accepted to the IEEE International
Symposium on Biomedical Imaging
Georgia T. Papadakis, Pochi Yeh, Harry A. Atwater
We present a general method for retrieving the effective tensorial
permittivity of any uniaxially anisotropic metamaterial. By relaxing the
usually imposed condition of non-magnetic metal/dielectric metamaterials, we
also retrieve the permeability tensor and show that hyperbolic metamaterials
exhibit a strong diamagnetic response in the visible regime. We obtain global
material parameters, directly measurable with spectroscopic ellipsometry and
distinguishable from mere wave parameters, by using the generalized dispersion
equation for uniaxial crystals along with existing homogenization methods. Our
method is analytically and experimentally verified for Ag/SiO2 planar
metamaterials with varying number of layers and compared to the effective
medium theory. We also propose an experimental method for retrieving material
parameters using methods other than ellipsometry.
Authors' comments: 17 pages, 9 figures
Kevin Shih, Wei Di, Vignesh Jagadeesh, Robinson Piramuthu
Text is ubiquitous in the artificial world and easily attainable when it
comes to book title and author names. Using the images from the book cover set
from the Stanford Mobile Visual Search dataset and additional book covers and
metadata from openlibrary.org, we construct a large scale book cover retrieval
dataset, complete with 100K distractor covers and title and author strings for
each. Because our query images are poorly conditioned for clean text
extraction, we propose a method for extracting a matching noisy and erroneous
OCR readings and matching it against clean author and book title strings in a
standard document look-up problem setup. Finally, we demonstrate how to use
this text-matching as a feature in conjunction with popular retrieval features
such as VLAD using a simple learning setup to achieve significant improvements
in retrieval accuracy over that of either VLAD or the text alone.
Authors' comments: 8 pages, 9 figures, 1 table