Nikku Madhusudhan
Exoplanetary atmospheric retrieval refers to the inference of atmospheric
properties of an exoplanet given an observed spectrum. The atmospheric
properties include the chemical compositions, temperature profiles,
clouds/hazes, and energy circulation. These properties, in turn, can provide
key insights into the atmospheric physicochemical processes of exoplanets as
well as their formation mechanisms. Major advancements in atmospheric retrieval
have been made in the last decade, thanks to a combination of state-of-the-art
spectroscopic observations and advanced atmospheric modeling and statistical
inference methods. These developments have already resulted in key constraints
on the atmospheric H2O abundances, temperature profiles, and other properties
for several exoplanets. Upcoming facilities such as the JWST will further
advance this area. The present chapter is a pedagogical review of this exciting
frontier of exoplanetary science. The principles of atmospheric retrievals of
exoplanets are discussed in detail, including parametric models and statistical
inference methods, along with a review of key results in the field. Some of the
main challenges in retrievals with current observations are discussed along
with new directions and the future landscape.
Authors' comments: 30 pages, 3 figures, Published in Springer Handbook of Exoplanets
Shah Nawaz, Muhammad Kamran Janjua, Alessandro Calefati, Ignazio Gallo
This paper proposes a cross-modal retrieval system that leverages on image
and text encoding. Most multimodal architectures employ separate networks for
each modality to capture the semantic relationship between them. However, in
our work image-text encoding can achieve comparable results in terms of
cross-modal retrieval without having to use a separate network for each
modality. We show that text encodings can capture semantic relationships
between multiple modalities. In our knowledge, this work is the first of its
kind in terms of employing a single network and fused image-text embedding for
cross-modal retrieval. We evaluate our approach on two famous multimodal
datasets: MS-COCO and Flickr30K.
Authors' comments: 14 pages. Under review at ECCVW (MULA 2018)
Ramina Ghods, Andrew S. Lan, Tom Goldstein, Christoph Studer
Phase retrieval deals with the recovery of complex- or real-valued signals
from magnitude measurements. As shown recently, the method PhaseMax enables
phase retrieval via convex optimization and without lifting the problem to a
higher dimension. To succeed, PhaseMax requires an initial guess of the
solution, which can be calculated via spectral initializers. In this paper, we
show that with the availability of an initial guess, phase retrieval can be
carried out with an ever simpler, linear procedure. Our algorithm, called
PhaseLin, is the linear estimator that minimizes the mean squared error (MSE)
when applied to the magnitude measurements. The linear nature of PhaseLin
enables an exact and nonasymptotic MSE analysis for arbitrary measurement
matrices. We furthermore demonstrate that by iteratively using PhaseLin, one
arrives at an efficient phase retrieval algorithm that performs on par with
existing convex and nonconvex methods on synthetic and real-world data.
Authors' comments: To be presented at CISS 2018 (http://ee-ciss.princeton.edu/)
Mahtab Mirmohseni, Mohammad Ali Maddah-Ali
The widespread use of cloud computing services raises the question of how one can delegate the processing tasks to the untrusted distributed parties without breeching the privacy of its data and algorithms. Motivated by the algorithm privacy concerns in a distributed computing system, in this paper, we introduce the private function retrieval (PFR) problem, where a user wishes to efficiently retrieve a linear function of $K$ messages from $N$ non-communicating replicated servers while keeping the function hidden from each individual server. The goal is to find a scheme with minimum communication cost. To characterize the fundamental limits of the communication cost, we define the capacity of PFR problem as the size of the message that can be privately retrieved (which is the size of one file) normalized to the required downloaded information bits. We first show that for the PFR problem with $K$ messages, $N=2$ servers and a linear function with binary coefficients the capacity is $C=\frac{1}{2}\Big(1-\frac{1}{2^K}\Big)^{-1}$. Interestingly, this is the capacity of retrieving one of $K$ messages from $N=2$ servers while keeping the index of the requested message hidden from each individual server, the problem known as private information retrieval (PIR). Then, we extend the proposed achievable scheme to the case of arbitrary number of servers and coefficients in the field $GF(q)$ with arbitrary $q$ and obtain $R=\Big(1-\frac{1}{N}\Big)\Big(1+\frac{\frac{1}{N-1}}{(\frac{q^K-1}{q-1})^{N-1}}\Big)$.
Christophe Van Gysel, Maarten de Rijke, Evangelos Kanoulas
Unsupervised learning of low-dimensional, semantic representations of words
and entities has recently gained attention. In this paper we describe the
Semantic Entity Retrieval Toolkit (SERT) that provides implementations of our
previously published entity representation models. The toolkit provides a
unified interface to different representation learning algorithms, fine-grained
parsing configuration and can be used transparently with GPUs. In addition,
users can easily modify existing models or implement their own models in the
framework. After model training, SERT can be used to rank entities according to
a textual query and extract the learned entity/word representation for use in
downstream algorithms, such as clustering or recommendation.
Authors' comments: SIGIR 2017 Workshop on Neural Information Retrieval (Neu-IR'17). 2017
BingZhang Hu, Feng Zheng, Ling Shao
Face retrieval has received much attention over the past few decades, and
many efforts have been made in retrieving face images against pose,
illumination, and expression variations. However, the conventional works fail
to meet the requirements of a potential and novel task --- retrieving a
person's face image at a specific age, especially when the specific 'age' is
not given as a numeral, i.e. 'retrieving someone's image at the similar age
period shown by another person's image'. To tackle this problem, we propose a
dual reference face retrieval framework in this paper, where the system takes
two inputs: an identity reference image which indicates the target identity and
an age reference image which reflects the target age. In our framework, the raw
images are first projected on a joint manifold, which preserves both the age
and identity locality. Then two similarity metrics of age and identity are
exploited and optimized by utilizing our proposed quartet-based model. The
experiments show promising results, outperforming hierarchical methods.
Authors' comments: Accepted at AAAI 2018
Yifan Sun, Liang Zheng, Weijian Deng, Shengjin Wang
This paper proposes the SVDNet for retrieval problems, with focus on the
application of person re-identification (re-ID). We view each weight vector
within a fully connected (FC) layer in a convolutional neuron network (CNN) as
a projection basis. It is observed that the weight vectors are usually highly
correlated. This problem leads to correlations among entries of the FC
descriptor, and compromises the retrieval performance based on the Euclidean
distance. To address the problem, this paper proposes to optimize the deep
representation learning process with Singular Vector Decomposition (SVD).
Specifically, with the restraint and relaxation iteration (RRI) training
scheme, we are able to iteratively integrate the orthogonality constraint in
CNN training, yielding the so-called SVDNet. We conduct experiments on the
Market-1501, CUHK03, and Duke datasets, and show that RRI effectively reduces
the correlation among the projection vectors, produces more discriminative FC
descriptors, and significantly improves the re-ID accuracy. On the Market-1501
dataset, for instance, rank-1 accuracy is improved from 55.3% to 80.5% for
CaffeNet, and from 73.8% to 82.3% for ResNet-50.
Authors' comments: accepted as spotlight to ICCV 2017
Sara Botelho-Andrade, Peter G. Casazza, Desai Cheng, John Haas, Tin T. Tran, Janet C. Tremain, Zhiqiang Xu
We show that a scalable frame does phase retrieval if and only if the hyperplanes of its orthogonal complements do phase retrieval. We then show this result fails in general by giving an example of a frame for $\mathbb R^3$ which does phase retrieval but its induced hyperplanes fail phase retrieval. Moreover, we show that such frames always exist in $\mathbb R^d$ for any dimension $d$. We also give an example of a frame in $\mathbb R^3$ which fails phase retrieval but its perps do phase retrieval. We will also see that a family of hyperplanes doing phase retrieval in $\mathbb R^d$ must contain at least $2d-2$ hyperplanes. Finally, we provide an example of six hyperplanes in $\mathbb R^4$ which do phase retrieval.
Namrata Vaswani, Seyedehsara Nayer, Yonina C. Eldar
We develop two iterative algorithms for solving the low rank phase retrieval
(LRPR) problem. LRPR refers to recovering a low-rank matrix $\X$ from
magnitude-only (phaseless) measurements of random linear projections of its
columns. Both methods consist of a spectral initialization step followed by an
iterative algorithm to maximize the observed data likelihood. We obtain sample
complexity bounds for our proposed initialization approach to provide a good
approximation of the true $\X$. When the rank is low enough, these bounds are
significantly lower than what existing single vector phase retrieval algorithms
need. Via extensive experiments, we show that the same is also true for the
proposed complete algorithms.
Authors' comments: To appear in IEEE Trans. Signal Processing, 2017
Juan Miguel Arrazola, Markos Karasamanis, Norbert Lütkenhaus
Complex cryptographic protocols are often constructed from simpler
building-blocks. In order to advance quantum cryptography, it is important to
study practical building-blocks that can be used to develop new protocols. An
example is quantum retrieval games (QRGs), which have broad applicability and
have already been used to construct quantum money schemes. In this work, we
introduce a general construction of quantum retrieval games based on the hidden
matching problem and show how they can be implemented in practice using
available technology. More precisely, we provide a general method to construct
(1-out-of-k) QRGs, proving that their cheating probabilities decrease
exponentially in $k$. In particular, we define new QRGs based on coherent
states of light, which can be implemented even in the presence of experimental
imperfections. Our results constitute a new tool in the arsenal of the
practical quantum cryptographer.
Authors' comments: 10 pages, 11 figures
Ronghang Hu, Huazhe Xu, Marcus Rohrbach, Jiashi Feng, Kate Saenko, Trevor Darrell
In this paper, we address the task of natural language object retrieval, to
localize a target object within a given image based on a natural language query
of the object. Natural language object retrieval differs from text-based image
retrieval task as it involves spatial information about objects within the
scene and global scene context. To address this issue, we propose a novel
Spatial Context Recurrent ConvNet (SCRC) model as scoring function on candidate
boxes for object retrieval, integrating spatial configurations and global
scene-level contextual information into the network. Our model processes query
text, local image descriptors, spatial configurations and global context
features through a recurrent network, outputs the probability of the query text
conditioned on each candidate box as a score for the box, and can transfer
visual-linguistic knowledge from image captioning domain to our task.
Experimental results demonstrate that our method effectively utilizes both
local and global information, outperforming previous baseline methods
significantly on different datasets and scenarios, and can exploit large scale
vision and language datasets for knowledge transfer.
Authors' comments: Proceedings of the IEEE Conference on Computer Vision and Pattern
Recognition, 2016
Liat Liberman, Yonatan Israel, Eilon Poem, Yaron Silberberg
The retrieval of phases from intensity measurements is a key process in many
fields in science, from optical microscopy to x-ray crystallography. Here we
study phase retrieval of a one-dimensional multi-phase object that is
illuminated by quantum states of light. We generalize the iterative
Gerchberg-Saxton algorithm to photon correlation measurements on the output
plane, rather than the standard intensity measurements. We report a numerical
comparison of classical and quantum phase retrieval of a small one-dimensional
object of discrete phases from its far-field diffraction. While the classical
algorithm was ambiguous and often converged to wrong solutions, quantum light
produced a unique reconstruction with smaller errors and faster convergence. We
attribute these improvements to a larger Hilbert space that constrains the
algorithm.
Authors' comments: 6 pages, 5 figures, comments are welcome
Dan Edidin
We characterize collections of orthogonal projections for which it is
possible to reconstruct a vector from the magnitudes of the corresponding
projections. As a result we are able to show that in an $M$-dimensional real
vector space a vector can be reconstructed from the magnitudes of its
projections onto a generic collection of $N \geq 2M-1$ subspaces. We also show
that this bound is sharp when $N = 2^k +1$. The results of this paper answer a
number of questions raised in \cite{CCPW:13}.
Authors' comments: 10 pages
Michael Bloodgood, Benjamin Strauss
Translation Memory (TM) systems are one of the most widely used translation
technologies. An important part of TM systems is the matching algorithm that
determines what translations get retrieved from the bank of available
translations to assist the human translator. Although detailed accounts of the
matching algorithms used in commercial systems can't be found in the
literature, it is widely believed that edit distance algorithms are used. This
paper investigates and evaluates the use of several matching algorithms,
including the edit distance algorithm that is believed to be at the heart of
most modern commercial TM systems. This paper presents results showing how well
various matching algorithms correlate with human judgments of helpfulness
(collected via crowdsourcing with Amazon's Mechanical Turk). A new algorithm
based on weighted n-gram precision that can be adjusted for translator length
preferences consistently returns translations judged to be most helpful by
translators for multiple domains and language pairs.
Authors' comments: 9 pages, 6 tables, 3 figures; appeared in Proceedings of the 14th
Conference of the European Chapter of the Association for Computational
Linguistics, April 2014
Ju-Chiang Wang, Yi-Hsuan Yang, Hsin-Min Wang
Much of the appeal of music lies in its power to convey emotions/moods and to
evoke them in listeners. In consequence, the past decade witnessed a growing
interest in modeling emotions from musical signals in the music information
retrieval (MIR) community. In this article, we present a novel generative
approach to music emotion modeling, with a specific focus on the
valence-arousal (VA) dimension model of emotion. The presented generative
model, called \emph{acoustic emotion Gaussians} (AEG), better accounts for the
subjectivity of emotion perception by the use of probability distributions.
Specifically, it learns from the emotion annotations of multiple subjects a
Gaussian mixture model in the VA space with prior constraints on the
corresponding acoustic features of the training music pieces. Such a
computational framework is technically sound, capable of learning in an online
fashion, and thus applicable to a variety of applications, including
user-independent (general) and user-dependent (personalized) emotion
recognition and emotion-based music retrieval. We report evaluations of the
aforementioned applications of AEG on a larger-scale emotion-annotated corpora,
AMG1608, to demonstrate the effectiveness of AEG and to showcase how
evaluations are conducted for research on emotion-based MIR. Directions of
future work are also discussed.
Authors' comments: 40 pages, 18 figures, 5 tables, author version
Suma D., U. Dinesh Acharya, Geetha M., Raviraja Holla M
Locating and distilling the valuable relevant information continued to be the
major challenges of Information Retrieval (IR) Systems owing to the explosive
growth of online web information. These challenges can be considered the XML
Information Retrieval challenges as XML has become a de facto standard over the
Web. The research on XML IR starts with the classical IR strategies customized
to XML IR. Later novel IR strategies specific to XML IR are evolved. Meanwhile
literatures reveal development of the rapid and intelligent IR systems. Despite
their success in their specified constrained domains, they have additional
limitations in the complex information space. The effectiveness of IR systems
is thus unsolved in satisfying the most. This article attemptsan overview of
earlier efforts and the gaps in XML IR.
Authors' comments: 7 pages, 0 figures
Ko-Jen Hsiao, Alex Kulesza, Alfred Hero
Socially-based recommendation systems have recently attracted significant
interest, and a number of studies have shown that social information can
dramatically improve a system's predictions of user interests. Meanwhile, there
are now many potential applications that involve aspects of both recommendation
and information retrieval, and the task of collaborative retrieval---a
combination of these two traditional problems---has recently been introduced.
Successful collaborative retrieval requires overcoming severe data sparsity,
making additional sources of information, such as social graphs, particularly
valuable. In this paper we propose a new model for collaborative retrieval, and
show that our algorithm outperforms current state-of-the-art approaches by
incorporating information from social networks. We also provide empirical
analyses of the ways in which cultural interests propagate along a social graph
using a real-world music dataset.
Authors' comments: 10 pages
Philipp Mayr, Andrea Scharnhorst, Birger Larsen, Philipp Schaer, Peter Mutschke
Bibliometric techniques are not yet widely used to enhance retrieval
processes in digital libraries, although they offer value-added effects for
users. In this workshop we will explore how statistical modelling of
scholarship, such as Bradfordizing or network analysis of coauthorship network,
can improve retrieval services for specific communities, as well as for large,
cross-domain collections. This workshop aims to raise awareness of the missing
link between information retrieval (IR) and bibliometrics/scientometrics and to
create a common ground for the incorporation of bibliometric-enhanced services
into retrieval at the digital library interface.
Authors' comments: 6 pages, accepted workshop proposal for ECIR 2014
Jameson Cahill, Peter G. Casazza, Jesse Peterson, Lindsey Woodland
The problem of recovering a vector from the absolute values of its inner products against a family of measurement vectors has been well studied in mathematics and engineering. A generalization of this phase retrieval problem also exists in engineering: recovering a vector from measurements consisting of norms of its orthogonal projections onto a family of subspaces. There exist semidefinite programming algorithms to solve this problem, but much remains unknown for this more general case. Can families of subspaces for which such measurements are injective be completely classified? What is the minimal number of subspaces required to have injectivity? How closely does this problem compare to the usual phase retrieval problem with families of measurement vectors? In this paper, we answer or make incremental steps toward these questions. We provide several characterizations of subspaces which yield injective measurements, and through a concrete construction, we prove the surprising result that phase retrieval can be achieved with $2M-1$ projections of arbitrary rank in $\HH_M$. Finally we present several open problems as we discuss issues unique to the phase retrieval problem with subspaces.
Henrik Ohlsson, Yonina C. Eldar, Allen Y. Yang, S. Shankar Sastry
The classical shift retrieval problem considers two signals in vector form
that are related by a shift. The problem is of great importance in many
applications and is typically solved by maximizing the cross-correlation
between the two signals. Inspired by compressive sensing, in this paper, we
seek to estimate the shift directly from compressed signals. We show that under
certain conditions, the shift can be recovered using fewer samples and less
computation compared to the classical setup. Of particular interest is shift
estimation from Fourier coefficients. We show that under rather mild conditions
only one Fourier coefficient suffices to recover the true shift.
Authors' comments: Submitted to IEEE Transactions on Signal Processing. Accepted to the
38th International Conference on Acoustics, Speech, and Signal Processing
(ICASSP), Vancouver, Canada, May 2013