Claudio Carmeli, Teiko Heinosaari, Jussi Schultz, Alessandro Toigo
We prove that, regardless of the choice of the angles $\theta_1,\theta_2,\theta_3$, three fractional Fourier transforms $F_{\theta_1}$, $F_{\theta_2}$ and $F_{\theta_3}$ do not solve the phase retrieval problem. That is, there do not exist three angles $\theta_1$, $\theta_2$, $\theta_3$ such that any signal $\psi\in L^2(R)$ could be determined up to a constant phase by knowing only the three intensities $|F_{\theta_1}\psi|^2$, $|F_{\theta_2}\psi|^2$ and $|F_{\theta_3}\psi|^2$. This provides a negative argument against a recent speculation by P. Jaming, who stated that three suitably chosen fractional Fourier transforms are good candidates for phase retrieval in infinite dimension. We recast the question in the language of quantum mechanics, where our result shows that any fixed triple of rotated quadrature observables $Q_{\theta_1}$, $Q_{\theta_2}$ and $Q_{\theta_3}$ is not enough to determine all unknown pure quantum states. The sufficiency of four rotated quadrature observables, or equivalently fractional Fourier transforms, remains an open question.
Bernardo Ferreira, João Rodrigues, João Leitão, Henrique Domingos
Storage requirements for visual data have been increasing in recent years,
following the emergence of many new highly interactive, multimedia services and
applications for both personal and corporate use. This has been a key driving
factor for the adoption of cloud-based data outsourcing solutions. However,
outsourcing data storage to the Cloud also leads to new challenges that must be
carefully addressed, especially regarding privacy. In this paper we propose a
secure framework for outsourced privacy-preserving storage and retrieval in
large image repositories. Our proposal is based on a novel cryptographic
scheme, named IES-CBIR, specifically designed for media image data. Our
solution enables both encrypted storage and querying using Content Based Image
Retrieval (CBIR) while preserving privacy. We have built a prototype of the
proposed framework, formally analyzed and proven its security properties, and
experimentally evaluated its performance and precision. Our results show that
IES-CBIR is provably secure, allows more efficient operations than existing
proposals, both in terms of time and space complexity, and enables more
realistic, interesting and practical application scenarios.
Authors' comments: This paper has been withdrawn by the author, as it is outdated and a
more recent version with major differences has been published. due to a
crucial sign error in equation 1
Romàn Zapatrin
An operationalistic scheme, called Melucci metaphor, is suggested
representing Information Retrieval as physical measurements with beam of
particles playing the role of the flow of retrieved documents. The
possibilities of query expansion by extra term are studied from this
perspective, when the particles-`docuscles' are assumed to be of classical or
quantum nature. It is shown that in both cases the choice of an extra term
based on Bayesian belief revision is still valid on the qualitative level for
boosting the relevance of the retrieved documents.
Authors' comments: Latex, 8 pages
Yonina C. Eldar, Pavel Sidorenko, Dustin G. Mixon, Shaby Barel, Oren Cohen
We consider the classical 1D phase retrieval problem. In order to overcome
the difficulties associated with phase retrieval from measurements of the
Fourier magnitude, we treat recovery from the magnitude of the short-time
Fourier transform (STFT). We first show that the redundancy offered by the STFT
enables unique recovery for arbitrary nonvanishing inputs, under mild
conditions. An efficient algorithm for recovery of a sparse input from the STFT
magnitude is then suggested, based on an adaptation of the recently proposed
GESPAR algorithm. We demonstrate through simulations that using the STFT leads
to improved performance over recovery from the oversampled Fourier magnitude
with the same number of measurements.
Authors' comments: To appear in IEEE Signal Processing Letters
Zakria Hussain, Arto Klami, Jussi Kujala, Alex P. Leung, Kitsuchart Pasupa, Peter Auer, Samuel Kaski, Jorma Laaksonen et al.
This paper describes PinView, a content-based image retrieval system that
exploits implicit relevance feedback collected during a search session. PinView
contains several novel methods to infer the intent of the user. From relevance
feedback, such as eye movements or pointer clicks, and visual features of
images, PinView learns a similarity metric between images which depends on the
current interests of the user. It then retrieves images with a specialized
online learning algorithm that balances the tradeoff between exploring new
images and exploiting the already inferred interests of the user. We have
integrated PinView to the content-based image retrieval system PicSOM, which
enables applying PinView to real-world image databases. With the new algorithms
PinView outperforms the original PicSOM, and in online experiments with real
users the combination of implicit and explicit feedback gives the best results.
Authors' comments: 12 pages
Zhuotun Zhu, Xinggang Wang, Song Bai, Cong Yao, Xiang Bai
We study the problem of how to build a deep learning representation for 3D
shape. Deep learning has shown to be very effective in variety of visual
applications, such as image classification and object detection. However, it
has not been successfully applied to 3D shape recognition. This is because 3D
shape has complex structure in 3D space and there are limited number of 3D
shapes for feature learning. To address these problems, we project 3D shapes
into 2D space and use autoencoder for feature learning on the 2D images. High
accuracy 3D shape retrieval performance is obtained by aggregating the features
learned on 2D images. In addition, we show the proposed deep learning feature
is complementary to conventional local image descriptors. By combing the global
deep learning representation and the local descriptor representation, our
method can obtain the state-of-the-art performance on 3D shape retrieval
benchmarks.
Authors' comments: 6 pages, 7 figures, 2014ICSPAC
Ardeshir Mohammad Ebtehaj, Efi Foufoula-Georgiou, Gilad Lerman, Rafael Luis Bras
We demonstrate that the global fields of temperature, humidity and
geopotential heights admit a nearly sparse representation in the wavelet
domain, offering a viable path forward to explore new paradigms of
sparsity-promoting data assimilation and compressive recovery of land
surface-atmospheric states from space. We illustrate this idea using retrieval
products of the Atmospheric Infrared Sounder (AIRS) and Advanced Microwave
Sounding Unit (AMSU) on board the Aqua satellite. The results reveal that the
sparsity of the fields of temperature is relatively pressure-independent while
atmospheric humidity and geopotential heights are typically sparser at lower
and higher pressure levels, respectively. We provide evidence that these
land-atmospheric states can be accurately estimated using a small set of
measurements by taking advantage of their sparsity prior.
Authors' comments: 12 pages, 8 figures, 1 table
Arun Surya, Swapan K. Saha
Speckle Imaging based on triple correlation is a very efficient image
reconstruction technique which is used to retrieve Fourier phase information of
the object in presence of atmospheric turbulence. We have developed both Direct
Bispectrum and Radon transform based Tomographic speckle masking algorithms to
retrieve atmospherically distorted astronomical images. The latter is a much
computationally efficient technique because it works with one dimensional image
projections. Tomographic speckle imaging provides good image recovery like
direct bispectrum but with a large improvement in computational time and memory
requirements. The algorithms were compared with speckle simulations of aperture
masking interferometry with 17 sub-apertures using different objects. The
results of the computationally efficient tomographic technique with laboratory
and real astronomical speckle images are also discussed.
Authors' comments: Journal of Optics, July 2014
Vikas Verma
Content Based Image Retrieval(CBIR) is one of the important subfield in the field of Information Retrieval. The goal of a CBIR algorithm is to retrieve semantically similar images in response to a query image submitted by the end user. CBIR is a hard problem because of the phenomenon known as $\textit {semantic gap}$. In this thesis, we aim at analyzing the performance of a CBIR system build using local feature vectors and Intermediate Matching Kernel. We also propose a Two-Step Matching process for reducing the response time of the CBIR systems. Further, we develop a Meta-Learning framework for improving the retrieval performance of these systems. Our results show that the Two-Step Matching process significantly reduces response time and the Meta-Learning Framework improves the retrieval performance by more than two fold. We also analyze the performance of various image classification systems that use different image representations constructed from the local feature vectors.
Anna-Lena Horlemann-Trautmann
Spread codes and cyclic orbit codes are special families of constant
dimension subspace codes. These codes have been well-studied for their error
correction capability, transmission rate and decoding methods, but the question
of how to encode and retrieve messages has not been investigated. In this work
we show how a message set of consecutive integers can be encoded and retrieved
for these two code families.
Authors' comments: This is an extension of the previous work "Message Encoding for
Spread and Orbit Codes", which appeared in the Proceedings of the 2014 IEEE
International Symposium on Information Theory 2014 (Honolulu, USA)
Zongcheng Ji, Zhengdong Lu, Hang Li
Human computer conversation is regarded as one of the most difficult problems
in artificial intelligence. In this paper, we address one of its key
sub-problems, referred to as short text conversation, in which given a message
from human, the computer returns a reasonable response to the message. We
leverage the vast amount of short conversation data available on social media
to study the issue. We propose formalizing short text conversation as a search
problem at the first step, and employing state-of-the-art information retrieval
(IR) techniques to carry out the task. We investigate the significance as well
as the limitation of the IR approach. Our experiments demonstrate that the
retrieval-based model can make the system behave rather "intelligently", when
combined with a huge repository of conversation data from social media.
Authors' comments: 21 pages, 4 figures
Arvind Murugan, Zorana Zeravcic, Michael P. Brenner, Stanislas Leibler
Self-assembly materials are traditionally designed so that molecular or
meso-scale components form a single kind of large structure. Here, we propose a
scheme to create "multifarious assembly mixtures", which self-assemble many
different large structures from a set of shared components. We show that the
number of multifarious structures stored in the solution of components
increases rapidly with the number of different types of components. Yet, each
stored structure can be retrieved by tuning only a few parameters, the number
of which is only weakly dependent on the size of the assembled structure.
Implications for artificial and biological self-assembly are discussed.
Authors' comments: Paper + SI. Figures at the end
Luca Matteis
How do RDF datasets currently get published on the Web? They are either available as large RDF files, which need to be downloaded and processed locally, or they exist behind complex SPARQL endpoints. By providing a RESTful API that can access triple data, we allow users to query a dataset through a simple interface based on just a couple of HTTP parameters. If RDF resources were published this way we could quickly build applications that depend on these datasets, without having to download and process them locally. This is what Restpark is: a set of HTTP GET parameters that servers need to handle, and respond with JSON-LD.
Boris Iolis, Gianluca Bontempi
We demonstrate a method to optimize the combination of distinct components in
a paragraph retrieval system. Our system makes use of several indices, query
generators and filters, each of them potentially contributing to the quality of
the returned list of results. The components are combined with a weighed sum,
and we optimize the weights using a heuristic optimization algorithm. This
allows us to maximize the quality of our results, but also to determine which
components are most valuable in our system. We evaluate our approach on the
paragraph selection task of a Question Answering dataset.
Authors' comments: 5 pages, 1 figure, unpublished
Yasser El Madani El Alami, El Habib Nfaoui, Omar El Beqqali
This paper presents an integrated multi-agents architecture for indexing and
retrieving video information.The focus of our work is to elaborate an
extensible approach that gathers a priori almost of the mandatory tools which
palliate to the major intertwining problems raised in the whole process of the
video lifecycle (classification, indexing and retrieval). In fact, effective
and optimal retrieval video information needs a collaborative approach based on
multimodal aspects. Clearly, it must to take into account the distributed
aspect of the data sources, the adaptation of the contents, semantic
annotation, personalized request and active feedback which constitute the
backbone of a vigorous system which improve its performances in a smart way
Authors' comments: 11 pages, 11 figures, The Proceeding of International Conference on
Soft Computing and Software Engineering 2013
Dilip K. Limbu, Andy M. Connor, Stephen G. MacDonell
Search engines are the most commonly used type of tool for finding relevant
information on the Internet. However, today's search engines are far from
perfect. Typical search queries are short, often one or two words, and can be
ambiguous therefore returning inappropriate results. Contextual information
retrieval (CIR) is a critical technique for these search engines to facilitate
queries and return relevant information. Despite its importance, little
progress has been made in CIR due to the difficulty of capturing and
representing contextual information about users. Numerous contextual
information retrieval approaches exist today, but to the best of our knowledge
none of them offer a similar service to the one proposed in this paper.
This paper proposes an alternative framework for contextual information
retrieval from the WWW. The framework aims to improve query results (or make
search results more relevant) by constructing a contextual profile based on a
user's behaviour, their preferences, and a shared knowledge base, and using
this information in the search engine framework to find and return relevant
information.
Authors' comments: Proceedings of the 14th International Conference on Adaptive Systems
and Software Engineering (IASSE 2005)
Ken Chatfield, Karen Simonyan, Andrew Zisserman
We investigate the gains in precision and speed, that can be obtained by
using Convolutional Networks (ConvNets) for on-the-fly retrieval - where
classifiers are learnt at run time for a textual query from downloaded images,
and used to rank large image or video datasets.
We make three contributions: (i) we present an evaluation of state-of-the-art
image representations for object category retrieval over standard benchmark
datasets containing 1M+ images; (ii) we show that ConvNets can be used to
obtain features which are incredibly performant, and yet much lower dimensional
than previous state-of-the-art image representations, and that their
dimensionality can be reduced further without loss in performance by
compression using product quantization or binarization. Consequently, features
with the state-of-the-art performance on large-scale datasets of millions of
images can fit in the memory of even a commodity GPU card; (iii) we show that
an SVM classifier can be learnt within a ConvNet framework on a GPU in parallel
with downloading the new training images, allowing for a continuous refinement
of the model as more images become available, and simultaneous training and
ranking. The outcome is an on-the-fly system that significantly outperforms its
predecessors in terms of: precision of retrieval, memory requirements, and
speed, facilitating accurate on-the-fly learning and ranking in under a second
on a single GPU.
Authors' comments: Published in proceedings of ACCV 2014
Dagmar Kern, Peter Mutschke, Philipp Mayr
We propose an online access panel to support the evaluation process of
Interactive Information Retrieval (IIR) systems - called IIRpanel. By
maintaining an online access panel with users of IIR systems we assume that the
recurring effort to recruit participants for web-based as well as for lab
studies can be minimized. We target on using the online access panel not only
for our own development processes but to open it for other interested
researchers in the field of IIR. In this paper we present the concept of
IIRpanel as well as first implementation details.
Authors' comments: 2 pages, 1 figure, 2014 IEEE/ACM Joint Conference on Digital
Libraries (JCDL), London, 8th-12th September 2014
Emmanuel Candes, Xiaodong Li, Mahdi Soltanolkotabi
We study the problem of recovering the phase from magnitude measurements;
specifically, we wish to reconstruct a complex-valued signal x of C^n about
which we have phaseless samples of the form y_r = |< a_r,x >|^2, r = 1,2,...,m
(knowledge of the phase of these samples would yield a linear system). This
paper develops a non-convex formulation of the phase retrieval problem as well
as a concrete solution algorithm. In a nutshell, this algorithm starts with a
careful initialization obtained by means of a spectral method, and then refines
this initial estimate by iteratively applying novel update rules, which have
low computational complexity, much like in a gradient descent scheme. The main
contribution is that this algorithm is shown to rigorously allow the exact
retrieval of phase information from a nearly minimal number of random
measurements. Indeed, the sequence of successive iterates provably converges to
the solution at a geometric rate so that the proposed scheme is efficient both
in terms of computational and data resources. In theory, a variation on this
scheme leads to a near-linear time algorithm for a physically realizable model
based on coded diffraction patterns. We illustrate the effectiveness of our
methods with various experiments on image data. Underlying our analysis are
insights for the analysis of non-convex optimization schemes that may have
implications for computational problems beyond phase retrieval.
Authors' comments: IEEE Transactions on Information Theory, Vol. 64 (4), Feb. 2015
D. H. Apriyanti, A. A. Arymurthy, L. T. Handoko
In this paper, we developed the system for recognizing the orchid species by
using the images of flower. We used MSRM (Maximal Similarity based on Region
Merging) method for segmenting the flower object from the background and
extracting the shape feature such as the distance from the edge to the centroid
point of the flower, aspect ratio, roundness, moment invariant, fractal
dimension and also extract color feature. We used HSV color feature with
ignoring the V value. To retrieve the image, we used Support Vector Machine
(SVM) method. Orchid is a unique flower. It has a part of flower called lip
(labellum) that distinguishes it from other flowers even from other types of
orchids. Thus, in this paper, we proposed to do feature extraction not only on
flower region but also on lip (labellum) region. The result shows that our
proposed method can increase the accuracy value of content based flower image
retrieval for orchid species up to $\pm$ 14%. The most dominant feature is
Centroid Contour Distance, Moment Invariant and HSV Color. The system accuracy
is 85,33% in validation phase and 79,33% in testing phase.
Authors' comments: Proceeding of International Conference on Computer, Control,
Informatics and its Applications 2013, pp. 53-57