Jialu Liu
This article gives a survey for bag-of-words (BoW) or bag-of-features model in image retrieval system. In recent years, large-scale image retrieval shows significant potential in both industry applications and research problems. As local descriptors like SIFT demonstrate great discriminative power in solving vision problems like object recognition, image classification and annotation, more and more state-of-the-art large scale image retrieval systems are trying to rely on them. A common way to achieve this is first quantizing local descriptors into visual words, and then applying scalable textual indexing and retrieval schemes. We call this model as bag-of-words or bag-of-features model. The goal of this survey is to give an overview of this model and introduce different strategies when building the system based on this model.
Liu Liang
Image retrieval has been a top topic in the field of both computer vision and
machine learning for a long time. Content based image retrieval, which tries to
retrieve images from a database visually similar to a query image, has
attracted much attention. Two most important issues of image retrieval are the
representation and ranking of the images. Recently, bag-of-words based method
has shown its power as a representation method. Moreover, nonnegative matrix
factorization is also a popular way to represent the data samples. In addition,
contextual similarity learning has also been studied and proven to be an
effective method for the ranking problem. However, these technologies have
never been used together. In this paper, we developed an effective image
retrieval system by representing each image using the bag-of-words method as
histograms, and then apply the nonnegative matrix factorization to factorize
the histograms, and finally learn the ranking score using the contextual
similarity learning method. The proposed novel system is evaluated on a large
scale image database and the effectiveness is shown.
Authors' comments: 4 pages
Marko Horvat, Anton Grbin, Gordan Gledec
Repositories of images with semantic and emotion content descriptions are
valuable tools in many areas such as Affective Computing and Human-Computer
Interaction, but they are also important in the development of multimodal
searchable online databases. Ever growing number of image documents available
on the Internet continuously motivates research of better annotation models and
more efficient retrieval methods which use mash-up of available data on
semantics, scenes, objects, events, context and emotion. Formal knowledge
representation of such high-level semantics requires rich, explicit, human but
also machine-processable information. To achieve these goals we present an
online ontology-based image annotation tool WNtags and demonstrate its
usefulness in knowledge representation and image retrieval using the
International Affective Picture System database. The WNtags uses WordNet as
image tagging glossary but considers Suggested Upper Merged Ontology as the
preferred upper labeling formalism. The retrieval is performed using node
distance metrics to establish semantic relatedness between a query and the
collaboratively weighted tags describing high-level image semantics, after
which the result is ranked according to the derived importance. We also
elaborate plans to improve the WNtags to create a collaborative Web-based
multimedia repository for research in human emotion and attention.
Authors' comments: 16 pages, 4 figures. arXiv admin note: substantial text overlap with
arXiv:1302.2223
Robert Fung, S. L. Crawford, Lee A. Appelbaum, Richard M. Tong
While concept-based methods for information retrieval can provide improved
performance over more conventional techniques, they require large amounts of
effort to acquire the concepts and their qualitative and quantitative
relationships. This paper discusses an architecture for probabilistic
concept-based information retrieval which addresses the knowledge acquisition
problem. The architecture makes use of the probabilistic networks technology
for representing and reasoning about concepts and includes a knowledge
acquisition component which partially automates the construction of concept
knowledge bases from data. We describe two experiments that apply the
architecture to the task of retrieving documents about terrorism from a set of
documents from the Reuters news service. The experiments provide positive
evidence that the architecture design is feasible and that there are advantages
to concept-based methods.
Authors' comments: Appears in Proceedings of the Sixth Conference on Uncertainty in
Artificial Intelligence (UAI1990)
Afonso S. Bandeira, Yutong Chen, Dustin G. Mixon
In diffraction imaging, one is tasked with reconstructing a signal from its
power spectrum. To resolve the ambiguity in this inverse problem, one might
invoke prior knowledge about the signal, but phase retrieval algorithms in this
vein have found limited success. One alternative is to create redundancy in the
measurement process by illuminating the signal multiple times, distorting the
signal each time with a different mask. Despite several recent advances in
phase retrieval, the community has yet to construct an ensemble of masks which
uniquely determines all signals and admits an efficient reconstruction
algorithm. In this paper, we leverage the recently proposed polarization method
to construct such an ensemble. We also present numerical simulations to
illustrate the stability of the polarization method in this setting. In
comparison to a state-of-the-art phase retrieval algorithm known as PhaseLift,
we find that polarization is much faster with comparable stability.
Authors' comments: 18 pages, 3 figures
Miroslav Stampar
This paper describes an advanced SQL injection technique where DNS resolution
process is exploited for retrieval of malicious SQL query results. Resulting
DNS requests are intercepted by attackers themselves at the controlled remote
name server extracting valuable data. Open source SQL injection tool sqlmap has
been adjusted to automate this task. With modifications done, attackers are
able to use this technique for fast and low profile data retrieval, especially
in cases where other standard ones fail.
Authors' comments: 7 pages, 3 figures, 1 table. Presented at PHDays 2012 security
conference, Moscow, Russia
Mounira Taileb
Content-based image retrieval (CBIR) has been one of the most important
research areas in computer vision. It is a widely used method for searching
images in huge databases. In this paper we present a CBIR system called
NOHIS-Search. The system is based on the indexing technique NOHIS-tree. The two
phases of the system are described and the performance of the system is
illustrated with the image database ImagEval. NOHIS-Search system was compared
to other two CBIR systems; the first that using PDDP indexing algorithm and the
second system is that using the sequential search. Results show that
NOHIS-Search system outperforms the two other systems.
Authors' comments: 6 pages, 10th International Conference on Advances in Mobile
Computing & Multimedia (MoMM2012)
Afonso S. Bandeira, Jameson Cahill, Dustin G. Mixon, Aaron A. Nelson
Recent advances in convex optimization have led to new strides in the phase
retrieval problem over finite-dimensional vector spaces. However, certain
fundamental questions remain: What sorts of measurement vectors uniquely
determine every signal up to a global phase factor, and how many are needed to
do so? Furthermore, which measurement ensembles lend stability? This paper
presents several results that address each of these questions. We begin by
characterizing injectivity, and we identify that the complement property is
indeed a necessary condition in the complex case. We then pose a conjecture
that 4M-4 generic measurement vectors are both necessary and sufficient for
injectivity in M dimensions, and we prove this conjecture in the special cases
where M=2,3. Next, we shift our attention to stability, both in the worst and
average cases. Here, we characterize worst-case stability in the real case by
introducing a numerical version of the complement property. This new property
bears some resemblance to the restricted isometry property of compressed
sensing and can be used to derive a sharp lower Lipschitz bound on the
intensity measurement mapping. Localized frames are shown to lack this property
(suggesting instability), whereas Gaussian random measurements are shown to
satisfy this property with high probability. We conclude by presenting results
that use a stochastic noise model in both the real and complex cases, and we
leverage Cramer-Rao lower bounds to identify stability with stronger versions
of the injectivity characterizations.
Authors' comments: 22 pages
Fanny Yang, Volker Pohl, Holger Boche
This paper considers the recovery of continuous time signals from the
magnitude of its samples. It uses a combination of structured modulation and
oversampling and provides sufficient conditions on the signal and the sampling
system such that signal recovery is possible. In particular, it is shown that
an average sampling rate of four times the Nyquist rate is sufficient to
reconstruct a signal from its magnitude measurements.
Authors' comments: Submitted to SAMPTA 2013
A. J. E. M. Janssen, Johan S. H. van Leeuwaarden
In many-server systems it is crucial to staff the right number of servers so
that targeted service levels are met. These staffing problems typically lead to
constraint satisfaction problems that are hard to solve. During the last
decade, a powerful many-server asymptotic theory has been developed to solve
such problems and optimal staffing rules are known to obey the square-root
staffing principle. This paper develops many-server asymptotics in the
so-called QED regime, and presents refinements to many-server asymptotics and
square-root staffing for a Markovian queueing model with admission control and
retrials.
Authors' comments: This is a longer report version of a paper which is under submission
Ana L. Teixeira, Rui C. Santos, Joao P. Leal, Jose A. Martinho Simoes, Andre O. Falcao
Standard enthalpies of formation are used for assessing the efficiency and safety of chemical processes in the chemical industry. However, the number of compounds for which the enthalpies of formation are available is many orders of magnitude smaller than the number of known compounds. Thermochemical data prediction methods are therefore clearly needed. Several commercial and free chemical databases are currently available, the NIST WebBook being the most used free source. To overcome this problem a cheminformatics system was designed and built with two main objectives in mind: collecting and retrieving critically evaluated thermochemical values, and estimating new data. In its present version, by using cheminformatics techniques, ThermInfo allows the retrieval of the value of a thermochemical property, such as a gas-phase standard enthalpy of formation, by inputting, for example, the molecular structure or the name of a compound. The same inputs can also be used to estimate data (presently restricted to non-polycyclic hydrocarbons) by using the Extended Laidler Bond Additivity (ELBA) method. The information system is publicly available at http://www.therminfo.com or http://therminfo.lasige.di.fc.ul.pt. ThermInfo's strength lies in the data quality, availability (free access), search capabilities, and, in particular, prediction ability, based on a user-friendly interface that accepts inputs in several formats.
Ameer Tawfik Albaham, Naomie Salim
Online forums or message boards are rich knowledge-based communities. In
these communities, thread retrieval is an essential tool facilitating
information access. However, the issue on thread search is how to combine
evidence from text units(messages) to estimate thread relevance. In this paper,
we first rank a list of messages, then we score threads by aggregating their
ranked messages' scores. To aggregate the message scores, we adopt several
voting techniques that have been applied in ranking aggregates tasks such as
blog distillation and expert finding. The experimental result shows that many
voting techniques should be preferred over a baseline that treats a thread as a
concatenation of its message texts.
Authors' comments: The original publication is available at
http://www.springerlink.com/. Fixing minor typos. arXiv admin note: text
overlap with arXiv:1212.5590
Youssef Bassil
This paper is a survey discussing Information Retrieval concepts, methods,
and applications. It goes deep into the document and query modelling involved
in IR systems, in addition to pre-processing operations such as removing stop
words and searching by synonym techniques. The paper also tackles text
categorization along with its application in neural networks and machine
learning. Finally, the architecture of web crawlers is to be discussed shedding
the light on how internet spiders index web documents and how they allow users
to search for items on the web.
Authors' comments: LACSC - Lebanese Association for Computational Sciences,
http://www.lacsc.org
P. L. Aisher, J. Crass, C. Mackay
Increasing interest in astronomical applications of non-linear curvature
wavefront sensors for turbulence detection and correction makes it important to
understand how best to handle the data they produce, particularly at low light
levels. Algorithms for wavefront phase-retrieval from a four-plane curvature
wavefront sensor are developed and compared, with a view to their use for low
order phase compensation in instruments combining adaptive optics and Lucky
Imaging. The convergence speed and quality of iterative algorithms is compared
to their step-size and techniques for phase retrieval at low photon counts are
explored.
Computer simulations show that at low light levels, preprocessing by
convolution of the measured signal with a gaussian function can reduce by an
order of magnitude the photon flux required for accurate phase retrieval of
low-order errors. This facilitates wavefront correction on large telescopes
with very faint reference stars.
Authors' comments: 16 pages. Accepted for publication in MNRAS
R. K. Roul, S. K. Sahay
The size of web has increased exponentially over the past few years with
thousands of documents related to a subject available to the user. With this
much amount of information available, it is not possible to take the full
advantage of the World Wide Web without having a proper framework to search
through the available data. This requisite organization can be done in many
ways. In this paper we introduce a combine approach to cluster the web pages
which first finds the frequent sets and then clusters the documents. These
frequent sets are generated by using Frequent Pattern growth technique. Then by
applying Fuzzy C- Means algorithm on it, we found clusters having documents
which are highly related and have similar features. We used Gensim package to
implement our approach because of its simplicity and robust nature. We have
compared our results with the combine approach of (Frequent Pattern growth,
K-means) and (Frequent Pattern growth, Cosine_Similarity). Experimental results
show that our approach is more efficient then the above two combine approach
and can handles more efficiently the serious limitation of traditional Fuzzy
C-Means algorithm, which is sensitiveto initial centroid and the number of
clusters to be formed.
Authors' comments: 11 Pages, 2 figures
Deepika Sharma, Deepak Garg
Internet is one of the main sources of information for millions of people. One can find information related to practically all matters on internet. Moreover if we want to retrieve information about some particular topic we may find thousands of Web Pages related to that topic. But our main concern is to find relevant Web Pages from among that collection. So in this paper I have discussed that how information is retrieved from the web and the efforts required for retrieving this information in terms of system and users efforts.
Md. Abdullah al Mamun, Md. Hanif, Md. Rakib Uddin, Tanvir Ahmed, Md. Mofizul Islam
Finding desired information from large data set is a difficult problem.
Information retrieval is concerned with the structure, analysis, organization,
storage, searching, and retrieval of information. Index is the main constituent
of an IR system. Now a day exponential growth of information makes the index
structure large enough affecting the IR system's quality. So compressing the
Index structure is our main contribution in this paper. We compressed the
document number in inverted file entries using a new coding technique based on
run-length encoding. Our coding mechanism uses a specified code which acts over
run-length coding. We experimented and found that our coding mechanism on an
average compresses 67.34% percent more than the other techniques.
Authors' comments: 5 pages
Ipsita Mohanty, R. Leela Velusamy
Advanced internet technologies providing services like e-mail, social
networking, online banking, online shopping etc., have made day-to-day
activities simple and convenient. Increasing dependency on the internet,
convenience, and decreasing cost of electronic devices have resulted in
frequent use of online services. However, increased indulgence over the
internet has also accelerated the pace of digital crimes. The increase in
number and complexity of digital crimes has caught the attention of forensic
investigators. The Digital Investigators are faced with the challenge of
gathering accurate digital evidence from as many sources as possible. In this
paper, an attempt was made to recover digital evidence from a system's RAM in
the form of information about the most recent browsing session of the user.
Four different applications were chosen and the experiment was conducted across
two browsers. It was found that crucial information about the target user such
as, user name, passwords, etc., was recoverable.
Authors' comments: 15 pages, 9 figures; International Journal of Security, Privacy and
Trust Management (IJSPTM), Vol. 1, No 3/4, August 2012
Reza Tavoli, Fariborz Mahmoudi
Research has been devoted in the past few years to relevance feedback as an effective solution to improve performance of information retrieval systems. Relevance feedback refers to an interactive process that helps to improve the retrieval performance. In this paper we propose the use of relevance feedback to improve document image retrieval System (DIRS) performance. This paper compares a variety of strategies for positive and negative feedback. In addition, feature subspace is extracted and updated during the feedback process using a Principal Component Analysis (PCA) technique and based on user's feedback. That is, in addition to reducing the dimensionality of feature spaces, a proper subspace for each type of features is obtained in the feedback process to further improve the retrieval accuracy. Experiments show that using relevance Feedback in DIR achieves better performance than common DIR.
Qiong Liu
Extensive research efforts have been dedicated to 3D model retrieval in
recent decades. Recently, view-based methods have attracted much research
attention due to the high discriminative property of multi-views for 3D object
representation. In this report, we summarize the view-based 3D model methods
and provide the further research trends. This paper focuses on the scheme for
matching between multiple views of 3D models and the application of
bag-of-visual-words method in 3D model retrieval. For matching between multiple
views, the many-to-many matching, probabilistic matching and semisupervised
learning methods are introduced. For bag-of-visual-words application in 3D
model retrieval, we first briefly review the bag-of-visual-words works on
multimedia and computer vision tasks, where the visual dictionary has been
detailed introduced. Then a series of 3D model retrieval methods by using
bag-of-visual-words description are surveyed in this paper. At last, we
summarize the further research content in view-based 3D model retrieval.
Authors' comments: 15 pages. arXiv admin note: text overlap with arXiv:1207.7244 by
other author without attribution