Ranjeet Devarakonda, Giri Palanisamy, Bruce Wilson
The Open Archive Initiative Protocol for Metadata Handling (OAI-PMHiii) is a
standard that is seeing increased use as a means for exchanging structured
metadata. OAI-PMH implementations must support Dublin Core as a metadata
standard, with other metadata formats as optional. We have developed tools
which enable Mercury to consume metadata from OAI-PMH services in any of the
metadata formats we support (Dublin Core, Darwin Core, FCDC CSDGM, GCMD DIF,
EML, and ISO 19115/19137). We are also making ORNL DAAC metadata available
through OAI-PMH for other metadata tools to utilize. This paper describes
Mercury capabilities with multiple metadata formats, in general, and, more
specifically, the results of our OAI-PMH implementations and the lessons
learned.
Authors' comments: This paper has been withdrawn by the authors. Planning to submit a
journal paper
Ross N. Hoffman
As an alternative to either directly assimilating radiances or the naive use
of retrieved profiles (of temperature, humidity, aerosols, and chemical
species), a strategy is described that makes use of the so-called averaging
kernel (AK) and other information from the retrieval process. This AK approach
has the potential to improve the use of remotely sensed observations of the
atmosphere. First, we show how to use the AK and the retrieval noise covariance
to transform the retrieved quantities into observations that are unbiased and
have uncorrelated errors, and to eliminate both the smoothing inherent in the
retrieval process and the effect of the prior. Since the effect of the prior is
removed, any prior, including the forecast from the data assimilation cycle can
be used. Then we show how to transform this result into EOF space, when a
truncated EOF series has been used in the retrieval process. This provides a
degree of data compression and eliminates those transformed variables that have
very small information content. In both approaches a vertical interpolation
from the dynamical model coordinate to the radiative transfer coordinate is
required. We define an algorithm using the EOF representation to optimize this
vertical interpolation
Authors' comments: Revision adds more detail and clarity
Eric L. Seidel, Gabrielle Allen, Steven Brandt, Frank Löffler, Erik Schnetter
Assembling simulation software along with the associated tools and utilities
is a challenging endeavor, particularly when the components are distributed
across multiple source code versioning systems. It is problematic for
researchers compiling and running the software across many different
supercomputers, as well as for novices in a field who are often presented with
a bewildering list of software to collect and install. In this paper, we
describe a language (CRL) for specifying software components with the details
needed to obtain them from source code repositories. The language supports
public and private access. We describe a tool called GetComponents which
implements CRL and can be used to assemble software. We demonstrate the tool
for application scenarios with the Cactus Framework on the NSF TeraGrid
resources. The tool itself is distributed with an open source license and
freely available from our web page.
Authors' comments: 8 pages, 5 figures, TeraGrid 2010
Hui Hui Wang, Dzulkifli Mohamad, N. A. Ismail
This paper attempts to discuss the evolution of the retrieval approaches
focusing on development, challenges and future direction of the image
retrieval. It highlights both the already addressed and outstanding issues. The
explosive growth of image data leads to the need of research and development of
Image Retrieval. However, Image retrieval researches are moving from keyword,
to low level features and to semantic features. Drive towards semantic features
is due to the problem of the keywords which can be very subjective and time
consuming while low level features cannot always describe high level concepts
in the users' mind. Hence, introducing an interpretation inconsistency between
image descriptors and high level semantics that known as the semantic gap. This
paper also discusses the semantic gap issues, user query mechanisms as well as
common ways used to bridge the gap in image retrieval.
Authors' comments: IEEE Publication Format,
https://sites.google.com/site/journalofcomputing/
R. Sivaraman, R. M. Chandrasekaran
Web Based Query Management System (WBQMS) is a methodology to design and to
implement Mobile Business, in which a server is the gateway to connect
databases with clients which sends requests and receives responses in a
distributive manner. The gateway, which communicates with mobile phone via GSM
Modem, receives the coded queries from users and sends packed results back. The
software which communicates with the gateway system via SHORT MESSAGE, packs
users' requests, IDs and codes, and sends the package to the gateway; then
interprets the packed data for the users to read on a page of GUI. Whenever and
wherever they are, the customer can query the information by sending messages
through the client device which may be mobile phone or PC. The mobile clients
can get the appropriate services through the mobile business architecture in
distributed environment. The messages are secured through the client side
encoding mechanism to avoid the intruders. The gateway system is programmed by
Java, while the software at clients by J2ME and the database is created by
Oracle for reliable and interoperable services.
Authors' comments: IEEE Publication format, International Journal of Computer Science
and Information Security, IJCSIS, Vol. 7 No. 3, March 2010, USA. ISSN 1947
5500, http://sites.google.com/site/ijcsis/
S. Saraswathi, Asma Siddhiqaa. M, Kalaimagal. K, Kalaiyarasi. M
This paper addresses the design and implementation of BiLingual Information
Retrieval system on the domain, Festivals. A generic platform is built for
BiLingual Information retrieval which can be extended to any foreign or Indian
language working with the same efficiency. Search for the solution of the query
is not done in a specific predefined set of standard languages but is chosen
dynamically on processing the user's query. This paper deals with Indian
language Tamil apart from English. The task is to retrieve the solution for the
user given query in the same language as that of the query. In this process, a
Ontological tree is built for the domain in such a way that there are entries
in the above listed two languages in every node of the tree. A Part-Of-Speech
(POS) Tagger is used to determine the keywords from the given query. Based on
the context, the keywords are translated to appropriate languages using the
Ontological tree. A search is performed and documents are retrieved based on
the keywords. With the use of the Ontological tree, Information Extraction is
done. Finally, the solution for the query is translated back to the query
language (if necessary) and produced to the user.
Authors' comments: https://sites.google.com/site/journalofcomputing/
Sumalatha Ramachandran, Sharon Joseph, Sujaya Paulraj, Vetriselvi Ramaraj
Web search engines retrieve a vast amount of information for a given search
query. But the user needs only trustworthy and high-quality information from
this vast retrieved data. The response time of the search engine must be a
minimum value in order to satisfy the user. An optimum level of response time
should be maintained even when the system is overloaded. This paper proposes an
optimal Load Shedding algorithm which is used to handle overload conditions in
real-time data stream applications and is adapted to the Information Retrieval
System of a web search engine. Experiment results show that the proposed
algorithm enables a web search engine to provide trustworthy search results to
the user within an optimum response time, even during overload conditions.
Authors' comments: https://sites.google.com/site/journalofcomputing/
Carlos M. Lorenzetti, Ana G. Maguitman
This paper proposes an incremental method that can be used by an intelligent
system to learn better descriptions of a thematic context. The method starts
with a small number of terms selected from a simple description of the topic
under analysis and uses this description as the initial search context. Using
these terms, a set of queries are built and submitted to a search engine. New
documents and terms are used to refine the learned vocabulary. Evaluations
performed on a large number of topics indicate that the learned vocabulary is
much more effective than the original one at the time of constructing queries
to retrieve relevant material.
Authors' comments: 10 pages, 3 figures, CLEI 2008
Rajkumar Kannan
The basic classification techniques for organizing information are thesauri,
taxonomy and faceted classification. Topic map is relatively a new entrant to
this information space. Topic map standard describes how complex relationships
between abstract concepts and real world resources can be represented using XML
syntax. This paper explores how topic map incorporates the traditional
techniques and what are its advantages and disadvantages in several dimensions
such as content management, indexing, knowledge representation, constraint
specification and query languages in the context of information retrieval. The
constructs of topic maps are illustrated with a use-case implemented in XTM
Authors' comments: National Conference on Advances in Knowledge Management(NCAKM'10),
pp195-198, March 2010, India
Simin Feng
We apply the equivalent theory to orthorhombic anisotropic materials and
provide a general unit-cell design criterion for achieving a length-independent
retrieval of the effective material parameters from a single layer of unit
cells. We introduce a graphical retrieval method and phase unwrapping
techniques. The graphical method utilizes the linear regression technique. Our
method can reduce the uncertainty of experimental measurements and the
ambiguity of phase unwrapping. Moreover, the graphical method can
simultaneously determine the bulk values of the six effective material
parameters, permittivity and permeability tensors, from a single layer of unit
cells.
Authors' comments: This paper has been withdrawn by the author. Replaced by a new
version with a new title "Graphical retrieval method for orthorhombic
anisotropic materials."
Ismail I. Amr, Mohamed Amin, Passent El Kafrawy, Amr M. Sauber
Although content-based image retrieval (CBIR) is not a new subject, it keeps
attracting more and more attention, as the amount of images grow tremendously
due to internet, inexpensive hardware and automation of image acquisition. One
of the applications of CBIR is fetching images from a database. This paper
presents a new method for automatic image retrieval using moment invariants and
image entropy, our technique could be used to find semi or perfect matches
based on query by example manner, experimental results demonstrate that the
purposed technique is scalable and efficient.
Authors' comments: IEEE format, International Journal of Computer Science and
Information Security, IJCSIS January 2010, ISSN 1947 5500,
http://sites.google.com/site/ijcsis/
Rajkumar Kannan, Balakrishnan Ramadoss
Dance video is one of the important types of narrative videos with semantic
rich content. This paper proposes a new meta model, Dance Video Content Model
(DVCM) to represent the expressive semantics of the dance videos at multiple
granularity levels. The DVCM is designed based on the concepts such as video,
shot, segment, event and object, which are the components of MPEG-7 MDS. This
paper introduces a new relationship type called Temporal Semantic Relationship
to infer the semantic relationships between the dance video objects. Inverted
file based index is created to reduce the search time of the dance queries. The
effectiveness of containment queries using precision and recall is depicted.
Keywords: Dance Video Annotations, Effectiveness Metrics, Metamodeling,
Temporal Semantic Relationships.
Authors' comments: INFOCOMP Journal of Computer Science, Brazil
Jayanthi Manicassamy, P. Dhavachelvan
Now a day's, search engines are been most widely used for extracting information's from various resources throughout the world. Where, majority of searches lies in the field of biomedical for retrieving related documents from various biomedical databases. Currently search engines lacks in document clustering and representing relativeness level of documents extracted from the databases. In order to overcome these pitfalls a text based search engine have been developed for retrieving documents from Medline and PubMed biomedical databases. The search engine has incorporated page ranking bases clustering concept which automatically represents relativeness on clustering bases. Apart from this graph tree construction is made for representing the level of relatedness of the documents that are networked together. This advance functionality incorporation for biomedical document based search engine found to provide better results in reviewing related documents based on relativeness.
N. Madhusudhan, S. Seager
We present a new method to retrieve molecular abundances and temperature
profiles from exoplanet atmosphere photometry and spectroscopy. We run millions
of 1D atmosphere models in order to cover the large range of allowed parameter
space, and present error contours in the atmospheric properties, given the
data. In order to run such a large number of models, we have developed a
parametric pressure-temperature (P-T) profile coupled with line-by-line
radiative transfer, hydrostatic equilibrium, and energy balance, along with
prescriptions for non-equilibrium molecular composition and energy
redistribution. We apply our temperature and abundance retrieval method to the
atmospheres of two transiting exoplanets, HD 189733b and HD 209458b, which have
the best available Spitzer and HST observations. For HD 189733b, we find
efficient day-night redistribution of energy in the atmosphere, and molecular
abundance constraints confirming the presence of H2O, CO, CH4, and CO2. For HD
209458b, we confirm and constrain the day-side thermal inversion in an average
1D temperature profile. We also report independent detections of H$_2$O, CO,
CH$_4$ and CO$_2$ on the dayside of HD 209458b, based on six-channel Spitzer
photometry. We report constraints for HD 189733b due to individual data sets
separately; a few key observations are variable in different data sets at
similar wavelengths. Moreover, a noticeably strong carbon dioxide absorption in
one data set is significantly weaker in another. We must, therefore,
acknowledge the strong possibility that the atmosphere is variable, both in its
energy redistribution state and in the chemical abundances.
Authors' comments: 20 pages in emulateapj format, 11 figures. Final version, after proof
corrections
Young-Wook Cho, Yoon-Ho Kim
We report slowed propagation and storage and retrieval of thermal light in
warm rubidium vapor using the effect of electromagnetically-induced
transparency (EIT). We first demonstrate slowed-propagation of the probe
thermal light beam through an EIT medium by measuring the second-order
correlation function of the light field using the Hanbury-Brown$-$Twiss
interferometer. We also report an experimental study on the effect of the EIT
slow-light medium on the temporal coherence of thermal light. Finally, we
demonstrate the storage and retrieval of thermal light beam in the EIT medium.
The direct measurement of the photon number statistics of the retrieved light
field shows that the photon number statistics is preserved during the storage
and retrieval process.
Authors' comments: 4 pages, 4 figures
R. Sivaraman, R. Prabakaran, S. Sujatha
WiCoM enables remote management of web resources. Our application Mobile
reporter is aimed at Journalist, who will be able to capture the events in
real-time using their mobile phones and update their web server on the latest
event. WiCoM has been developed using J2ME technology on the client side and
PHP on the server side. The communication between the client and the server is
established through GPRS. Mobile reporter will be able to upload, edit and
remove both textual as well as multimedia contents in the server.
Authors' comments: 4 Pages IEEE format, International Journal of Computer Science and
Information Security, IJCSIS 2009, ISSN 1947 5500, Impact Factor 0.423,
http://sites.google.com/site/ijcsis/
Patrice Lopez, Laurent Romary
This paper presents the system called PATATRAS (PATent and Article Tracking, Retrieval and AnalysiS) realized for the IP track of CLEF 2009. Our approach presents three main characteristics: 1. The usage of multiple retrieval models (KL, Okapi) and term index definitions (lemma, phrase, concept) for the three languages considered in the present track (English, French, German) producing ten different sets of ranked results. 2. The merging of the different results based on multiple regression models using an additional validation set created from the patent collection. 3. The exploitation of patent metadata and of the citation structures for creating restricted initial working sets of patents and for producing a final re-ranking regression model. As we exploit specific metadata of the patent documents and the citation relations only at the creation of initial working sets and during the final post ranking step, our architecture remains generic and easy to extend.
Priti Maheswary, Namita Srivastava
Grouping images into semantically meaningful categories using low-level
visual feature is a challenging and important problem in content-based image
retrieval. The groupings can be used to build effective indices for an image
database. Digital image analysis techniques are being used widely in remote
sensing assuming that each terrain surface category is characterized with
spectral signature observed by remote sensors. Even with the remote sensing
images of IRS data, integration of spatial information is expected to assist
and to improve the image analysis of remote sensing data. In this paper we
present a satellite image retrieval based on a mixture of old fashioned ideas
and state of the art learning tools. We have developed a methodology to
classify remote sensing images using HSV color features and Haar wavelet
texture features and then grouping them on the basis of particular threshold
value. The experimental results indicate that the use of color and texture
feature extraction is very useful for image retrieval.
Authors' comments: 5 pages IEEE format, International Journal of Computer Science and
Information Security, IJCSIS 2009, ISSN 1947 5500, Impact Factor 0.423,
http://sites.google.com/site/ijcsis/
Derek Flood, Kevin Mc Daid, Fergal Mc Caffery
Spreadsheets are a ubiquitous software tool, used for a wide variety of tasks
such as financial modelling, statistical analysis and inventory management.
Extracting meaningful information from such data can be a difficult task,
especially for novice users unfamiliar with the advanced data processing
features of many spreadsheet applications. We believe that through the use of
Natural Language Processing (NLP) techniques this task can be made considerably
easier. This paper introduces NLP-SIR, a Natural language interface for
spreadsheet information retrieval. The results of a recent evaluation which
compared NLP-SIR with existing Information retrieval tools are also outlined.
This evaluation has shown that NLP-SIR is a more effective method of
spreadsheet information retrieval.
Authors' comments: 12 Pages, 2 Colour Figures, 3 Tables
Colum Foley, Alan F. Smeaton
Traditional Information Retrieval (IR) research has focussed on a single user
interaction modality, where a user searches to satisfy an information need.
Recent advances in web technologies and computer hardware have enabled multiple
users to collaborate on many computer-supported tasks, therefore there is an
increasing opportunity to support two or more users searching together at the
same time in order to satisfy a shared information need, which we refer to as
Synchronous Collaborative Information Retrieval (SCIR). SCIR systems represent
a significant paradigmatic shift from traditional IR systems. In order to
support effective SCIR, new techniques are required to coordinate users'
activities. In addition, the novel domain of SCIR presents challenges for
effective evaluations of these systems. In this paper we will propose an
effective and re-usable evaluation methodology based on simulating users
searching together. We will outline how we have used this evaluation in
empirical studies of the effects of different division of labour and sharing of
knowledge techniques for SCIR.
Authors' comments: Presented at 1st Intl Workshop on Collaborative Information Seeking,
2008 (arXiv:0908.0583)