benty-fields - Search paper

With the advent of the Internet, a new era of digital information exchange has begun. Currently, the Internet encompasses more than five billion online sites and this number is exponentially increasing every day. Fundamentally, Information Retrieval (IR) is the science and practice of storing documents and retrieving information from within these documents. Mathematically, IR systems are at the core based on a feature vector model coupled with a term weighting scheme that weights terms in a document according to their significance with respect to the context in which they appear. Practically, Vector Space Model (VSM), Term Frequency (TF), and Inverse Term Frequency (IDF) are among other long-established techniques employed in mainstream IR systems. However, present IR models only target generic-type text documents, in that, they do not consider specific formats of files such as HTML web documents. This paper proposes a new semantic-sensitive web information retrieval model for HTML documents. It consists of a vector model called SWVM and a weighting scheme called BTF-IDF, particularly designed to support the indexing and retrieval of HTML web documents. The chief advantage of the proposed model is that it assigns extra weights for terms that appear in certain pre-specified HTML tags that are correlated to the semantics of the document. Additionally, the model is semantic-sensitive as it generates synonyms for every term being indexed and later weights them appropriately to increase the likelihood of retrieving documents with similar context but different vocabulary terms. Experiments conducted, revealed a momentous enhancement in the precision of web IR systems and a radical increase in the number of relevant documents being retrieved. As further research, the proposed model is to be upgraded so as to support the indexing and retrieval of web images in multimedia-rich web documents.
Authors' comments: LACSC - Lebanese Association for Computational Sciences, http://www.lacsc.org/; European Journal of Scientific Research, Vol. 69, No. 4, February 2012

Vote

Add to Library

Recommend

7590. Information Retrieval Systems Adapted to the Biomedical Domain

Mónica Marrero, Sonia Sánchez-Cuadrado, Julián Urbano, Jorge Morato, José-Antonio Moreiro

El Profesional de la Informacion, 19, 246-254 (2010)

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1203.6845v1

Vote

Add to Library

Recommend

7591. Kernel Density Feature Points Estimator for Content-Based Image Retrieval

Tranos Zuva, Oludayo O. Olugbara, Sunday O. Ojo, Seleman M. Ngwira

Signal & Image Processing: An International Journal (SIPIJ), Vol.4 No 1, February 2012, Pages: 103-111

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1203.5078v1

Vote

Add to Library

Recommend

7592. An Accurate Arabic Root-Based Lemmatizer for Information Retrieval Purposes

Tarek El-Shishtawy, Fatma El-Ghannam

IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 1, No 3, January 2012 ISSN (Online): 1694-0814

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1203.3584v1

Vote

Add to Library

Recommend

7593. Categories of Emotion names in Web retrieved texts

Sergey Petrov, Jose F. Fontanari, Leonid I. Perlovsky

IJPBS, 2, 173-184 (2012)

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1203.2293v1

Vote

Add to Library

Recommend

7594. Designing and using prior knowledge for phase retrieval

Eliyahu Osherovich, Michael Zibulevsky, Irad Yavneh

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1203.0879v1

Vote

Add to Library

Recommend

7595. An evaluation of local shape descriptors for 3D shape retrieval

Sarah Tang, Afzal Godil

Three-Dimensional Image Processing (3DIP) and Applications II (2012)

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1202.2368v1

Vote

Add to Library

Recommend

7596. A Markov Random Field Topic Space Model for Document Retrieval

Scott Hand

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1111.6640v1

Vote

Add to Library

Recommend

7597. 3D Model Retrieval Based on Semantic and Shape Indexes

My Abdellah Kassimi, Omar El beqqali

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1111.6387v1

Vote

Add to Library

Recommend

7598. Compressive Phase Retrieval From Squared Output Measurements Via Semidefinite Programming

Henrik Ohlsson, Allen Y. Yang, Roy Dong, S. Shankar Sastry

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1111.6323v3

Vote

Add to Library

Recommend

7599. Practical Top-K Document Retrieval in Reduced Space

Gonzalo Navarro, Daniel Valenzuela

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1111.4395v1

Vote

Add to Library

Recommend

7600. Perfectly Balanced Allocation With Estimated Average Using Expected Constant Retries

Sourav Dutta, Souvik Bhattacherjee, Ankur Narang

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1111.0801v3

Vote

Add to Library

Recommend

Benty-search

7581. A Generic Framework for Efficient and Effective Subsequence Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1208.0286v1

7582. Semantic Information Retrieval Using Ontology In University Domain

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1207.5745v1

7583. Content Based Multimedia Information Retrieval to Support Digital Libraries

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1207.4259v1

7584. Information Retrieval Model: A Social Network Extraction Perspective

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1207.3583v1

7585. Optimal Storage and Retrieval of Single-Photon Waveforms

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1207.2670v1

7586. Information Retrieval in Intelligent Systems: Current Scenario & Issues

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1206.3667v1

7587. Improving Retrieval Results with discipline-specific Query Expansion

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1206.2126v1

7588. Feature Weighting for Improving Document Image Retrieval System Performance

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1206.1291v1

7589. Semantic-Sensitive Web Information Retrieval Model for HTML Documents

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1204.0186v1

7590. Information Retrieval Systems Adapted to the Biomedical Domain

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1203.6845v1

7591. Kernel Density Feature Points Estimator for Content-Based Image Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1203.5078v1

7592. An Accurate Arabic Root-Based Lemmatizer for Information Retrieval Purposes

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1203.3584v1

7593. Categories of Emotion names in Web retrieved texts

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1203.2293v1

7594. Designing and using prior knowledge for phase retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1203.0879v1

7595. An evaluation of local shape descriptors for 3D shape retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1202.2368v1

7596. A Markov Random Field Topic Space Model for Document Retrieval

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1111.6640v1

7597. 3D Model Retrieval Based on Semantic and Shape Indexes

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1111.6387v1

7598. Compressive Phase Retrieval From Squared Output Measurements Via Semidefinite Programming

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1111.6323v3

7599. Practical Top-K Document Retrieval in Reduced Space

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1111.4395v1

7600. Perfectly Balanced Allocation With Estimated Average Using Expected Constant Retries

Show abstract | Show figures | Show BibTeX | Show discussion 0 | View PDF | 1111.0801v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1208.0286v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1207.5745v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1207.4259v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1207.3583v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1207.2670v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1206.3667v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1206.2126v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1206.1291v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1204.0186v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1203.6845v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1203.5078v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1203.3584v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1203.2293v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1203.0879v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1202.2368v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1111.6640v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1111.6387v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1111.6323v3

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1111.4395v1

Show abstract | Show figures | Show BibTeX | Show discussion | View PDF | 1111.0801v3