benty-fields - Search paper

Retrieving text from natural scene images and video frames is challenging because of inherent problems such as complex character shapes, low resolution, and background noise. Available OCR systems often fail to retrieve such information from scene images and video frames. Keyword spotting, an alternative way to retrieve information, performs efficient text searching in such scenarios. However, current word spotting techniques for scene/video images are script-specific and have mainly been developed for Latin script. This paper presents a novel word spotting framework using dynamic shape coding for text retrieval in natural scene images and video frames. The framework is designed to search for a query keyword across multiple scripts with the help of on-the-fly script-wise keyword generation for the corresponding script. We use a two-stage word spotting approach based on Hidden Markov Models (HMMs) to detect the translated keyword in a given text line after identifying the script of the line. A novel unsupervised scheme based on dynamic shape coding groups similarly shaped characters to avoid confusion and to improve text alignment. Next, the hypothesized locations are verified to improve retrieval performance. To evaluate the proposed system for keyword search in natural scene images and video frames, we considered two popular Indic scripts, Bangla (Bengali) and Devanagari, along with English. Inspired by the zone-wise recognition approach in Indic scripts[1], zone-wise text information is used to improve traditional word spotting performance in Indic scripts. For our experiments, a dataset consisting of scene images and video frames in English, Bangla and Devanagari scripts was considered. The results show the effectiveness of the proposed word spotting approach.
Authors' comments: Multimedia Tools and Applications, Springer

7708. Retrieving Young Cloudy L-Dwarfs: A Nearby Planetary-Mass Companion BD+60 1417B and Its Isolated Red Twin W0047

Caprice L. Phillips, Jacqueline K. Faherty, Ben Burningham, Johanna M. Vos, Eileen Gonzales, Emily J. Griffith, Sherelyn Alejandro Merchan, Emily Calamari et al.

arXiv:2407.01694v1

7709. High-precision atmospheric characterization of a Y dwarf with JWST NIRSpec G395H spectroscopy: isotopologue, C/O ratio, metallicity, and the abundances of six molecular species

Ben W. P. Lew, Thomas Roellig, Natasha E. Batalha, Michael Line, Thomas Greene, Sagnick Murkherjee, Richard Freedman, Michael Meyer et al.

arXiv:2402.05900v1

7710. MMRAG-RFT: Two-stage Reinforcement Fine-tuning for Explainable Multi-modal Retrieval-augmented Generation

Shengwei Zhao, Jingwen Yao, Sitong Wei, Linhai Xu, Yuying Liu, Dong Zhang, Zhiqiang Tian, Shaoyi Du

arXiv:2512.17194v1

7711. DIRC-RAG: Accelerating Edge RAG with Robust High-Density and High-Loading-Bandwidth Digital In-ReRAM Computation

Kunming Shao, Zhipeng Liao, Jiangnan Yu, Liang Zhao, Qiwei Li, Xijie Huang, Jingyu He, Fengshi Tian et al.

arXiv:2510.25278v1

7712. Pre-train a Discriminative Text Encoder for Dense Retrieval via Contrastive Span Prediction

Xinyu Ma, Jiafeng Guo, Ruqing Zhang, Yixing Fan, Xueqi Cheng

Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2022)

arXiv:2204.10641v1

7713. LOCORE: Image Re-ranking with Long-Context Sequence Modeling

Zilin Xiao, Pavel Suma, Ayush Sachdeva, Hao-Jen Wang, Giorgos Kordopatis-Zilos, Giorgos Tolias, Vicente Ordonez

arXiv:2503.21772v1

7714. Learning Optimal Tree Models Under Beam Search

Jingwei Zhuo, Ziru Xu, Wei Dai, Han Zhu, Han Li, Jian Xu, Kun Gai

arXiv:2006.15408v1

7715. Enhancing Large Language Models with Retrieval Augmented Generation for Software Testing and Inspection Automation

Zoe Fingleton, Nazanin Siavash, Armin Moin

arXiv:2604.15270v1

7716. UrbanClipAtlas: A Visual Analytics Framework for Event and Scene Retrieval in Urban Videos

Joel Perca, Luis Sante, Juanpablo Heredia, Joao Rulff, Claudio Silva, Jorge Poco

arXiv:2604.15225v1

7717. Learning Where to Embed: Noise-Aware Positional Embedding for Query Retrieval in Small-Object Detection

Yangchen Zeng, Zhenyu Yu, Dongming Jiang, Wenbo Zhang, Yifan Hong, Zhanhua Hu, Jiao Luo, Kangning Cui

arXiv:2604.15065v1

7718. RaTA-Tool: Retrieval-based Tool Selection with Multimodal Large Language Models

Gabriele Mattioli, Evelyn Turri, Sara Sarto, Lorenzo Baraldi, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara

arXiv:2604.14951v1

7719. SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval

Xin Xie, Dongyun Xue, Wuguannan Yao, Mingxiao Feng, Wengang Zhou, Xiang Qi, Houqiang Li, Peng Zhang

arXiv:2604.14712v1

7720. Retrieve, Then Classify: Corpus-Grounded Automation of Clinical Value Set Authoring

Sumit Mukherjee, Juan Shu, Nairwita Mazumder, Tate Kernell, Celena Wheeler, Shannon Hastings, Chris Sidey-Gibbons

arXiv:2604.14616v1

7701. Retrieval dynamics of neural networks for sparsely coded sequential patterns

arXiv:cond-mat/9805135v2

7702. Noun-Phrase Analysis in Unrestricted Text for Information Retrieval

arXiv:cmp-lg/9605019v1

7703. Retrieval Phase Diagrams of Non-monotonic Hopfield Networks

arXiv:cond-mat/9604065v2

7704. A Network of Oscillators for Retrieving Phase Information

arXiv:adap-org/9408001v2

7705. Statistical versus symbolic parsing for captioned-information retrieval

arXiv:cmp-lg/9408008v1

7706. Context-Augmented Code Generation Using Programming Knowledge Graphs

arXiv:2410.18251v2

7707. Word Searching in Scene Image and Video Frame in Multi-Script Scenario using Dynamic Shape Coding

arXiv:1708.05529v6
