Word spotting in offline multilingual handwritten documents based on hidden markov models

Venu Govindaraju; Safwan Wshah
{'id': 'https://openalex.org/W2740943300', 'doi': None, 'title': 'Word spotting in offline multilingual handwritten documents based on hidden markov models', 'display_name': 'Word spotting in offline multilingual handwritten documents based on hidden markov models', 'publication_year': 2012, 'publication_date': '2012-01-01', 'ids': {'openalex': 'https://openalex.org/W2740943300', 'mag': '2740943300'}, 'language': 'en', 'primary_location': {'is_oa': False, 'landing_page_url': 'http://dl.acm.org/citation.cfm?id=2520212', 'pdf_url': None, 'source': {'id': 'https://openalex.org/S4306508160', 'display_name': 'Dissertations & Theses @ SUNY Buffalo,ProQuest Dissertations & Theses Global', 'issn_l': None, 'issn': None, 'is_oa': False, 'is_in_doaj': False, 'is_core': False, 'host_organization': None, 'host_organization_name': None, 'host_organization_lineage': [], 'host_organization_lineage_names': [], 'type': 'repository'}, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}, 'type': 'dissertation', 'type_crossref': 'dissertation', 'indexed_in': [], 'open_access': {'is_oa': False, 'oa_status': 'closed', 'oa_url': None, 'any_repository_has_fulltext': False}, 'authorships': [{'author_position': 'first', 'author': {'id': 'https://openalex.org/A5020354604', 'display_name': 'Venu Govindaraju', 'orcid': 'https://orcid.org/0000-0002-5318-7409'}, 'institutions': [{'id': 'https://openalex.org/I63190737', 'display_name': 'University at Buffalo, State University of New York', 'ror': 'https://ror.org/01y64my43', 'country_code': 'US', 'type': 'education', 'lineage': ['https://openalex.org/I63190737']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'Venu Govindaraju', 'raw_affiliation_strings': ['State University of New York at Buffalo.'], 'affiliations': [{'raw_affiliation_string': 'State University of New York at Buffalo.', 'institution_ids': ['https://openalex.org/I63190737']}]}, {'author_position': 'last', 'author': {'id': 'https://openalex.org/A5001816279', 'display_name': 'Safwan Wshah', 'orcid': 'https://orcid.org/0000-0001-5051-7719'}, 'institutions': [{'id': 'https://openalex.org/I63190737', 'display_name': 'University at Buffalo, State University of New York', 'ror': 'https://ror.org/01y64my43', 'country_code': 'US', 'type': 'education', 'lineage': ['https://openalex.org/I63190737']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'Safwan Wshah', 'raw_affiliation_strings': ['State University of New York at Buffalo.'], 'affiliations': [{'raw_affiliation_string': 'State University of New York at Buffalo.', 'institution_ids': ['https://openalex.org/I63190737']}]}], 'institution_assertions': [], 'countries_distinct_count': 1, 'institutions_distinct_count': 1, 'corresponding_author_ids': [], 'corresponding_institution_ids': [], 'apc_list': None, 'apc_paid': None, 'fwci': None, 'has_fulltext': False, 'cited_by_count': 0, 'citation_normalized_percentile': {'value': 0.0, 'is_in_top_1_percent': False, 'is_in_top_10_percent': False}, 'cited_by_percentile_year': {'min': 0, 'max': 65}, 'biblio': {'volume': None, 'issue': None, 'first_page': None, 'last_page': None}, 'is_retracted': False, 'is_paratext': False, 'primary_topic': {'id': 'https://openalex.org/T10601', 'display_name': 'Handwriting Recognition and Text Detection', 'score': 0.9998, 'subfield': {'id': 'https://openalex.org/subfields/1707', 'display_name': 'Computer Vision and Pattern Recognition'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, 'topics': [{'id': 'https://openalex.org/T10601', 'display_name': 'Handwriting Recognition and Text Detection', 'score': 0.9998, 'subfield': {'id': 'https://openalex.org/subfields/1707', 'display_name': 'Computer Vision and Pattern Recognition'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T11550', 'display_name': 'Multi-label Text Classification in Machine Learning', 'score': 0.9901, 'subfield': {'id': 'https://openalex.org/subfields/1702', 'display_name': 'Artificial Intelligence'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T12707', 'display_name': 'Automatic License Plate Recognition System', 'score': 0.9855, 'subfield': {'id': 'https://openalex.org/subfields/2214', 'display_name': 'Media Technology'}, 'field': {'id': 'https://openalex.org/fields/22', 'display_name': 'Engineering'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}], 'keywords': [{'id': 'https://openalex.org/keywords/devanagari', 'display_name': 'Devanagari', 'score': 0.7340772}, {'id': 'https://openalex.org/keywords/keyword-spotting', 'display_name': 'Keyword spotting', 'score': 0.62340015}, {'id': 'https://openalex.org/keywords/handwriting-recognition', 'display_name': 'Handwriting Recognition', 'score': 0.600155}, {'id': 'https://openalex.org/keywords/text-detection', 'display_name': 'Text Detection', 'score': 0.557455}, {'id': 'https://openalex.org/keywords/spotting', 'display_name': 'Spotting', 'score': 0.53993285}, {'id': 'https://openalex.org/keywords/document-image-analysis', 'display_name': 'Document Image Analysis', 'score': 0.532992}, {'id': 'https://openalex.org/keywords/multi-label-learning', 'display_name': 'Multi-label Learning', 'score': 0.524205}, {'id': 'https://openalex.org/keywords/text-localization', 'display_name': 'Text Localization', 'score': 0.523171}, {'id': 'https://openalex.org/keywords/optical-character-recognition', 'display_name': 'Optical character recognition', 'score': 0.44480005}, {'id': 'https://openalex.org/keywords/text-segmentation', 'display_name': 'Text segmentation', 'score': 0.4373067}], 'concepts': [{'id': 'https://openalex.org/C41008148', 'wikidata': 'https://www.wikidata.org/wiki/Q21198', 'display_name': 'Computer science', 'level': 0, 'score': 0.8694736}, {'id': 'https://openalex.org/C2780144916', 'wikidata': 'https://www.wikidata.org/wiki/Q38592', 'display_name': 'Devanagari', 'level': 4, 'score': 0.7340772}, {'id': 'https://openalex.org/C154945302', 'wikidata': 'https://www.wikidata.org/wiki/Q11660', 'display_name': 'Artificial intelligence', 'level': 1, 'score': 0.68090224}, {'id': 'https://openalex.org/C204321447', 'wikidata': 'https://www.wikidata.org/wiki/Q30642', 'display_name': 'Natural language processing', 'level': 1, 'score': 0.6800664}, {'id': 'https://openalex.org/C2781213101', 'wikidata': 'https://www.wikidata.org/wiki/Q6398558', 'display_name': 'Keyword spotting', 'level': 2, 'score': 0.62340015}, {'id': 'https://openalex.org/C61423126', 'wikidata': 'https://www.wikidata.org/wiki/Q187432', 'display_name': 'Scripting language', 'level': 2, 'score': 0.5886825}, {'id': 'https://openalex.org/C23224414', 'wikidata': 'https://www.wikidata.org/wiki/Q176769', 'display_name': 'Hidden Markov model', 'level': 2, 'score': 0.5470834}, {'id': 'https://openalex.org/C2779506182', 'wikidata': 'https://www.wikidata.org/wiki/Q7580141', 'display_name': 'Spotting', 'level': 2, 'score': 0.53993285}, {'id': 'https://openalex.org/C90805587', 'wikidata': 'https://www.wikidata.org/wiki/Q10944557', 'display_name': 'Word (group theory)', 'level': 2, 'score': 0.48447466}, {'id': 'https://openalex.org/C48044578', 'wikidata': 'https://www.wikidata.org/wiki/Q727490', 'display_name': 'Scalability', 'level': 2, 'score': 0.47856095}, {'id': 'https://openalex.org/C75165309', 'wikidata': 'https://www.wikidata.org/wiki/Q2258979', 'display_name': 'Search engine indexing', 'level': 2, 'score': 0.45640054}, {'id': 'https://openalex.org/C546480517', 'wikidata': 'https://www.wikidata.org/wiki/Q167555', 'display_name': 'Optical character recognition', 'level': 3, 'score': 0.44480005}, {'id': 'https://openalex.org/C98501671', 'wikidata': 'https://www.wikidata.org/wiki/Q1948408', 'display_name': 'Text segmentation', 'level': 3, 'score': 0.4373067}, {'id': 'https://openalex.org/C137293760', 'wikidata': 'https://www.wikidata.org/wiki/Q3621696', 'display_name': 'Language model', 'level': 2, 'score': 0.4360066}, {'id': 'https://openalex.org/C34736171', 'wikidata': 'https://www.wikidata.org/wiki/Q918333', 'display_name': 'Preprocessor', 'level': 2, 'score': 0.41758874}, {'id': 'https://openalex.org/C89600930', 'wikidata': 'https://www.wikidata.org/wiki/Q1423946', 'display_name': 'Segmentation', 'level': 2, 'score': 0.40300965}, {'id': 'https://openalex.org/C28490314', 'wikidata': 'https://www.wikidata.org/wiki/Q189436', 'display_name': 'Speech recognition', 'level': 1, 'score': 0.38114917}, {'id': 'https://openalex.org/C77088390', 'wikidata': 'https://www.wikidata.org/wiki/Q8513', 'display_name': 'Database', 'level': 1, 'score': 0.12359354}, {'id': 'https://openalex.org/C41895202', 'wikidata': 'https://www.wikidata.org/wiki/Q8162', 'display_name': 'Linguistics', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C138885662', 'wikidata': 'https://www.wikidata.org/wiki/Q5891', 'display_name': 'Philosophy', 'level': 0, 'score': 0.0}, {'id': 'https://openalex.org/C2987247673', 'wikidata': 'https://www.wikidata.org/wiki/Q167555', 'display_name': 'Character recognition', 'level': 3, 'score': 0.0}, {'id': 'https://openalex.org/C115961682', 'wikidata': 'https://www.wikidata.org/wiki/Q860623', 'display_name': 'Image (mathematics)', 'level': 2, 'score': 0.0}, {'id': 'https://openalex.org/C111919701', 'wikidata': 'https://www.wikidata.org/wiki/Q9135', 'display_name': 'Operating system', 'level': 1, 'score': 0.0}], 'mesh': [], 'locations_count': 1, 'locations': [{'is_oa': False, 'landing_page_url': 'http://dl.acm.org/citation.cfm?id=2520212', 'pdf_url': None, 'source': {'id': 'https://openalex.org/S4306508160', 'display_name': 'Dissertations & Theses @ SUNY Buffalo,ProQuest Dissertations & Theses Global', 'issn_l': None, 'issn': None, 'is_oa': False, 'is_in_doaj': False, 'is_core': False, 'host_organization': None, 'host_organization_name': None, 'host_organization_lineage': [], 'host_organization_lineage_names': [], 'type': 'repository'}, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}], 'best_oa_location': None, 'sustainable_development_goals': [{'score': 0.79, 'id': 'https://metadata.un.org/sdg/4', 'display_name': 'Quality education'}], 'grants': [], 'datasets': [], 'versions': [], 'referenced_works_count': 0, 'referenced_works': [], 'related_works': ['https://openalex.org/W3200405029', 'https://openalex.org/W3043281709', 'https://openalex.org/W3003632081', 'https://openalex.org/W2997853693', 'https://openalex.org/W2967470787', 'https://openalex.org/W2780898422', 'https://openalex.org/W2604619303', 'https://openalex.org/W2118661780', 'https://openalex.org/W2099772385', 'https://openalex.org/W2096837246', 'https://openalex.org/W2062510230', 'https://openalex.org/W2041534828', 'https://openalex.org/W2007482361', 'https://openalex.org/W1994899004', 'https://openalex.org/W1989851945', 'https://openalex.org/W1969663682', 'https://openalex.org/W1907579547', 'https://openalex.org/W143548931', 'https://openalex.org/W124332212', 'https://openalex.org/W119661166'], 'abstract_inverted_index': {'Recognition': [0], 'of': [1, 50, 91, 116, 189, 212, 218, 240, 252, 269, 294], 'unconstrained': [2], 'handwritten': [3, 304], 'documents': [4, 338], 'continues': [5], 'to': [6, 13, 28, 41, 84, 196, 318], 'be': [7, 194], 'a': [8, 88, 135, 156], 'challenging': [9], 'task': [10], 'primarily': [11], 'due': [12], 'the': [14, 26, 77, 108, 114, 128, 169, 205, 216, 250, 253, 289, 292, 295, 307], 'vast': [15], 'variability': [16], 'in': [17, 76, 87, 204, 321, 334, 337], 'writing': [18], 'styles': [19], 'and': [20, 48, 62, 80, 119, 149, 225, 227, 237, 245, 280, 311, 344], 'applications': [21], 'which': [22], 'do': [23], 'not': [24], 'offer': [25], 'means': [27], 'constrain': [29], 'large': [30, 165], 'vocabularies.': [31], 'Word': [32, 53], 'spotting': [33, 54, 161, 260, 300], 'has': [34, 125, 248, 263], 'often': [35], 'been': [36, 82, 99, 264], 'proposed': [37, 100], 'as': [38, 101, 234, 273], 'an': [39, 102], 'alternative': [40], 'full': [42], 'transcription': [43], 'for': [44, 138, 145, 159, 171, 178, 221, 256, 275, 278, 282, 291, 302], 'keyword': [45, 199], 'based': [46, 61, 64, 68, 96, 110, 184, 258, 298], 'retrieval': [47], 'indexing': [49], 'document': [51], 'images.': [52], 'techniques': [55], 'are': [56], 'mainly': [57], 'categorized': [58], 'into': [59], 'template': [60, 67, 109], 'learning': [63, 95, 297], 'approaches.': [65], 'The': [66, 208, 324], 'approaches': [69, 97, 315], 'require': [70], 'at': [71], 'least': [72], 'one': [73, 342], 'query': [74, 147], 'image': [75], 'training': [78, 206, 224], 'set': [79], 'have': [81, 98, 113], 'found': [83], 'usually': [85], 'result': [86], 'high': [89], 'number': [90], 'false': [92], 'positives.': [93], 'Recently,': [94], 'alternative.': [103], 'While': [104], 'they': [105, 112], 'regularly': [106], 'outperform': [107], 'approaches,': [111], 'shortcomings': [115], 'being': [117], 'inefficient': [118], 'non-scalable': [120], 'across': [121, 131], 'scripts.': [122, 323], 'Previous': [123], 'work': [124], 'dealt': [126], 'with': [127, 142, 164], 'non-scalability': [129], 'issue': [130], 'scripts': [132, 232], 'by': [133], 'considering': [134], 'separate': [136], 'system': [137, 301], 'each': [139, 179], 'script/language': [140], 'independently': [141], 'independent': [143], 'modules': [144], 'preprocessing,': [146], 'representation': [148], 'word': [150, 160, 174, 259, 299], 'segmentation.\r\nIn': [151], 'this': [152], 'dissertation,': [153], 'we': [154], 'describe': [155], 'new': [157], 'methodology': [158, 239], 'that': [162, 192], 'deals': [163], 'background': [166, 246], 'vocabularies': [167], 'without': [168], 'need': [170], 'separately': [172], 'built': [173], 'or': [175, 284], 'character': [176, 242], 'segmentation': [177], 'script.': [180], 'Our': [181], 'approach': [182, 214, 330], 'is': [183, 215], 'on': [185, 266], 'Hidden': [186], 'Markov': [187], 'Models': [188], 'trained': [190], 'characters': [191], 'can': [193], 'used': [195, 317], 'simulate': [197], 'any': [198], 'query,': [200], 'even': [201], 'those': [202], 'unseen': [203], 'corpus.': [207], 'main': [209], 'contributing': [210], 'idea': [211], 'our': [213], 'utilization': [217], 'script-independent': [219], 'methods': [220, 255], 'feature': [222], 'extraction,': [223], 'recognition': [226], 'their': [228], 'scalability': [229], 'over': [230], 'multiple': [231], 'such': [233, 272], 'English,': [235, 276], 'Arabic,': [236, 279], 'Devanagari.\r\nOur': [238], 'combining': [241], 'filler': [243], 'models': [244, 247], 'outperformed': [249], 'state': [251], 'art': [254], 'line': [257], 'system.': [261], 'It': [262, 287], 'evaluated': [265], 'public': [267], 'datasets': [268], 'different': [270], 'languages': [271], 'IAM': [274], 'AMA': [277], 'LAW': [281], 'Devanagari': [283], 'Indian': [285], 'languages.': [286], 'lays': [288], 'foundation': [290], 'development': [293], 'first': [296], 'multilingual': [303, 322], 'documents.': [305], 'Both': [306], "'script": [308, 312], 'identifier': [309, 313], "based'": [310], "free'": [314], 'were': [316], 'spot': [319], 'keywords': [320, 336], "'initial": [325], 'script': [326, 343], "identifier'": [327], 'switch': [328], 'free': [329], 'showed': [331], 'higher': [332], 'accuracy': [333], 'detecting': [335], 'containing': [339], 'more': [340], 'than': [341], 'also': [345], 'better': [346], 'performance': [347], 'overall.': [348]}, 'cited_by_api_url': 'https://api.openalex.org/works?filter=cites:W2740943300', 'counts_by_year': [], 'updated_date': '2024-09-15T16:03:51.886317', 'created_date': '2017-08-08'}
Publication Information

Basic Information

Access and Citation

AI Researcher Chatbot

Primary Location

Authors

Topics

Keywords

Related Works