Can type-token ratio be used to show morphological complexity of languages? K Kettunen Journal of Quantitative Linguistics 21 (3), 223-245, 2014 | 45 | 2014 |
To stem or lemmatize a highly inflectional language in a probabilistic IR environment? K Kettunen, T Kunttu, K Järvelin Journal of Documentation, 2005 | 44 | 2005 |
Exporting Finnish digitized historical newspaper contents for offline use T Pääkkönen, J Kervinen, A Nivala, K Kettunen, E Mäkelä D-Lib Magazine 22 (7/8), 2016 | 32 | 2016 |
Measuring Lexical Quality of a Historical Finnish Newspaper Collection―Analysis of Garbled OCR Data with Basic Language Technology Tools and Means K Kettunen, T Pääkkönen Proceedings of the Tenth International Conference on Language Resources and …, 2016 | 29 | 2016 |
Functional classification of records and organisational structure P Henttonen, K Kettunen Records Management Journal, 2011 | 27 | 2011 |
Is a morphologically complex language really that complex in full-text retrieval? K Kettunen, E Airio International Conference on Natural Language Processing (in Finland), 411-422, 2006 | 27 | 2006 |
Complexity of European Union Languages: A comparative approach∗ M Sadeniemi, K Kettunen, T Lindh-Knuutila, T Honkela Journal of Quantitative Linguistics 15 (2), 185-211, 2008 | 25 | 2008 |
Restricted inflectional form generation in management of morphological keyword variation K Kettunen, E Airio, K Järvelin Information Retrieval 10 (4-5), 415-444, 2007 | 25 | 2007 |
Analyzing and improving the quality of a historical news collection using language technology and statistical machine learning methods K Kettunen, T Honkela, K Lindén, P Kauppinen, T Pääkkönen, J Kervinen IFLA World Library and Information Congress Proceedings 80th IFLA General …, 2014 | 23 | 2014 |
Analysis of EU languages through text compression K Kettunen, M Sadeniemi, T Lindh-Knuutila, T Honkela International Conference on Natural Language Processing (in Finland), 99-109, 2006 | 22 | 2006 |
Improving optical character recognition of finnish historical newspapers with a combination of Fraktur & Antiqua models and image preprocessing M Koistinen, K Kettunen, T Pääkkönen Proceedings of the 21st Nordic Conference on Computational Linguistics, 277-283, 2017 | 20 | 2017 |
Information retrieval from historical newspaper collections in highly inflectional languages: A query expansion approach A Järvelin, H Keskustalo, E Sormunen, M Saastamoinen, K Kettunen Journal of the Association for Information Science and Technology 67 (12 …, 2016 | 20 | 2016 |
Old content and modern tools-searching named entities in a Finnish OCRed historical newspaper collection 1771-1910 K Kettunen, E Mäkelä, T Ruokolainen, J Kuokkala, L Löfberg arXiv preprint arXiv:1611.02839, 2016 | 19 | 2016 |
Reductive and generative approaches to management of morphological variation of keywords in monolingual information retrieval K Kettunen Journal of Documentation, 2009 | 18 | 2009 |
Missing in action? Content of records management metadata in real life K Kettunen, P Henttonen Library & information science research 32 (1), 43-52, 2010 | 14 | 2010 |
How to Improve Optical Character Recognition of Historical Finnish Newspapers Using Open Source Tesseract OCR Engine M Koistinen, K Kettunen, J Kervinen Proc. of LTC, 279-283, 2017 | 13 | 2017 |
Keep, change or delete? setting up a low resource ocr post-correction framework for a digitized old finnish newspaper collection K Kettunen Italian Research Conference on Digital Libraries, 95-103, 2015 | 13 | 2015 |
Reductive and generative approaches to morphological variation of keywords in monolingual information retrieval K Kettunen Tampere University Press, 2007 | 13 | 2007 |
Names, right or wrong: Named entities in an OCRed historical Finnish newspaper collection K Kettunen, T Ruokolainen Proceedings of the 2nd International Conference on Digital Access to Textual …, 2017 | 11 | 2017 |
Using Syllables As Indexing Terms in Full-Text Information Retrieval. K Kettunen, P McNamee, F Baskaya Baltic HLT, 225-232, 2010 | 11 | 2010 |