Kimmo Kettunen
Kimmo Kettunen
University of Eastern Finland, Joensuu
Verifierad e-postadress på welho.com
Titel
Citeras av
Citeras av
År
Can type-token ratio be used to show morphological complexity of languages?
K Kettunen
Journal of Quantitative Linguistics 21 (3), 223-245, 2014
452014
To stem or lemmatize a highly inflectional language in a probabilistic IR environment?
K Kettunen, T Kunttu, K Järvelin
Journal of Documentation, 2005
442005
Exporting Finnish digitized historical newspaper contents for offline use
T Pääkkönen, J Kervinen, A Nivala, K Kettunen, E Mäkelä
D-Lib Magazine 22 (7/8), 2016
322016
Measuring Lexical Quality of a Historical Finnish Newspaper Collection―Analysis of Garbled OCR Data with Basic Language Technology Tools and Means
K Kettunen, T Pääkkönen
Proceedings of the Tenth International Conference on Language Resources and …, 2016
292016
Functional classification of records and organisational structure
P Henttonen, K Kettunen
Records Management Journal, 2011
272011
Is a morphologically complex language really that complex in full-text retrieval?
K Kettunen, E Airio
International Conference on Natural Language Processing (in Finland), 411-422, 2006
272006
Complexity of European Union Languages: A comparative approach∗
M Sadeniemi, K Kettunen, T Lindh-Knuutila, T Honkela
Journal of Quantitative Linguistics 15 (2), 185-211, 2008
252008
Restricted inflectional form generation in management of morphological keyword variation
K Kettunen, E Airio, K Järvelin
Information Retrieval 10 (4-5), 415-444, 2007
252007
Analyzing and improving the quality of a historical news collection using language technology and statistical machine learning methods
K Kettunen, T Honkela, K Lindén, P Kauppinen, T Pääkkönen, J Kervinen
IFLA World Library and Information Congress Proceedings 80th IFLA General …, 2014
232014
Analysis of EU languages through text compression
K Kettunen, M Sadeniemi, T Lindh-Knuutila, T Honkela
International Conference on Natural Language Processing (in Finland), 99-109, 2006
222006
Improving optical character recognition of finnish historical newspapers with a combination of Fraktur & Antiqua models and image preprocessing
M Koistinen, K Kettunen, T Pääkkönen
Proceedings of the 21st Nordic Conference on Computational Linguistics, 277-283, 2017
202017
Information retrieval from historical newspaper collections in highly inflectional languages: A query expansion approach
A Järvelin, H Keskustalo, E Sormunen, M Saastamoinen, K Kettunen
Journal of the Association for Information Science and Technology 67 (12 …, 2016
202016
Old content and modern tools-searching named entities in a Finnish OCRed historical newspaper collection 1771-1910
K Kettunen, E Mäkelä, T Ruokolainen, J Kuokkala, L Löfberg
arXiv preprint arXiv:1611.02839, 2016
192016
Reductive and generative approaches to management of morphological variation of keywords in monolingual information retrieval
K Kettunen
Journal of Documentation, 2009
182009
Missing in action? Content of records management metadata in real life
K Kettunen, P Henttonen
Library & information science research 32 (1), 43-52, 2010
142010
How to Improve Optical Character Recognition of Historical Finnish Newspapers Using Open Source Tesseract OCR Engine
M Koistinen, K Kettunen, J Kervinen
Proc. of LTC, 279-283, 2017
132017
Keep, change or delete? setting up a low resource ocr post-correction framework for a digitized old finnish newspaper collection
K Kettunen
Italian Research Conference on Digital Libraries, 95-103, 2015
132015
Reductive and generative approaches to morphological variation of keywords in monolingual information retrieval
K Kettunen
Tampere University Press, 2007
132007
Names, right or wrong: Named entities in an OCRed historical Finnish newspaper collection
K Kettunen, T Ruokolainen
Proceedings of the 2nd International Conference on Digital Access to Textual …, 2017
112017
Using Syllables As Indexing Terms in Full-Text Information Retrieval.
K Kettunen, P McNamee, F Baskaya
Baltic HLT, 225-232, 2010
112010
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20