KH Coder
KH Coder is an open source software for computer assisted qualitative data analysis, particularly quantitative content analysis and text mining. It can be also used for computational linguistics. It supports processing and etymological information of text in several languages, such as Japanese, English, French, German, Italian, Portuguese and Spanish. Specifically, it can contribute factual examination co-event system hub structure, computerized arranging guide, multidimensional scaling and comparative calculations.[1]
Developer(s) | Koichi Higuchi |
---|---|
Stable release | 2.00f
/ Dec 2015 |
Preview release | 3.Beta.01
/ Mar 2020 |
Repository | |
Operating system | Microsoft Windows, Linux, macOS |
Type | Qualitative data analysis, Text mining, Content analysis |
License | GPL2 license |
Website | khcoder |
It is well received by researchers worldwide and used in a large number of disciplines, including neuroscience, sociology, psychology, public health, media studies, education research and computer science. There are more than 500 English research papers listed in Google scholar.[2] More than 3500 academic research papers were published that use KH Coder according to a list compiled by the author.[3]
KH Coder has been reviewed as a user friendly tool "for identifying themes in large unstructured data sets, such as online reviews or open-ended customer feedback"[4] and has been reviewed in comparison to WordStat.[5]
Features
Its features include:
- on word-level: Searching, KWIC concordance, collocation statistics, and correspondence analysis.
- on category-level: Development of categories or dictionaries, cross tabulation, and correspondence analysis.
- on word- and category-level: Frequency lists, multi-dimensional scaling, co-occurrence network, and hierarchical cluster analysis.
- on document-level: Searching, clustering, and Naive Bayes classifier
KH Coder allows for further search and statistical analysis functions using back-end tools such as Stanford POS Tagger, the natural language processing toolkit FreeLing, Snowball stemmer, MySQL and R.
Alternatives
- qdap (Windows, Linux, macOS) for quantitative analysis of qualitative transcripts and natural language processing.
References
- S. N. Vinithra, S.N; Arun Selvan, S.J.; Anand Kumar, M.; Soman, K.P. (2015): Simulated and Self-Sustained Classification of Twitter Data based on its Sentiment. Indian Journal of Science and Technology. Vol. 8, Issue 24
- Google Scholar search using Keywords "KH Coder" and "KHCoder"
- Higuchi, Koichi (2017): Scholarly research using KH Coder
- Towler, Will (2014): Text Analytics For Everyone. UX Magazine, July 31, 2014.
- Huirong, Cheng;Guobin, Huang; Lin, Zheng (2015): Comparison of Software for Unstructured Text Analysis:KH Coder vs. Wordstat. 图书与情报, 2015(04): 110-117.