• Text mining: information extraction, term recognition, named entity recognition, sentiment analysis, text classification, information retrieval, language resources
  • Knowledge representation: development, application & standardisation of ontologies
  • Machine learning: feature engineering, case-based reasoning, naive Bayesian learning, support vector machines, genetic algorithms, genetic programming
  • Information management: data modelling, data mining, relational and XML databases, user interface development
  • Application areas: healthcare, life sciences, social sciences & social media


  • CMT207: Information modelling and database systems (postgraduate)
  • NOTE: All course resources are available through Learning Central.


  • Anastazia Žunić (PhD funded by Vice-Chancellor's International Scholarship for Research Excellence, 2017-present): natural language processing, sentiment analysis, deep learning
  • Vigneshwaran Muralidaran (PhD, funded by CorCenCC, 2017-present): natural language processing, corpus linguistics
  • Dr Steven Neale (PDRA, 2016-present): natural language processing, corpus linguistics, crowdsourcing
  • David Owen (RA/PhD, 2016-present): text mining, ontologies, health informatics
  • Lowri Williams (PhD, funded by EPSRC Doctoral Training Partnership, 2013-2017, submitted): text mining, sentiment analysis, language resources
  • David Rogers (RA/PhD, 2012-present): text mining, sentiment analysis, social media
  • Bathilde Ambroise (PhD, 2012-2016, submitted): text mining, genomics, bioinformatics


  • Dr Bo Zhao (PhD, 2011-2015): text mining, ontologies, health informatics
  • Dr Christian Bannister (PhD, funded by MRC Doctoral Training Grant, 2011-2015): machine learning, health informatics, epidemiology
  • Dr Mark Greenwood (PhD, funded by Cardiff University President's Research Scholarship, 2010-2014): text mining, health informatics, social media