Difference between revisions of "Projects:2017s1-150 Statistical Natural Language Processing"
(Created page with "This project aims to investigate syntactic methods for classifying English language documents. Background research in Natural Language Processing (NLP) will be undertaken, and...") |
|||
Line 1: | Line 1: | ||
+ | |||
+ | '''Description''' | ||
+ | ---- | ||
+ | |||
This project aims to investigate syntactic methods for classifying English language documents. Background research in Natural Language Processing (NLP) will be undertaken, and candidate methods suitable for document classification will be selected. On-line resources such as large corpuses of documents labelled according to various categories will be used to test these methods for classifying unlabelled documents. The performance of the selected methods will be assessed using statistical techniques. The project also aims to develop new ideas to improve the performance of selected methods. | This project aims to investigate syntactic methods for classifying English language documents. Background research in Natural Language Processing (NLP) will be undertaken, and candidate methods suitable for document classification will be selected. On-line resources such as large corpuses of documents labelled according to various categories will be used to test these methods for classifying unlabelled documents. The performance of the selected methods will be assessed using statistical techniques. The project also aims to develop new ideas to improve the performance of selected methods. | ||
+ | |||
+ | '''Group members''' | ||
+ | |||
+ | ---- | ||
+ | Zheng Li | ||
+ | Runsheng Liu | ||
+ | |||
+ | '''Supervisor''' | ||
+ | |||
+ | ---- | ||
+ | Prof Lang White |
Revision as of 13:03, 15 September 2017
Description
This project aims to investigate syntactic methods for classifying English language documents. Background research in Natural Language Processing (NLP) will be undertaken, and candidate methods suitable for document classification will be selected. On-line resources such as large corpuses of documents labelled according to various categories will be used to test these methods for classifying unlabelled documents. The performance of the selected methods will be assessed using statistical techniques. The project also aims to develop new ideas to improve the performance of selected methods.
Group members
Zheng Li Runsheng Liu
Supervisor
Prof Lang White