|
Language of Instruction
|
English
|
|
Level of Course Unit
|
Doctorate's Degree
|
|
Department / Program
|
ELECTRICAL AND COMPUTER ENGINEERING
|
|
Type of Program
|
Formal Education
|
|
Type of Course Unit
|
Elective
|
|
Course Delivery Method
|
Face To Face
|
|
Objectives of the Course
|
(1) Recall the basics of probability concepts (2) Learn the fundamental notions of data & text mining (3) Discuss essential NLP approaches for free-text data structures (4) Apply machine learning algorithms for text mining problems
|
|
Course Content
|
Intro. To Data & Text Mining, Why Text Mining, Issues and difficulties in Text Mining Basic text processing commands in Unix like operating systems & regular expressions Recap basic probabilities, n-gram language models, perplexity, and smoothing techniques n-gram model interpolation and backoff, Naïve Bayes algorithm for text classification Introduction to named entity recognition, information extraction Conditional vs generative models, maximum entropy models for named entity recognition Part-of-speech tagging using maxent models, rel. extraction (supervised, distant supervision) Intro. to parsing, PCFGs, CNF, CKY algorithm & issues with PCFGs, Lexicalized PCFG Dependency parsing, arc-eager parser, Malt parser, relation extraction through dependency structure Lexical semantics, synonymy/homonymy/polysemy, word sense disambiguation Word similarity, term-document matrices, tf-idf weighting, vector space model Intro. to open-source text mining libraries (NLTK, spaCy -in python), building a model for prediction models
|
|
Course Methods and Techniques
|
- In-person course instruction, - Additional asynchronous resource sharing, - Development of a course project,
|
|
Prerequisites and co-requisities
|
None
|
|
Course Coordinator
|
Asist Prof.Dr. Mehmet Gökhan Bakal gokhan.bakal@agu.edu.tr
|
|
Name of Lecturers
|
Asist Prof.Dr. MEHMET GÖKHAN BAKAL gokhan.bakal@agu.edu.tr
|
|
Assistants
|
None
|
|
Work Placement(s)
|
No
|
Recommended or Required Reading
|
Resources
|
Relevant Video Resources
|
|
Course Notes
|
- Shared topic-related slides and other articles. - SPEECH and LANGUAGE PROCESSING, An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition Second Edition by Daniel Jurafsky and James H. Martin.
|
|
Documents
|
https://web.stanford.edu/~jurafsky/slp3/ed3book.pdf
|
|
Assignments
|
https://drive.google.com/drive/u/0/folders/1wMx0x2M5I18jelH3UtCRpUmbvlLTigXQ
|
Course Category
|
Mathematics and Basic Sciences
|
%30
|
|
|
Engineering
|
%60
|
|
|
Engineering Design
|
%0
|
|
|
Social Sciences
|
%0
|
|
|
Education
|
%0
|
|
|
Science
|
%10
|
|
|
Health
|
%0
|
|
|
Field
|
%0
|
|
|