Course Title: Speech Communication
Type of Course: Optional, Theory
Offered to: EEE
Pre-requisite Course(s): None
Speech production and phonetics: articulatory and acoustic features; Speech analysis: formant, pitch, time and frequency domain analysis techniques, spectrogram; Speech coding: linear predictive coding, vocoders, vector quantization; Speech enhancement: spectral subtraction based techniques; Speech synthesis: formant synthesizers; Speech and speaker recognition: feature extraction and conventional recognition methods.
To demonstrate fundamental concepts, algorithms, and applications of digital speech signal processing.
To enable students to apply digital signal processing theories to speech communication and application fields to provide a basis for the study of more advanced topics.
Fundamental understanding of concepts of Digital Signal Processing course and Mathematics courses.
CO No. | CO Statement | Corresponding PO(s)* | Domains and Taxonomy level(s)** | Delivery Method(s) and Activity(-ies) | Assessment Tool(s) |
---|---|---|---|---|---|
1 | apply the digital signal processing principles to solve problems relevant to the time and frequency domain operations of speech signals | PO(a) | C3 | Lectures, Discussions, Practice problem solving sessions | Assignment, Class test, Final exam |
2 | analyse the speech signal processing techniques applied to real speech data based on the underlying principles | PO(b) | C4 | Lectures, Discussions, Practice problem solving sessions | Assignment, Presentation, Class test, Final exam |
3 | design efficient algorithms for speech feature extraction, different applications, i.e. speech recognition and speaker identification | PO(c) | C5, C6 | Lectures, Discussions, Practice problem solving sessions | Assignment, Final exam |
Cognitive Domain Taxonomy Levels: C1 – Knowledge, C2 – Comprehension, C3 – Application, C4 – Analysis, C5 – Synthesis, C6 – Evaluation, Affective Domain Taxonomy Levels: A1: Receive; A2: Respond; A3: Value (demonstrate); A4: Organize; A5: Characterize; Psychomotor Domain Taxonomy Levels: P1: Perception; P2: Set; P3: Guided Response; P4: Mechanism; P5: Complex Overt Response; P6: Adaptation; P7: Organization
Program Outcomes (PO): PO(a) Engineering Knowledge, PO(b) Problem Analysis, PO(c) Design/development Solution, PO(d) Investigation,
PO(e) Modern tool usage, PO(f) The Engineer and Society, PO(g) Environment and sustainability, PO(h) Ethics, PO(i) Individual work and team work,
PO(j). Communication, PO(k) Project management and finance, PO(l) Life-long Learning
* For details of program outcome (PO) statements, please see the departmental website or course curriculum
K1 | K2 | K3 | K4 | K5 | K6 | K7 | K8 | P1 | P2 | P3 | P4 | P5 | P6 | P7 | A1 | A2 | A3 | A4 | A5 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Week | Lectures | Topic |
---|---|---|
1 | 1-3 | Speech Production, excitation, voiced and unvoiced speech and silence |
2 | 4-6 | Phonetics: articulatory and acoustic features |
3 | 7-9 | Speech analysis: formant, pitch, time and frequency domain analysis techniques |
4 | 10-12 | Spectrogram analysis, non-linear scales (mel and bark scales) |
5 | 13-15 | Speech Modelling, Linear predictive coding, Speech coding |
6 | 16-18 | Vocoders, vector quantization for speech coding |
7 | 19-21 | Spectral subtraction and other speech enhancement techniques |
8 | 20-24 | Speech synthesis, Formant synthesizers |
9 | 25-27 | Speech feature extraction and applications |
10 | 28-30 | Speech Recognition |
11 | 31-33 | Speaker Identification |
12 | 34-36 | Practice Problems |
13 | 37-39 | Revision of the course materials |
Class participation will be judged by in-class evaluation; attendance will be recorded in every class.
Continuous assessment will be done in the form of quizzes, assignments, in-class evaluations.
Final Examination: A comprehensive term final examination will be held at the end of the Term following the guideline of academic Council.
Class Participation 10%
Continuous Assessment 20%
Final Examination 70%
Total 100%
O'Shaughnessy. Speech communications: Human and machine. 1999 (required)
Deller Jr, John R. "Discrete-time processing of speech signals." In Discrete-time processing of speech signals, pp. 908-908. 1993 (required)
N.B. Besides going through relevant topics of the textbook, it is strongly advised that the students follow the class Lectures and discussions regularly for a thorough understanding of the topics.
+88-02-5515548
Copyright © Department of EEE, BUET. All Rights Reserved
Built with Grav CMS by Dr. Sajid Muhaimin Choudhury | Webmaster: Mr. Rajon Chandra Gope, ATO