EEE 495 - Speech Communication

EEE 495 - Speech Communication

Section A: General Information

  • Course Title: Speech Communication

  • Type of Course: Optional, Theory

  • Offered to: EEE

  • Pre-requisite Course(s): None

Section B: Course Details

Course Content (As approved by the Academic Council)

Speech production and phonetics: articulatory and acoustic features; Speech analysis: formant, pitch, time and frequency domain analysis techniques, spectrogram; Speech coding: linear predictive coding, vocoders, vector quantization; Speech enhancement: spectral subtraction based techniques; Speech synthesis: formant synthesizers; Speech and speaker recognition: feature extraction and conventional recognition methods.

Course Objectives

  • To demonstrate fundamental concepts, algorithms, and applications of digital speech signal processing.

  • To enable students to apply digital signal processing theories to speech communication and application fields to provide a basis for the study of more advanced topics.

Knowledge required

Fundamental understanding of concepts of Digital Signal Processing course and Mathematics courses.

Course Outcomes

CO No. CO Statement Corresponding PO(s)* Domains and Taxonomy level(s)** Delivery Method(s) and Activity(-ies) Assessment Tool(s)
1 apply the digital signal processing principles to solve problems relevant to the time and frequency domain operations of speech signals PO(a) C3 Lectures, Discussions, Practice problem solving sessions Assignment, Class test, Final exam
2 analyse the speech signal processing techniques applied to real speech data based on the underlying principles PO(b) C4 Lectures, Discussions, Practice problem solving sessions

Assignment,

Presentation, Class test, Final exam

3 design efficient algorithms for speech feature extraction, different applications, i.e. speech recognition and speaker identification PO(c) C5, C6 Lectures, Discussions, Practice problem solving sessions

Assignment,
Class test,

Final exam

Cognitive Domain Taxonomy Levels: C1 – Knowledge, C2 – Comprehension, C3 – Application, C4 – Analysis, C5 – Synthesis, C6 – Evaluation, Affective Domain Taxonomy Levels: A1: Receive; A2: Respond; A3: Value (demonstrate); A4: Organize; A5: Characterize; Psychomotor Domain Taxonomy Levels: P1: Perception; P2: Set; P3: Guided Response; P4: Mechanism; P5: Complex Overt Response; P6: Adaptation; P7: Organization

Program Outcomes (PO): PO(a) Engineering Knowledge, PO(b) Problem Analysis, PO(c) Design/development Solution, PO(d) Investigation,
PO(e) Modern tool usage, PO(f) The Engineer and Society, PO(g) Environment and sustainability, PO(h) Ethics, PO(i) Individual work and team work,
PO(j). Communication, PO(k) Project management and finance, PO(l) Life-long Learning

* For details of program outcome (PO) statements, please see the departmental website or course curriculum

Mapping of Knowledge Profile, Complex Engineering Problem Solving and Complex Engineering Activities

K1 K2 K3 K4 K5 K6 K7 K8 P1 P2 P3 P4 P5 P6 P7 A1 A2 A3 A4 A5

Lecture Plan

Week Lectures Topic
1 1-3 Speech Production, excitation, voiced and unvoiced speech and silence
2 4-6 Phonetics: articulatory and acoustic features
3 7-9 Speech analysis: formant, pitch, time and frequency domain analysis techniques
4 10-12 Spectrogram analysis, non-linear scales (mel and bark scales)
5 13-15 Speech Modelling, Linear predictive coding, Speech coding
6 16-18 Vocoders, vector quantization for speech coding
7 19-21 Spectral subtraction and other speech enhancement techniques
8 20-24 Speech synthesis, Formant synthesizers
9 25-27 Speech feature extraction and applications
10 28-30 Speech Recognition
11 31-33 Speaker Identification
12 34-36 Practice Problems
13 37-39 Revision of the course materials

Assessment Strategy

  • Class participation will be judged by in-class evaluation; attendance will be recorded in every class.

  • Continuous assessment will be done in the form of quizzes, assignments, in-class evaluations.

  • Final Examination: A comprehensive term final examination will be held at the end of the Term following the guideline of academic Council.

Distribution of Marks

  • Class Participation 10%

  • Continuous Assessment 20%

  • Final Examination 70%

  • Total 100%

Textbook/References

O'Shaughnessy. Speech communications: Human and machine. 1999 (required)

Deller Jr, John R. "Discrete-time processing of speech signals." In Discrete-time processing of speech signals, pp. 908-908. 1993 (required)

N.B. Besides going through relevant topics of the textbook, it is strongly advised that the students follow the class Lectures and discussions regularly for a thorough understanding of the topics.

Back to Top