🌐

🗣️

🗣️ Pronunciation

A Pronunciation Lesson — Framework for Any Sound, Stress, or Intonation Feature

Pronunciation is not about perfection — it is about intelligibility. The goal is to be understood, not to sound like a native speaker.

45–60 minutes 6 stages 13 min read pronunciation lesson-framework elt skills

What this framework is about

Pronunciation is the area where most teachers feel least confident — and most students receive least teaching. The result is that students may have excellent grammar and vocabulary but fail to communicate because they cannot be understood, or cannot understand others. Pronunciation teaching does not require specialist equipment, recordings, or language laboratories. It requires the teacher's voice, the students' mouths, a blackboard, and a clear framework. This framework applies to any pronunciation feature: a single difficult sound, word stress, sentence stress, connected speech, or intonation.

Core principle

The principle that changes everything: awareness before production

Students cannot produce sounds they cannot hear. The most common error in pronunciation teaching is asking students to repeat before they have learned to distinguish. A student who cannot yet hear the difference between /p/ and /b/ at the end of a word will not produce the difference reliably, however many times they repeat. The framework must therefore begin with listening discrimination — training the ear before demanding production from the mouth.

The second principle is equally important: intelligibility over native-speaker accuracy. English is used internationally as a lingua franca — the majority of English communication in the world takes place between non-native speakers. Many features of native-speaker pronunciation (certain vowel reductions, specific consonant sounds, rapid connected speech) are not necessary for international intelligibility and may even reduce it. Focus pronunciation teaching on the features that most affect whether students are understood: consonant contrasts, word stress, nuclear (sentence) stress, and rhythm. These are the high-value targets. A student who controls these features will be understood by almost any listener.

Finally: pronunciation is a physical skill. Like learning to ride a bicycle, understanding the rule is not enough — the mouth, tongue, lips, and throat must practise the movement repeatedly until it becomes automatic. This means drilling is not old-fashioned; it is essential. The question is not whether to drill, but how to make drilling meaningful rather than mechanical.

The stages

Context and motivation

Why this feature matters — in real communication

5–7 min

Why this stage exists

Students who do not see why a pronunciation feature matters will not invest in practising it. This stage creates the motivation by demonstrating a genuine communication problem the feature causes — or a communication advantage it produces.

The teacher does

Begin with a real example of the communication problem. Options:

• Minimal pair confusion: say two words that students commonly confuse ('ship' / 'sheep', 'live' / 'leave', 'tree' / 'three'). Ask students which word you said. Show how the confusion causes real communication breakdowns. Give an example sentence where the wrong word would cause misunderstanding.
• Word stress error: say a word with the stress in the wrong place. Ask students what word you said. Show how stress placement affects whether a listener can process the word at all.
• Sentence stress: read a sentence with neutral stress, then with appropriate nuclear stress. Ask: which is more natural? What does the stress change communicate?

Do not teach the rule yet. Just establish: this feature matters for real communication.

Students do

Listen. Respond to the teacher's examples — which word, which sounds natural. Begin to notice the feature.

🌿 Zero-resource version

The teacher's voice is the only resource. No recordings, no equipment. Say the words and sentences yourself.

⚠ Most common mistake

Skipping this stage and going straight to drilling. Students who do not understand why they are practising a sound quickly become bored and stop trying. Two minutes of genuine motivation is worth ten minutes of uninvested drilling.

Listening discrimination

Training the ear before training the mouth

8–10 min

Why this stage exists

Students must be able to hear a distinction reliably before they can produce it. This stage trains perceptual discrimination — the ability to notice the difference between sounds, stress patterns, or intonation shapes that students currently confuse or do not notice.

The teacher does

Design tasks that require students to listen and distinguish — not produce.

For a sound contrast (e.g. /p/ vs /b/):
• Say pairs of words: 'cap / cab', 'rip / rib', 'cup / cub'. Students hold up one finger for the first sound, two fingers for the second. Or students write 1 or 2 in their notebooks.
• Say one word from a minimal pair. Students write which word they heard.

For word stress:
• Say words with different stress patterns. Students mark the stress: 'PHOtograph / phoTOgrapher / photoGRAPHic'. They mark which syllable is strongest.
• Is the stress the same or different? Teacher says two versions — students judge.

For sentence stress / intonation:
• Teacher says the same sentence two ways. Students decide which sounds like a question, a statement, or a suggestion.

Check answers after each short sequence. Clarify any disagreements by repeating the contrast.

Students do

Listen carefully. Signal what they hear (finger count, written number, thumbs up/down). Compare with a partner after each sequence.

🌿 Zero-resource version

The teacher's voice is the audio source. All listening activities done without equipment. Students respond with finger signals or write single digits in notebooks.

⚠ Most common mistake

Moving to production too quickly. If students cannot discriminate reliably in the listening stage, drilling will not produce accurate output. Spend as long as needed on discrimination before moving to production — even if that means reducing the length of the practice stages.

Focused explanation — form and articulation

What the mouth does, and how to do it

5–7 min

Why this stage exists

Students need to understand how the target feature is physically produced — where the tongue goes, how the lips are positioned, whether the sound is voiced or unvoiced, where the stress falls, what the intonation shape looks like. This is the 'how to' stage that makes production possible.

The teacher does

Explain the physical production of the target feature clearly and concisely.

For consonant sounds:
• Describe tongue position: 'For /θ/, the tongue tip goes between the teeth. Touch your top teeth with your tongue.'
• Describe voicing: 'For /s/ the voice box is not vibrating. For /z/ it is. Put your fingers on your throat and feel the difference.'
• Use visual demonstration: exaggerate the mouth shape so students can see.

For word stress:
• Mark the stress visually on the board: use large and small circles (● ○ ○ for STUdent), capital letters (STUdent), or underline the stressed syllable.
• Show how the stressed syllable is longer, louder, and higher in pitch than the unstressed ones.
• Demonstrate with exaggeration, then at natural pace.

For sentence stress and intonation:
• Use lines or arrows on the board to show intonation shape: a rising line for a question, a falling line for a statement.
• Mark the stressed words: 'I DIDN'T steal the money.' vs. 'I didn't STEAL the money.'
• Show how stress changes meaning.

Keep this stage short and physical. Students should be moving their mouths during the explanation, not copying extensive notes.

Students do

Follow the articulation instructions physically. Look at the teacher's mouth. Move their own mouths. Ask questions about how to form the sound or stress.

🌿 Zero-resource version

Everything is demonstrated by the teacher. No handouts, no IPA charts needed — though a simple mouth diagram on the board can help for difficult consonants.

⚠ Most common mistake

Over-explaining. Two or three clear physical instructions are more effective than a long linguistic explanation. Students need to move their mouths, not fill their notebooks.

Controlled drilling

Repetition with purpose — accuracy before fluency

8–10 min

Why this stage exists

Pronunciation is a physical skill. Accurate production requires repetition — the articulatory habit must be built through practice. Controlled drilling provides that repetition in a structured, error-correctable environment. The key is that drilling is meaningful: students are practising sounds in words, and words in sentences, not isolated phonemes in a vacuum.

The teacher does

Drill in this sequence:
1. Individual sounds → words → phrases → full sentences. Build up gradually.
2. Model → repeat. Always model before asking students to produce. Never ask 'how do you say it?' before you have said it yourself.
3. Choral drill first: whole class repeats together. This reduces individual anxiety and allows students to hear themselves in the group. Correct the group's overall output.
4. Then pairs: one student says, the other responds or evaluates. More individual practice, less teacher-led.
5. Then selected individuals: listen carefully, give precise positive and corrective feedback.

Drilling techniques:
• Choral repetition: teacher models → class repeats
• Back-chaining: build a phrase backwards from the end ('exam / the exam / pass the exam / hope to pass the exam / I hope to pass the exam') — this is especially effective for stress and rhythm
• Substitution: same sentence, students swap one word for another ('She went to the MARKET / office / hospital')
• Minimal pair contrast: alternate between the two sounds rapidly to build discrimination alongside production

Correct pronunciation errors immediately and precisely: not 'not quite right' but 'your tongue is touching the top of your mouth — move it down and put it between your teeth'.

Students do

Repeat chorally, then in pairs, then individually. Focus on the specific feature being drilled. Self-monitor and adjust.

🌿 Zero-resource version

The teacher's voice is the model. No recordings needed. Write words on the board with stress marks or phonetic guides as needed.

⚠ Most common mistake

Drilling without feedback. Students who repeat incorrectly and receive no correction practice the wrong form. Listen carefully during drilling and stop to correct — whole-class and individual errors both need addressing.

Communicative practice

Using the target feature in real communication

10–12 min

Why this stage exists

Controlled drilling builds the physical habit. Communicative practice embeds it — requiring students to use the target feature while they are simultaneously focused on communicating a meaning. This is where the pronunciation feature becomes part of real language use rather than a separate skill practised in isolation.

The teacher does

Design a communicative task that naturally generates the target feature. Students should be focused on completing the task, not on the pronunciation — but the task is designed so they cannot complete it without producing the target sounds, stresses, or intonation patterns.

For a sound contrast:
• A minimal pair communication task: Student A has a list of words/sentences; Student B has to write what they hear. If the sound is wrong, the message is wrong.
• A role play that contains the target words naturally.

For word stress:
• Students give short presentations using vocabulary with the target stress patterns.
• Students describe something (a person, a place) using words with the studied stress pattern.

For intonation:
• Students have short conversations in which the intonation pattern is required — disagreeing politely (rising-falling intonation), listing (rising on each item, falling on the last), questioning.

Listening and note-taking during this stage: note any persistent errors for feedback at the end.

Students do

Complete the communicative task. Monitor their own pronunciation of the target feature while communicating. Help each other after the task.

🌿 Zero-resource version

All communicative tasks described above require only the teacher's instructions and students' voices. No materials needed.

⚠ Most common mistake

Abandoning pronunciation focus during the communicative stage. The teacher's feedback after the task should return to the pronunciation feature: 'I noticed most people said it correctly, but a few were still putting the stress on the second syllable — let's do two more repetitions of those words.'

Noticing in the wild

Connecting the feature to the English students encounter outside class

5 min optional

Why this stage exists

A single lesson cannot build a pronunciation habit. Students need to notice the target feature in the English they encounter outside class — in songs, in films, in other people's speech, in the teacher's own talk. This brief closing stage builds that noticing habit.

The teacher does

Give students one specific listening task for outside the lesson:
• 'This week, when you hear English spoken — on the radio, in a film, from me — try to notice whether people produce this feature the same way we practised it. Notice if they do something different.'
• 'Listen to one English song this week. Find three words with the stress pattern we studied today.'
• 'Notice when you say these words in English over the next week. Try to self-correct.'

Follow up briefly at the start of the next lesson: 'Did anyone notice the feature we practised? What did you find?'

Students do

Commit to the noticing task. Report back next lesson.

🌿 Zero-resource version

No materials needed. The task is entirely observational.

⚠ Most common mistake

Never connecting pronunciation to the language students encounter outside class. Pronunciation taught only inside the classroom does not transfer. The noticing habit — once built — significantly accelerates acquisition.

🌿 Complete zero-resource version

A complete pronunciation lesson using nothing except the teacher's voice and a blackboard.

Teacher says two words that cause real communication problems. Students raise hands or fingers to show which they heard. Brief discussion: why does this matter? (5 min)
Teacher says 10 minimal pairs or stress contrasts. Students write 1 or 2 in notebooks to show which they heard. Check together. Repeat any that caused confusion. (8 min)
Teacher explains the physical production: tongue position, lip shape, voicing, stress pattern. Demonstrates with exaggeration. Students imitate. Teacher adjusts physically where needed: 'your tongue is too far back'. (5 min)
Choral drill: teacher models → class repeats. 5 words, then 5 phrases, then 3 full sentences. Then pairs. Then 5 individuals — precise corrective feedback. (10 min)
Communication task: students complete a 3-minute pair task requiring the target sounds or stress (information gap, description, minimal pair message). Teacher listens and notes. Brief whole-class feedback on persistent errors. (10 min)
Noticing task set for outside class. Teacher models the feature one more time at natural pace. (3 min)

Total: 41 min. No materials of any kind needed.

Variations and adaptations

For a single difficult consonant

Focus the entire lesson on one minimal pair contrast. Spend more time in the discrimination and drilling stages. In the communicative stage, use an information gap where the two words carry different meanings — the communication succeeds only if the sound is correct.

For word stress across a word family

Teach the stress pattern of a complete word family together: PHOtograph / phoTOgrapher / photoGRAPHic. Show students the rule: the suffix determines the stress. This gives students a productive tool for predicting stress in new words, not just memorising individual items.

For connected speech features

Focus on features of natural connected speech: linking (an apple → /ənæpəl/), elision (next day → /nekˈdeɪ/), assimilation (ten boys → /tembɔɪz/). The motivation stage is especially important here: students need to understand that connected speech is not lazy speech — it is how English actually works. The discrimination task focuses on hearing connected speech accurately before students are expected to produce it.

Integrating pronunciation into other lesson types

Pronunciation does not need its own standalone lesson. A 10-minute pronunciation focus can be built into any vocabulary lesson (stress of new words), any grammar lesson (weak forms of auxiliaries), or any reading lesson (pronunciation of text vocabulary). The same sequence applies: listen first, then explain, then drill, then use.

Frequently asked questions

How can I model sounds I find difficult myself?

Two strategies. First: you do not need to produce the sound perfectly yourself — you need to be able to describe and demonstrate its physical formation. A teacher who says 'the tongue goes between the teeth — like this' and places their own tongue between their teeth is giving students useful information even if their own /θ/ is not perfect. Second: use student speakers who produce the feature well as models. A student from within the class who produces the target sound accurately is an excellent peer model — and the social dynamics of peer modelling are often more motivating than teacher modelling.

Should I correct pronunciation errors in every lesson?

No — but you should address pronunciation consistently. In fluency-focused stages (speaking tasks, discussions, free writing), do not interrupt to correct pronunciation. Note errors and address them at the end. In pronunciation-focused drilling stages, correct immediately and precisely. The principle: pronunciation correction belongs in accuracy stages, not fluency stages. A regular 10-minute pronunciation slot built into weekly lessons is more effective than infrequent standalone pronunciation lessons.

Which pronunciation features should I prioritise?

Focus on features that affect intelligibility most. Research on English as a Lingua Franca identifies these as the highest priority: (1) most consonant contrasts — especially those where your students' L1 does not have the target sound, (2) word stress — unstressed syllables in English are significantly reduced, which confuses listeners when the stress is placed on the wrong syllable, (3) nuclear (sentence) stress — which word carries the main emphasis, (4) vowel length contrasts where they affect meaning. Less urgent for international intelligibility: exact vowel quality, the /θ/ and /ð/ sounds, and features of connected speech.

Is it wrong to have a foreign accent?

No. An accent is not an error — it is an identity. The goal of pronunciation teaching is not to eliminate students' accents but to develop their intelligibility: the ability to be understood clearly by a range of listeners in a range of situations. A student with a strong regional accent who speaks clearly, with correct word stress and clear consonant contrasts, is a more effective communicator than a student whose accent is 'neutral' but whose speech is mumbled, monotone, or incorrectly stressed.

What this framework is not

This is not a complete phonetics course. It does not cover the full IPA chart, the phonology of every variety of English, or the linguistics of accent and dialect in depth. It is a classroom framework — a structure for teaching any pronunciation feature effectively, regardless of the teacher's phonetics background. The feature you choose to teach, the words you select for drilling, and the communicative tasks you design are all yours. What this framework provides is the pedagogical sequence that makes any pronunciation teaching more effective.

All lesson frameworks