Recording yourself speak Thai: a 7-day exercise

7 min read

Published

The single highest-leverage drill in Thai isn't practice. It's recording yourself, listening back, and noticing the gap between what you said and what a native says.

Most learners skip this because it's mildly embarrassing. Listening to your own voice is unpleasant. Listening to your own bad Thai is worse. But the embarrassment is the signal — the moments you cringe are the moments your ear is finally hearing what your mouth produced.

This piece is the structured version. Seven days. Twenty minutes a day. By the end you'll have moved more on tones than two months of passive listening.

Why recording works when other drills don't

Three reasons most pronunciation drills fail:

You can't hear yourself the way others hear you. Real-time, your brain is busy producing the sound. It doesn't have spare cycles to evaluate the output. By the time you've decided whether your tone was right, the syllable is gone.

Native audio is too good to copy from. When you hear a perfect Thai word and try to mimic it, you'll convince yourself you got it. Your imitation feels right because the model was right. The mismatch only shows up when you put your version next to theirs.

Tones decay across utterances. You can hit a tone in isolation and lose it the moment you string two words together. Single-word drills don't catch this. Recorded sentences do.

Recording fixes all three. You hear yourself with the production cycles spent. You compare against the model directly. You catch tone-decay across syllables. Three problems, one drill.

The 7-day plan

Twenty minutes per session. Not optional — under 15 and you don't get enough reps; over 30 and you fatigue and stop noticing.

Equipment: anything that records audio. Phone voice memo is fine. AirPods + the iOS Voice Memos app is what most learners use. The recording quality doesn't matter — what matters is that you can play it back.

Day 1 — Five tones, alone

Pick the canonical five-tone trio: ขาว / ข้าว / ข่าว (white / rice / news, rising / falling / low) and ม้า / มา / หมา (horse / come / dog, high / mid / rising).

Record yourself saying each word three times. Six words × three reps = eighteen recordings.

Listen back to each. Compare to the native version (in the app or any tonal-Thai resource). Note which tones you got wrong, which you produced inconsistently across the three reps.

Most common day-1 finding: at least one tone is unstable across reps. You'll say khǎao correctly once, then drift to mid on the second rep, then over-correct to falling on the third. That's the pattern to fix.

Day 2 — Same tones, in sentences

Take the same five tones, but now in two-word combinations:

  • khǎao khâao (white rice)
  • khàao mâi rúu (I don't know the news — colloquial)
  • mǎa khǎao (white dog)
  • maa hǎa pǒm (come find me — male speaker)

Record three reps of each. Listen back. The tones you got right in isolation on day 1 will start decaying in connected speech. That's the gap day 2 surfaces — single-word tone production isn't the same skill as connected-speech tone production.

Day 3 — Common phrases, with native model

Pick five phrases from your practice deck. The ones you say most often: sa-wàt-dii khrap, khop khun khrap, arai na, gin khao, mâi pen rai.

Listen to the native version once, then record yourself. Don't try to imitate live; reproduce from memory.

Listen back-to-back: native, then yours, then native, then yours. Cycling between them is what builds the discrimination.

Day 3 is where most learners hear "I sound nothing like that" for the first time. Good. The gap is now visible. Until you can hear the gap you can't close it.

Day 4 — Long phrases (4+ syllables)

Pick three phrases of at least four syllables:

  • ráan aa-hǎan thai (Thai restaurant)
  • phǒm ja pai talàat (I'm going to the market)
  • khun aa-yú thâo-rài (how old are you)

Three reps each. Listen back, compare to native.

Long phrases catch a different problem: pacing. Native Thai has rhythm and stress patterns; English speakers tend to compress unstressed syllables too aggressively, or stretch them too evenly, neither of which sounds right. The mismatch is rarely about individual sounds — it's about the music.

Day 5 — Free speech, 60 seconds

Set a timer. Record yourself talking about something familiar — what you ate today, what you did this morning — in Thai for 60 seconds. Stumble. Restart. Don't worry about being good.

Listen back the next day, not immediately. The day-old version is easier to evaluate critically because the production memory has faded.

Day 5 is the hardest. You'll hear yourself trip over words you "know", lose tones you "have", forget vocabulary mid-sentence. That's the real diagnostic. The cleanest day-1 tone production doesn't survive contact with a 60-second monologue.

Day 6 — Re-record the day-5 monologue

Same prompt as day 5. Same length. Different recording. Listen to the day-5 version first, then record day 6.

Day 6 will be measurably better than day 5 — not because you've practised, but because day 5 surfaced specific weaknesses your brain has been quietly working on overnight.

Day 7 — Compare day 1 to day 7

Re-record the five-tone words from day 1. Listen back-to-back: day 1 vs day 7.

The improvement is usually obvious. Tones that wobbled across reps on day 1 are stable on day 7. Words that drifted toward English defaults sit better in the Thai contour space.

If there's no audible improvement, the diagnosis is mechanical: you skipped the listen-back step on some days. The recording itself doesn't help; the comparison does.

What to do after the seven days

Three options, in order of leverage:

Most-impact: keep recording, weekly. A 60-second monologue every Sunday, listening back the following Sunday before recording the next one. The week-old comparison reveals slow-burn problems that day-by-day comparison can't.

Second-most: targeted minimal-pair drills. When you find a tone you keep losing in connected speech, pull it back to single-word drills for a week. Then re-test in connected speech.

Lowest-leverage: more passive listening. Useful for vocabulary acquisition and natural cadence. Not what fixes tones. Don't substitute it for recording.

Two reasons. First, it doesn't feel like progress. You're not learning new words. You're not adding cards to your deck. Twenty minutes of recording-and-listening produces zero tangible vocabulary growth.

Second, it's uncomfortable. The honest gap between your imitation and a native version is easier to ignore than to face. Most learners prefer the gentle dopamine of streak counters and unlocked-words badges over the cringe of listening to themselves butcher khop-khun.

But the cringe is the signal. If you're not cringing, you're not improving. Skip the comfortable drills; do the awkward one. The week of recording will move your tones further than a month of passive practice.

The tones reference covers what each tone should sound like. The pronunciation mistakes piece covers the six specific errors English speakers default to. Together with this drill, that's the production-fluency cluster on ThaiDai.

Drill the tones, hear the difference.

Every word in the deck plays in a native-trained Thai voice. The minimal-pair drills (มา / ม้า / หมา) train your ear directly. Free tier covers the foundational set — get started in two minutes.

Open the deck →

Read next