CELPIP Speaking template: Describing a Scene

CELPIP Speaking Task 3 — Describing a Scene: 30 seconds prep, 60 seconds to speak. You describe what's happening in a picture in as much relevant detail as you can.

3 min read Spoken response Works for any prompt

What raters reward

Organized coverage of the image (Content/Coherence), precise nouns, position words and present-continuous verbs (Vocabulary, Readability), and steady delivery that fills the time (Listenability, Task Fulfillment).

Your time plan

Prep (30s) — Plan a path: overview → foreground (left to right) → background → mood. Note 4–5 specific things you'll name.
Speak (60s) — Overview (~8s) → foreground people/actions (~30s) → background/setting (~12s) → atmosphere or guess (~10s).

How to structure it

Fill the [slots] with your own ideas — adapt the frames, don't recite them.

1. Overview — One sentence on what the picture shows overall.
2. Foreground details — Describe the main people and actions, left to right.
3. Background / setting — Add what's behind or around the main action.
4. Atmosphere / guess — Comment on the mood or what might be happening.
1 Overview ~8s

Set the type of scene before any detail.

Grammar Present simple

Phrases to adapt
  • This picture shows [type of scene], probably at [place].
  • At first glance, it looks like [overall impression].
2 Foreground details ~30s

Use present continuous and position words; say what people are doing, wearing, holding.

Grammar Present continuous + prepositions of place

Phrases to adapt
  • In the foreground, [person] is [action].
  • On the left, [detail], while on the right, [detail].
3 Background / setting ~12s

Mention the place, weather, objects, or signs.

Grammar There is / there are + prepositions

Phrases to adapt
  • In the background, there is [detail].
  • Behind them, you can see [detail].
4 Atmosphere / guess ~10s

A small inference shows range.

Grammar Modals of deduction (must / might be)

Phrases to adapt
  • Everyone seems [mood], so they might be [guess].
  • The overall feeling is [adjective].

A worked model answer

A high-scoring sample that follows this shape — use it as a model, not a script.

This illustration portrays a multi-generational sanctuary of learning, where various individuals are immersed in literary and digital pursuits. In the foreground, a young student focuses intently on her laptop amidst stacks of reference materials, while an elderly gentleman nearby enjoys a traditional newspaper. The center of the room features a colorful rug where a group of small children gathers around an adult for storytime, highlighting a communal love for reading. Flanking the walls are floor-to-ceiling wooden bookshelves brimming with volumes, accessible by a tall library ladder. Students occupy the background, utilizing modern workstations and quiet study nooks. Sunlight streams through large windows, illuminating a space that seamlessly bridges the gap between classical printed media and contemporary technology, creating an atmosphere of quiet intellectual productivity and shared discovery.

Useful vocabulary

Vocabulary is one of the four scored dimensions — weave a few in (don't force all of them).

Position & place
in the foregroundin the backgroundon the left / rightin the middlenext tobehind
Actions (present continuous)
is chattingare gatheringis reaching forseems to be …-ingappears to be
Describing detail
brightly colouredcrowdedspaciousin casual clothesa row ofa pile of

Common mistakes to avoid

  • Listing random objects with no order.
  • Using only 'there is/are' instead of action verbs.
  • Running out of things to say at ~30s.
  • Describing what isn't there.

Quick tips

  • Go wide, then zoom in — overview before details.
  • Lean on position words: in front of, behind, next to, in the corner.
  • Use present continuous (is/are …-ing) throughout.
Put it into practice
Try Describing a Scene with this template
Start practice
Next template Making Predictions (S4)

All templates

View all
Writing
Speaking