What is the CELPIP?
The Canadian English Language Proficiency Index Program (CELPIP) is a general English language proficiency test.
The CELPIP Test allows test takers to demonstrate their ability to function in English. The test clearly, accurately, and precisely assesses a test taker’s English abilities in a variety of everyday situations, such as communicating with co-workers and superiors in the workplace, interacting with friends, understanding newscasts, and interpreting and responding to written materials.
Why Webberz
We offer top-quality CELPIP coaching, both online and in-person. Our experienced trainers have deep expertise, helping numerous candidates prepare for the CELPIP test and achieve their target scores. Committed to excellence, we provide multiple CELPIP mock tests to familiarize our clients with the real exam environment.
What does the CLB level mean?
CLB 12 – Advanced proficiency in workplace and community contexts
CLB 11 – Advanced proficiency in workplace and community contexts
CLB 10 – Highly effective proficiency in workplace and community contexts
CLB 9 – Effective proficiency in workplace and community contexts
CLB 8 – Good proficiency in workplace and community contexts
CLB 7 – Adequate proficiency in workplace and community contexts
CLB 6 – Developing proficiency in workplace and community contexts
CLB 5 – Acquiring proficiency in workplace and community contexts
CLB 4 – Adequate proficiency for daily life activities
CLB 3 – Some proficiency in limited contexts
CLB 0,1,2 (M) – Minimal proficiency or insufficient information to assess
Not Administered: test taker did not receive this test component
CELPIP Listening Score
CELPIP LEVEL | LISTENING SCORE /38 |
---|---|
10-12 | 35-38 |
9 | 33-35 |
8 | 30-33 |
7 | 27-31 |
6 | 22-28 |
5 | 17-23 |
4 | 11-18 |
3 | 7-12 |
M | 0-7 |
Why is the score given in a range?
Each question is categorized into a level of difficulty. The more difficult questions will give you more ‘points’. If you answered more difficult questions correctly than another test-taker, even though the total correct answers are the same, you may be assigned a higher level.
I scored 28 in Listening. Is my level 6 or 7?
The answer is it depends, but you won’t be able to know until you get your test result. It depends on which questions you answered correctly. Difficult questions carry more weight. That’s why you may end up with a higher level than others with the same number of correct answers.
CELPIP Reading Score
CELPIP LEVEL | READING SCORE /38 |
---|---|
10-12 | 33-38 |
9 | 31-33 |
8 | 28-31 |
7 | 24-28 |
6 | 19-25 |
5 | 15-20 |
4 | 10-16 |
3 | 8-11 |
M | 0-7 |
Why is the score given in a range?
Each question is categorized into a level of difficulty. The more difficult questions will give you more ‘points’. If you answered more difficult questions correctly than another test-taker, even though the total correct answers are the same, you may be assigned a higher level.
I scored 31 in Reading. Is my level 8 or 9?
The answer is it depends, but you won’t be able to know until you get your test result. It depends on which questions you answered correctly. Difficult questions carry more weight. That’s why you may end up with a higher level than others with the same number of correct answers.
CELPIP Writing Score

CELPIP Speaking Score

How is the score determined?
Each test taker’s performance, i.e. a test taker’s responses to all tasks in the component, is assessed by multiple raters. Each CELPIP speaking performance is rated by a minimum of three speaking raters, and each CELPIP writing performance is rated by a minimum of four writing raters. Raters work independently of one another, and have no knowledge of the ratings assigned by other raters.
Rating criteria
The rating dimensions that have been developed for the writing and speaking component are listed above on this page in the Performance Standards section:
Speaking: Content/Coherence, Vocabulary, Listenability, and Task Fulfillment
Writing: Content/Coherence, Vocabulary, Readability, and Task Fulfillment
Each dimension is divided into five performance levels. Performance descriptors are provided for each level in each dimension. Raters assign a level in each dimension by identifying tangible evidence in the test taker’s performance that matches the descriptors in the rating scale.
Benchmarking
When the ratings of a test taker’s performance are complete, they are inspected for agreement. If the ratings are in disagreement, a benchmark rater is automatically assigned to assess the performance. All benchmark raters are experienced raters who have demonstrated consistent accuracy and reliability in rating. Benchmark raters have no knowledge of the initial ratings.
How is the final score determined?
The Speaking and Writing component scores are derived from the dimensional ratings assigned by the raters. These scores are then transformed into a CELPIP level. The transformation rules have been established by English language experts who participated in a standard-setting exercise. Standard setting is an extensive, research-based process. Language experts work with testing professionals to identify what language learners need to be able to do at each performance level, such as CLB 8. The experts then analyze the test in detail and determine what level of performance a test taker needs to demonstrate for each CELPIP level. This process has established a defensible link between each Speaking and Writing component score and its corresponding CELPIP level.
Why did I get more than 38 questions in listening/reading?
New items are constantly being written. Before they can be used as scored items, they are pre-tested to ensure that they are equivalent in quality to existing items. Paragon includes some new items in every test. These items look the same as the scored items but they are not used to calculate your score. Paragon does not tell the test taker which questions will be unscored because it is important that test takers try their best for every item. This ensures that the data collected on the new items can be used to evaluate their quality. Only questions that have performed well will be used as scored items in the future.