Lesson 10: Advanced Word Study
Using Corpus Tools & Dictionary Features
To achieve a truly deep understanding of vocabulary, advanced learners can go beyond standard definitions and use the same tools as linguists and dictionary-makers. This lesson introduces two powerful resources for in-depth word study: linguistic corpora and the advanced features of online dictionaries. Using these tools will allow you to analyze how words are used in the real world with incredible precision.
Key Vocabulary
-
Corpus
A large and structured set of texts (now usually electronically stored and processed) used for language research.
-
Etymology
The study of the origin of words and the way in which their meanings have changed throughout history.
-
Concordancer
A software program that shows every occurrence of a word in a corpus, presented in the context of the words surrounding it.
1. The Power of a Corpus
A corpus is a huge, searchable collection of real-world texts (books, articles, speeches, etc.). It doesn't give you definitions; instead, it shows you hundreds or thousands of real examples of a word in use. This helps you discover:
- The most common collocations (word partners).
- The word's typical grammatical patterns.
- Whether the word is more common in formal, academic, or informal contexts.
A key feature of a corpus is a concordancer, which shows you every instance of your word in the middle of a line, so you can easily see the words that typically appear before and after it.
Mini-Tutorial: `inherent` vs. `intrinsic`
These synonyms are confusing. A dictionary might define both as "essential" or "natural." Let's see how a corpus helps.
- You search for `inherent` in a corpus tool (like SkELL or Sketch Engine). You see many results like: "...the risks inherent in the strategy," or "...a problem inherent in the system." You spot a clear pattern: inherent in something.
- You search for `intrinsic`. You see results like: "...the intrinsic value of education," or "...her intrinsic motivation." You spot another pattern: intrinsic + noun.
- Conclusion: You've learned more than a definition. You've learned their grammatical behavior. A risk is `inherent in` a plan; a quality like `value` or `motivation` is `intrinsic`.
2. Advanced Online Dictionary Features
Good online learner's dictionaries (Oxford, Cambridge, Merriam-Webster) offer more than just definitions. Look for these features:
- Etymology: The history of a word. Knowing that `benevolent` comes from the Latin roots `bene` (good) and `velle` (to wish) helps you remember it means "kind" or "well-wishing."
- Usage Notes: Special notes written by lexicographers that explain the subtle differences between confusing words (e.g., `historic` vs. `historical`).
- Collocations & Thesaurus: Many dictionaries now integrate thesaurus and collocation information directly. Always check the example sentences for a synonym before using it to avoid collocational clashes.
Discourse in Action: A Learner's Thought Process
May, a C1 student, wants to improve this sentence: "The project has many good points."
- Thesaurus: She looks up "good point" and finds the synonym "merit."
- Initial Idea: "The project has many merits." This is grammatically correct, but is it the most natural collocation?
- Corpus Check: She searches a corpus for `merits`. She sees phrases like `artistic merits` and `relative merits`, but `many merits` is less common.
- Deeper Search: She searches for `has * merit` (using a wildcard). She finds hundreds of examples of "has considerable merit" and "has significant merit."
- The C1 Result: She writes: "The project has considerable merit." She has used advanced tools to find a more sophisticated and natural-sounding phrase.
Check Your Understanding
- The history of a word and its origins is called its...
- a) Concordancer
- b) Corpus
- c) Etymology
Answer: c) Etymology
- What is the main advantage of using a corpus to study a word?
- a) It provides a simple definition.
- b) It shows you how the word is used in thousands of real-world contexts.
- c) It translates the word into your native language.
Answer: b) It shows you how the word is used in thousands of real-world contexts.
- A student wants to know which preposition is most commonly used after the verb "to rely." The best tool for this is a...
- a) ...thesaurus.
- b) ...corpus concordancer.
- c) ...grammar book.
Answer: b) ...corpus concordancer. (It would immediately show hundreds of examples of "rely on").
Your Mission
- Corpus Explorer: Go to a free online corpus tool (like SkELL). Choose two similar words you are unsure about (e.g., `effective` vs. `efficient` or `historic` vs. `historical`). Search for both. Based on the examples, write down one difference you notice in their common usage.
- Etymology Detective: Look up the word `manufacture` in a good online dictionary and find its etymology. How do its Latin roots (`manus` + `facere`) help you understand its original meaning?
- The C1 Tool Challenge: Find a new, complex word from an article. Your mission is to do a complete study of it: 1) Look up its definition and etymology in a dictionary. 2) Look it up in a corpus to find its top two collocations. 3) Write one perfect sentence of your own using the word with its correct collocation.