71 Data Collection and Questionnaire Design
71.1 Concept
Data collection is the process of gathering and measuring information on variables of interest in a systematic, established manner, so that research questions can be answered and hypotheses tested. Leading textbook authors include Donald Cooper and Pamela Schindler (Business Research Methods), Naresh Malhotra, Earl Babbie, and C.R. Kothari.
71.2 Types of Data
Primary data: collected first-hand by the researcher for a specific purpose.
- Methods: Survey · Observation · Experiment · Interview · Focus group.

Secondary data: data already collected by someone else and reused for new research.
- Sources: government publications, industry reports, databases, internet.
71.3 Data-Collection Methods
| Method | Description |
|---|---|
| Survey / Questionnaire | Structured questions |
| Personal Interview | One-to-one |
| Telephone Interview / CATI | Computer-Assisted Telephone Interviewing |
| Mail / Postal Survey | Self-administered |
| Online Survey | Web-based — Google Forms, SurveyMonkey, Qualtrics |
| Mobile Survey | SMS, WhatsApp, app |
| Observation | Direct or participant |
| Experiment | Lab or field |
| Focus Group | Moderated discussion among 6-10 participants |
| Depth Interview | Long, qualitative |
| Projective Techniques | Indirect (word association, TAT, Rorschach) |
| Ethnography | Immersive observation |
| Content Analysis | Documents, social media |
| Big Data / Web Scraping | Digital footprints |
71.4 Survey Errors
- Sampling errors: random variation that arises because only a sample, not the whole population, is observed.
- Non-sampling errors:
  - Response bias: social desirability, acquiescence.
  - Non-response bias.
  - Interviewer bias.
  - Measurement / instrument bias.
  - Processing errors.
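
The random nature of sampling error can be illustrated with a rough sketch. It computes the approximate 95% margin of error for a sample proportion, assuming simple random sampling and a large population (assumptions that are not part of the notes above). The function name `margin_of_error` is hypothetical.

```python
import math

def margin_of_error(p: float, n: int, z: float = 1.96) -> float:
    """Approximate margin of error for a sample proportion.

    Assumes simple random sampling; z = 1.96 corresponds to roughly 95% confidence.
    """
    if n <= 0:
        raise ValueError("sample size n must be positive")
    if not 0.0 <= p <= 1.0:
        raise ValueError("proportion p must lie between 0 and 1")
    return z * math.sqrt(p * (1.0 - p) / n)

# Quadrupling the sample size halves the sampling error.
for n in (100, 400, 1600):
    print(n, round(margin_of_error(0.5, n), 4))
```

Non-sampling errors do not shrink this way: a larger sample does nothing to correct a biased question or an interviewer effect.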
71.5 Questionnaire Design
A questionnaire is a formalised set of questions to obtain information from respondents. Naresh Malhotra’s 10-step questionnaire design process is widely taught.
71.5.1 Malhotra’s 10-step Questionnaire Design
1. Specify the information needed.
2. Specify the type of interviewing method.
3. Determine the content of individual questions.
4. Design questions to overcome the respondent’s inability and unwillingness to answer.
5. Decide on the question structure.
6. Determine the question wording.
7. Arrange the questions in proper order.
8. Identify the form and layout.
9. Reproduce the questionnaire.
10. Pretest, revise and prepare the final version.
71.6 Types of Questions
| Type | Description |
|---|---|
| Open-ended | Free response |
| Close-ended | Predefined options |
| Dichotomous | Two options (Yes/No) |
| Multiple choice | Several options |
| Rating scale | Likert, Semantic Differential |
| Ranking | Order preferences |
| Filter / Screener | Routing |
| Contingency | Conditional |
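
Filter and contingency questions amount to routing rules. The sketch below shows one way such skip logic might be expressed. The three questions, their IDs, and the helper names `next_question` and `route` are all hypothetical and only illustrate the idea.

```python
from typing import Dict, List, Optional

# Hypothetical instrument: Q1 is a filter question, Q2 is contingent on Q1.
QUESTIONS = {
    "Q1": "Do you own a smartphone? (Yes/No)",   # dichotomous filter
    "Q2": "Which brand is it?",                  # contingency, open-ended
    "Q3": "What is your age group?",             # asked of everyone
}

def next_question(current: str, answer: str) -> Optional[str]:
    """Return the ID of the next question, or None when the survey ends."""
    if current == "Q1":
        # Only smartphone owners see the contingency question.
        return "Q2" if answer == "Yes" else "Q3"
    if current == "Q2":
        return "Q3"
    return None

def route(answers: Dict[str, str]) -> List[str]:
    """List the questions a respondent would see, given their answers."""
    path = []
    question: Optional[str] = "Q1"
    while question is not None:
        path.append(question)
        question = next_question(question, answers.get(question, ""))
    return path

print(route({"Q1": "Yes", "Q2": "Any brand"}))  # owner sees all three questions
print(route({"Q1": "No"}))                      # non-owner skips Q2
```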
71.7 Scaling Techniques
| Scale | Description | Inventor |
|---|---|---|
| Likert | 5/7-point agreement (Strongly Disagree → Strongly Agree) | Rensis Likert (1932) |
| Semantic Differential | Bipolar adjectives (Good-Bad) | Charles Osgood (1957) |
| Thurstone | Equal-appearing intervals | L.L. Thurstone (1928) |
| Guttman | Cumulative scale | Louis Guttman (1944) |
| Stapel Scale | Unipolar scale from +5 to −5 (no neutral zero) around a single adjective | Jan Stapel |
| Bogardus Social Distance | Acceptance closeness | Emory Bogardus (1925) |
| Q-Sort | Forced-choice ranking | William Stephenson (1953) |
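| Constant Sum | Respondent distributes a fixed number of points (e.g., 100) across attributes | |
| Paired Comparison | Objects judged two at a time; respondent picks one from each pair | |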
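
Three of the scales above involve simple arithmetic, sketched here under stated assumptions: a Likert score as a summated rating with reverse-coded items, a constant-sum check against an assumed total of 100 points, and the number of pairs a paired-comparison task needs. All function names are hypothetical.

```python
def likert_total(responses, reverse_items=(), points=5):
    """Summated Likert score; items at the given indexes are reverse-coded."""
    total = 0
    for index, score in enumerate(responses):
        if not 1 <= score <= points:
            raise ValueError(f"score {score} is outside 1..{points}")
        # Reverse-coding maps 1 -> points, 2 -> points - 1, and so on.
        total += (points + 1 - score) if index in reverse_items else score
    return total

def constant_sum_ok(allocation, total=100):
    """True if the non-negative points add up to the fixed total."""
    values = list(allocation.values())
    return all(v >= 0 for v in values) and sum(values) == total

def paired_comparisons(n_items):
    """Number of distinct pairs to judge: n(n - 1) / 2."""
    return n_items * (n_items - 1) // 2

print(likert_total([4, 5, 2, 4], reverse_items={2}))                    # 17
print(constant_sum_ok({"price": 50, "quality": 30, "service": 20}))     # True
print(paired_comparisons(5))                                            # 10
```

The pair count grows quickly with the number of objects, which is one reason paired comparison is usually limited to a small set of brands or attributes.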
71.8 Validity and Reliability
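Validity (does the instrument measure what it claims to measure?):
- Content validity: adequate coverage of the construct.
- Construct validity: Convergent + Discriminant.
- Criterion validity: Concurrent + Predictive.
- Face validity: the instrument appears valid on inspection.

Reliability (does the instrument give consistent results?):
- Test-retest: consistency over time.
- Internal consistency: Cronbach’s Alpha (1951); a value of 0.7 or above is conventionally treated as acceptable.
- Parallel forms.
- Inter-rater reliability: Cohen’s Kappa.
- Split-half.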
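
Two of these reliability statistics are easy to compute by hand. The sketch below uses the standard formulas, α = k/(k−1) × (1 − Σ item variances / variance of total scores) and κ = (p_o − p_e)/(1 − p_e), with only the Python standard library. The function names and sample data are hypothetical.

```python
from collections import Counter
from statistics import variance

def cronbach_alpha(rows):
    """Cronbach's alpha; each row holds one respondent's item scores."""
    k = len(rows[0])
    if k < 2 or len(rows) < 2:
        raise ValueError("need at least two items and two respondents")
    item_variance_sum = sum(variance(column) for column in zip(*rows))
    total_variance = variance([sum(row) for row in rows])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)

def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters who labelled the same cases."""
    n = len(rater_a)
    if n == 0 or n != len(rater_b):
        raise ValueError("both raters must label the same non-empty set of cases")
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    # Agreement expected by chance, from each rater's marginal proportions.
    expected = sum(counts_a[c] * counts_b[c] for c in set(counts_a) | set(counts_b)) / (n * n)
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)

print(cronbach_alpha([[1, 1, 1], [2, 2, 2], [3, 3, 3]]))        # perfectly consistent items
print(cohen_kappa(["y", "y", "n", "n"], ["y", "n", "n", "n"]))  # 0.5
```

In practice a statistics package would normally be used; the point here is only to show what the two coefficients compare.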
71.9 Pretest and Pilot Study
Run the questionnaire with a small sample (15-30 respondents) before the main study. This identifies confusing questions, the time required to complete it, and unexpected response patterns.
71.10 Errors to Avoid in Questions
- Leading questions — bias response.
- Loaded questions — emotional charge.
- Double-barrelled questions — two ideas in one.
- Ambiguous questions — vague terms.
- Negative wording.
- Jargon and complex vocabulary.
- Assumptive questions.
- Generalisation questions.
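
Some of these wording errors can be screened for mechanically during drafting. The following is a deliberately crude heuristic sketch, not an established tool: it flags "and"/"or" as a possible sign of a double-barrelled question and common negations as negative wording. The helper name `flag_question` and the word lists are assumptions, and every flag still needs human review.

```python
import re

def flag_question(text: str) -> list:
    """Return heuristic warnings for one survey question (hypothetical helper)."""
    words = re.findall(r"[a-z']+", text.lower())
    flags = []
    # "and"/"or" may join two separate ideas in one question.
    if "and" in words or "or" in words:
        flags.append("possibly double-barrelled")
    # Negations make agree/disagree answers harder to interpret.
    if any(w in ("not", "never") or w.endswith("n't") for w in words):
        flags.append("negative wording")
    return flags

print(flag_question("Do you find the website fast and easy to use?"))
print(flag_question("What did you like about our product?"))
```

The first question is flagged because "fast" and "easy to use" are two separate ideas; the second passes both checks.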
71.11 Indian Research Agencies
- IMRB / Kantar India.
- Nielsen India.
- AC Nielsen ORG-MARG (now Nielsen).
- GfK India.
- Hansa Research.
- Ipsos India.
- TNS India.
- C-Voter (political).
- CMIE (economic data).
- Crisil Research.
71.12 Modern Trends
- Online and mobile surveys dominant.
- Big data and web scraping.
- Social media listening.
- Real-time / Always-on panels.
- Eye-tracking and biometrics.
- Neuro-marketing tools.
- AI-generated personas and synthetic data.
- Conversational surveys (chatbots).
- Behavioural analytics.
- Geo-location surveys.
- Privacy-first / consent management.
- Mobile ethnography.
71.13 Practice Questions
The Likert scale (1932) was developed by:
Semantic Differential scale uses:
Cronbach's Alpha (1951) measures:
Social Distance scale (1925) is by:
Pretest is conducted to:
"Do you find the website fast and easy to use?" is an example of:
Q-Sort technique is by:
"What did you like about our product?" is:
Optimal size of a focus group is:
Census of India data is:
"Does the scale measure what it claims to measure?" refers to:
Word association and TAT are examples of:
Equal-appearing intervals scale is by:
CATI stands for:
Match the following:
| | Scale | | Feature |
|---|---|---|---|
| (i) | Likert | (a) | Bipolar |
| (ii) | Semantic Differential | (b) | Cumulative |
| (iii) | Guttman | (c) | Agreement |
| (iv) | Bogardus | (d) | Social distance |
71.13.1 Advanced Format Questions
A: Likert is summated rating.
R: It uses bipolar adjective pairs.
Reliability measures: (i) Test-retest. (ii) Internal consistency. (iii) Parallel forms. (iv) Inter-rater.
Questionnaire pitfalls: (i) Leading. (ii) Double-barrelled. (iii) Ambiguous. (iv) Loaded.
71.14 Quick Recall
- Data: Primary vs Secondary.
- Methods: Survey · Interview · Observation · Experiment · Focus Group · Projective · Ethnography · Web scraping.
- Errors: Sampling vs Non-sampling (response, non-response, interviewer, measurement, processing).
- Malhotra’s 10-step questionnaire process.
- Question types: Open · Close · Dichotomous · MCQ · Rating · Ranking · Filter · Contingency.
- Scales: Likert (1932) · Semantic Differential (Osgood 1957) · Thurstone (1928) · Guttman (1944) · Stapel · Bogardus Social Distance (1925) · Q-Sort (Stephenson 1953) · Constant Sum · Paired Comparison.
- Validity: Content · Construct (Convergent/Discriminant) · Criterion (Concurrent/Predictive) · Face.
- Reliability: Test-retest · Cronbach’s α (1951) ≥ 0.7 · Parallel · Inter-rater (Cohen’s κ) · Split-half.
- Question errors: Leading · Loaded · Double-barrelled · Ambiguous · Negative · Jargon · Assumptive.
- Indian agencies: IMRB/Kantar · Nielsen · Ipsos · CMIE · Crisil · C-Voter · Hansa · GfK.
- Modern trends: online/mobile · big data · social listening · biometrics · neuro · AI/synthetic data · conversational · geo-location · privacy-first.