NFHS‑5 · 2019–21 · Methodology

How every estimate on this dashboard is computed

All figures are derived directly from NFHS‑5 (DHS India) unit‑level microdata using the survey design weights — no pre‑tabulated tables are used. This page documents the estimator, the definition of each indicator (numerator, denominator, source variables), and the known limitations, so any result can be independently verified against the official India Fact Sheet.

Source NFHS‑5 (2019–21), IIPS / MoHFW · DHS Program Recodes KR · IR · HR · PR · MR · BR Validation national estimates vs India Fact Sheet (±0.5pp; rates ±1.0; TFR ±0.1)

1The survey‑weighted estimator


NFHS‑5 is a stratified, multi‑stage probability sample. To recover nationally and sub‑nationally representative quantities, every record is weighted by its design weight. DHS distributes the weight as an integer scaled by \(10^{6}\); the analytic weight is

$$ w_i \;=\; \frac{V005_i}{1{,}000{,}000}\qquad\text{(women/children: }V005;\ \text{household: }HV005;\ \text{men: }MV005\text{)} $$

For a binary indicator \(x_i\in\{0,1\}\) defined over an eligible population, the point estimate is the weighted proportion, expressed as a percentage:

$$ \hat{p} \;=\; \frac{\displaystyle\sum_{i\in\mathcal{E}} w_i\,x_i}{\displaystyle\sum_{i\in\mathcal{E}} w_i}\times 100 $$

where \(\mathcal{E}\) is the set of records that satisfy the indicator's eligibility rule (the denominator). Records outside \(\mathcal{E}\), and those with a missing value of \(x\), are excluded — they are coded NaN and never enter either sum. The reported sample size \(n=\lvert\mathcal{E}\rvert\) is the unweighted count and drives the reliability flag below.

Every indicator is produced at nine aggregation levels — national, state/UT, urban–rural, wealth quintile, women's education, and the state×residence, state×wealth, state×education cross‑tabs, plus district — by restricting \(\mathcal{E}\) to the corresponding subgroup. Mortality and fertility (§14) are the sole exception: they are model‑based rates, not weighted means.

Design‑based inference Point estimates use the sampling weights only. For standard errors and confidence intervals that respect the design, declare the survey structure — PSU V021, strata V022, weight V005 — in a survey package (svyset in Stata, svydesign in R, samplics in Python). This dashboard reports point estimates; it does not display sampling errors.

2Reliability & small‑sample suppression


Sub‑national cells — particularly wealth or education breakdowns for small UTs, or rare‑event indicators whose eligible population is a slice of a slice — can rest on a handful of observations, where a single respondent moves the estimate by tens of points. Following the DHS reporting convention, every cell carries a reliability band derived from its unweighted denominator \(n\):

BandRuleInterpretationDashboard treatment
ok\(n \ge 50\)ReliableShown normally
caution\(25 \le n < 50\)Interpret with cautionMuted, "small sample" flag
suppress\(n < 25\)UnreliableHidden / greyed as no‑data

Model‑based rates (§14) use a more conservative band keyed to birth/exposure counts (suppress below ~250, caution below ~500), because rates require substantially larger samples than proportions for comparable precision.

3Child anthropometry & nutritional status

Children under 5 with a valid anthropometric z‑score (WHO Child Growth Standards). Z‑scores are stored ×100; the plausible range is \([-600,600]\) (i.e. \(\pm6\) SD); flagged values are excluded.

$$ \text{stunted}= \mathbb{1}[\,HAZ<-2\,],\quad \text{wasted}= \mathbb{1}[\,WHZ<-2\,],\quad \text{underweight}= \mathbb{1}[\,WAZ<-2\,] $$
Stunting / Wasting / Underweight / Overweight FS #81–85 · 35.5 / 19.3 / 32.1 / 3.4
Population
Children age <60 months with a valid z‑score for the relevant index.
Definition
Stunting hw70<−200; wasting hw72<−200; underweight hw71<−200; overweight hw72>+200; severe wasting hw72<−300.
Variables
hw70 HAZ · hw71 WAZ · hw72 WHZ · b19 age
Child anaemia (6–59 months)child_anaemia FS #92 · 67.1 approx
Population
Children 6–59 months with a valid haemoglobin reading.
Definition
Any anaemia = altitude‑adjusted Hb <11.0 g/dL, read from the DHS level variable hw57 ∈ {1,2,3} (severe/moderate/mild); 4 = not anaemic.
Variables
hw57 · b19
Documented near‑miss (+0.9pp) The survey‑weighted rate is robustly 68.0 against the published 67.1. This was traced to a weighting nuance, not an altitude or definition error: reconstructing altitude‑adjusted Hb from raw hw53 and cluster altitude reproduces hw56 to within 0.014 g/dL and moves the estimate further off, and alternative age/residence denominators all stay at 67.9–68.0 weighted. The principled weighted definition is retained for consistency with every other indicator; treat this single value as accurate to ~1pp.

4Child immunization

Children age 12–23 months, with vaccination established from the card or the mother's recall — each antigen coded received if its status is in {1,2,3} (1 = date on card, 2 = mother's report, 3 = marked on card).

Fully vaccinated (basic)full_imm FS #57 · 76.4
Population
Living children 12–23 months.
Definition
BCG and 3 doses penta/DPT and 3 doses polio (excl. birth dose) and measles‑containing vaccine, dose 1.
Variables
h2 BCG · h7 DPT3/penta · h4,h6,h8 polio 1/2/3 · h9,h9a MCV · b5 alive · b19
Three pitfalls this indicator corrects (1) Dead children. The KR file holds deceased children with blank vaccination fields; counting them as "not vaccinated" deflates every antigen ~3.5pp. The denominator is restricted to living children (b5==1).
(2) Polio birth dose. The fact‑sheet footnote defines polio‑3 as three doses excluding the dose given at birth (h0); the count uses routine doses h4,h6,h8 only.
(3) Measles / MR. India's measles‑rubella rollout records some first doses in the second‑dose slot, so MCV1 = h9∈{1,2,3,4} OR h9a. Without these corrections the naïve estimate is ~60%.
AntigenVariableFact sheet
BCGh2 ∈ {1,2,3}95.2 (#59)
DPT/penta 3h7 ∈ {1,2,3}86.7 (#61)
Polio 3 (excl. birth)h4 & h6 & h880.5 (#60)
MCV 1h9∈{1‑4} | h9a87.9 (#62)

5Infant & young child feeding

Feeding indicators use the 24‑hour dietary recall, collected for the youngest child living with the mother. Food items are mapped to the eight WHO‑2021 IYCF food groups.

Early initiation of breastfeedingearly_bf FS #75 · 41.8
Population
Last child born in the 3 years before the survey (bidx==1 & age<36 mo).
Definition
Put to the breast within one hour of birth: m34 ∈ {0,100} (0 = immediately, 100 = within the first hour). Never‑breastfed / missing remain in the denominator as 0.
Exclusive breastfeeding (<6 months)excl_bf FS #76 · 63.7
Population
Youngest child <6 months, alive, living with mother.
Definition
Breastfed (v404==1) and given nothing else in the 24‑hour recall (no water, other milk, or foods across v409–v414*).
Adequate diet — minimum acceptable diet (6–23 mo)adequate_diet_6_23 FS #80 · 11.3
Definition
Two formulas by breastfeeding status (per fact‑sheet footnote):
$$ \text{MAD}=\begin{cases}\mathbb{1}[\,\text{food groups}\ge 4\,]\;\wedge\;\text{MMF}, & \text{breastfed}\\[4pt] \mathbb{1}[\,\text{milk feeds}\ge 2\,]\;\wedge\;\text{MMF}\;\wedge\;\mathbb{1}[\,\text{non‑milk food groups}\ge 4\,], & \text{non‑breastfed}\end{cases} $$

MMF = minimum meal frequency (breastfed: ≥2 solid feeds at 6–8 mo, ≥3 at 9–23 mo; non‑breastfed: ≥4 total feeds with ≥1 solid). Validated against the breastfed (11.1, #78) and non‑breastfed (12.7, #79) sub‑components.

Timely complementary feeding (6–8 mo)comp_feeding_6_8 FS #77 · 45.9
Definition
Breastfed and receiving solid/semi‑solid food, among children 6–8 months.
Definitional choice — dietary diversity Minimum dietary diversity (min_diet_diversity) has no NFHS‑5 fact‑sheet target. It is computed to the WHO‑2021 standard of ≥5 of 8 food groups (national value 23.6%). NFHS‑5's own "adequate diet" composite (above) instead uses the ≥4 food‑group threshold defined in its footnote, so the two are intentionally not identical.

6Childhood illness & care‑seeking

Denominators here are rare‑event slices (only children who were ill), so sub‑national cells lean heavily on the reliability flag.

ORS / Zinc for diarrhoeaors_diarrhoea · zinc_diarrhoea FS #70 / #71 · 60.6 / 30.5
Population
Children with diarrhoea in the 2 weeks before the survey (h11 ∈ {1,2}).
Definition
Given ORS h13 ∈ {1,2}; given zinc h15e==1.
Care‑seeking for fever / ARIcareseek_ari_fever FS #74 · 69.0
Population
Children with fever (h22==1) or ARI symptoms (cough h31∈{1,2} with short, rapid breathing h31b==1).
Definition
Advice or treatment sought from any health facility or provider (the h32* source series), excluding traditional healer, friend/relative, and no‑treatment.

7Maternal & newborn care

Antenatal and postnatal indicators refer to the woman's most recent live birth in the 5 years before the survey; delivery indicators are computed at the birth level over all live births in that window.

Institutional deliveryinst_delivery FS #50 · 88.6
Population
All live births in the last 5 years (m15_1..m15_6 reshaped to birth level, age ≤59 mo).
Definition
Delivered in a health facility: m15 ∈ [20,89] (10–19 = home, 96 = other → 0).

Restricting to the most recent birth only would bias this ~1.7pp high, as recent births are more likely institutional.

Caesarean sectioncsection · _public · _private FS #54/56/55 · 21.5 / 14.3 / 47.4
Definition
m17==1 over all live births; the public split is conditioned on m15∈[20,29], the private split on m15∈[30,39].
Antenatal careanc4plus · anc_1st_tri FS #41 / #40 · 58.1 / 70.0
Definition
4+ ANC visits: m14_1 ≥ 4. First‑trimester ANC: m13_1 ≤ 3 months. Both over all most‑recent births in the window.
Denominator rule "Don't know" / missing number of ANC visits are counted as not 4+ (i.e. 0), not dropped — dropping them would inflate the rate ~1pp. With this rule the national estimate is 58.4 (published 58.1).
IFA supplementationifa_100plus · ifa_180plus FS #43 / #44 · 44.1 / 26.0
Definition
Iron‑folic‑acid taken for ≥100 / ≥180 days in pregnancy: m46_1 ≥ 100 / 180 (with <900 to drop the "don't know" code 998).
Neonatal‑tetanus protection & postnatal carennt_protection · pnc_mother_2days · pnc_newborn_2days FS #42 / #46 / #49 · 92.0 / 78.0 / 79.1
Definition
NNT: ≥2 lifetime tetanus injections (m1_1 + m1a_1 ≥ 2). PNC: a check by health personnel (provider 11/12/13) within ≤2 days (timing code ≤202).

8Family planning

Contraception indicators are defined over currently‑married (in‑union) women 15–49 — the standard CPR denominator. Computing over all women is a frequent error that understates use by ~14pp and inverts the education gradient.

Modern contraceptive use (mCPR)modern_contra FS #29 · 56.5
Population
Currently‑married women 15–49 (v502==1) with valid method status.
Definition
Currently using a modern method: v313==3.
Unmet need for family planningunmet_need FS #36 · 9.4
Definition
DHS‑derived v626a ∈ {1,2} (unmet need for spacing or limiting), among currently‑married women. The derived variable is used directly — the multi‑clause definition is never rebuilt by hand.

9Women's nutrition & anaemia

Women 15–49. BMI is stored ×100 (so 18.5 kg/m² → 1850); pregnant women and those with a very recent birth are excluded from BMI per the survey convention.

Thin / Overweight‑or‑obesethin_women · women_overweight FS #86 / #88 · 18.7 / 24.0
Definition
Thin v445 < 1850; overweight/obese v445 ≥ 2500 (valid BMI 1200–6000).
Anaemia (women 15–49)anemia_women FS #95 · 57.0
Definition
Any anaemia v457 ∈ {1,2,3}, altitude‑ and smoking‑adjusted. NFHS uses capillary blood, so values are not comparable to venous‑blood surveys.

10Women's empowerment, literacy & digital access

All women 15–49 (menstrual hygiene: 15–24).

IndicatorKeyDefinitionFS
Literacywomen_literatev155∈{1,2} (reads a sentence) OR v133≥971.5 (#14)
10+ yrs schoolingwomen_edu10v133 ≥ 1041.0 (#16)
Owns house/landown_house_landv745a or v745b ∈ {1,2,3} (alone or jointly)43.3 (#121)
Uses a bank accountbank_account_selfv170 == 178.6 (#122)
Uses a mobile phonemobile_selfv169a == 154.0 (#123)
Ever used internetinternet_everv171a ∈ {1,2,3}33.3 (#18)
Menstrual hygiene (15–24)menstrual_hygieneany hygienic method s260{b,c,d,e}==177.3 (#124)
Married before 18 (20–24)child_marriage_womenv511 < 18 among women 20–2423.3 (#20)
Begun childbearing (15–19)teen_pregv201>0 or currently pregnant v213==16.8 (#23)

11Household environment & assets

Household‑level indicators, weighted by HV005.

IndicatorKeyDefinitionFS
Electricityelectricityhv206 == 196.8 (#7)
Improved drinking waterimp_waterhv201 ∈ JMP improved set95.9 (#8)
Improved sanitationimp_sanitationhv205 ∈ improved set AND not shared (hv225==0)70.2 (#9)
Clean cooking fuelclean_fuelhv226 ∈ {1,2,3,4} (electricity/LPG/gas/biogas)58.6 (#10)
Iodized saltiodized_salthv234a == 1 among tested households94.3 (#11)
Health insurancehealth_insuranceany member covered by a scheme41.0 (#12)
Two definitional points Sanitation must exclude shared facilities (hv225==0); the improved‑regardless‑of‑sharing figure is ~9pp higher and does not match the published indicator. Health insurance is the household coverage flag (any usual member covered), validated at 41.2% against the published 41.0%; note that coverage predates the full rollout of national schemes at survey time, so it understates current coverage.

12Adult biomarkers & non‑communicable disease (15+)

Person‑level (PR recode), de‑facto household members aged 15+, computed separately for women (_w) and men (_m). Anaemia is offered in two framings — see the note.

IndicatorDefinitionFS (W / M)
High blood sugarrandom plasma glucose shb74 > 140 mg/dL OR on glucose medication13.5 / 15.6
Hypertensionmean of 2nd & 3rd readings ≥140 systolic and/or ≥90 diastolic, OR on BP medication21.3 / 24.0
Any tobacco usesh25 == 18.9 / 38.0
Any alcohol usesh26 == 11.3 / 18.8
Anaemia (15+, biomarker)ha57 / hb57 ∈ {1,2,3}57.0 / 25.0

Blood pressure uses the average of the 2nd and 3rd measurements (the 1st is discarded); out‑of‑range special codes are treated as missing.

Two anaemia framings, kept separate The person‑level (15+, biomarker) anaemia here and the woman‑level (15–49) anaemia in §9 are distinct indicator keys and are never merged. They agree closely nationally but diverge by state because they use different weights and denominators — that divergence is real and is itself worth exploring in the correlation view.

13Men's indicators (15–49)

Men's recode (MR), restricted to 15–49 for fact‑sheet parity. Men's BMI is not carried in MR and is merged from the person recode (hb40) on the household‑member key, weighted by mv005.

IndicatorKeyDefinitionFS
Literacymen_literatemv155∈{1,2} OR mv133≥984.4 (#15)
10+ yrs schoolingmen_edu10mv133 ≥ 1050.2 (#17)
Thin (BMI<18.5)men_thinhb40 < 185016.2 (#87)
Overweight (BMI≥25)men_overweighthb40 ≥ 250022.9 (#89)
Ever used internetmen_internetmv171a ∈ {1,2,3}57.1 (#19)
Comprehensive HIV knowledgemen_hiv_knowledge5‑condition composite (2 prevention facts, healthy‑can‑have, reject 2 myths)30.7 (#116)

14Mortality & fertility model‑based rate

These are not weighted proportions. They are synthetic‑cohort rates following the DHS Guide to Statistics, computed from the full birth history (BR) with woman‑level exposure (IR). Available at national / state / residence / wealth / education only — never by district, where they would be unstable.

Childhood mortality

Over a reference window of the 60 months before interview, deaths and child‑months of exposure are accumulated in eight age segments \(s\in\{[0,1),[1,3),[3,6),[6,12),[12,24),[24,36),[36,48),[48,60)\}\) months. Each segment yields a conditional death probability

$$ q_s \;=\; \frac{\sum_i w_i\,d_{i,s}}{\sum_i w_i\,e_{i,s}} $$

where \(d_{i,s}\) flags a death of child \(i\) inside segment \(s\) and within the window, and \(e_{i,s}\) is the fraction of the segment the child actually lived in‑window. Segment probabilities are chained into cumulative mortality:

$$ \text{NNMR}=1000\,q_{[0,1)},\qquad {}_{1}q_{0}=1000\Big(1-\!\!\prod_{s:\,<12\text{mo}}\!\!(1-q_s)\Big),\qquad {}_{5}q_{0}=1000\Big(1-\prod_{\text{all }s}(1-q_s)\Big) $$

Targets — NNMR 24.9 (#25), IMR \({}_1q_0\) 35.2 (#26), U5MR \({}_5q_0\) 41.9 (#27), per 1,000 live births.

Fertility

Over the 36 months before interview, age‑specific fertility rates divide births in the window by woman‑years of exposure in each five‑year age group \(g\) (exposure drawn from all women, including the childless, in the IR file):

$$ \text{ASFR}_g=\frac{\sum w\cdot(\text{births to women aged }g)}{\sum w\cdot(\text{woman‑years aged }g)},\qquad \text{TFR}=5\sum_{g} \text{ASFR}_g $$

Targets — TFR 2.0 (#22), adolescent fertility ASFR(15–19) 43 per 1,000 (#24), sex ratio at birth 929 females per 1,000 males (#4).

15Known limitations


ItemStatusDetail
Child anaemia±0.9ppWeighted 68.0 vs published 67.1; a weighting nuance, not a definition error. Accurate to ~1pp.
Dietary diversity thresholdresolvedUses the WHO‑2021 ≥5/8 food‑group standard (national 23.6%). No fact‑sheet target exists; the NFHS "adequate diet" composite uses the older ≥4‑group rule separately.
Sampling errorsnot shownPoint estimates only. Confidence intervals require a design‑based survey package (PSU/strata/weight).
District‑level ratesexcludedMortality/fertility are not estimated at district level — sample sizes are too small for stable rates.
Small‑UT breakdownssuppressedWealth/education cells for small UTs (e.g. Lakshadweep) often fall below n=25 and are hidden; national wealth quintiles mean affluent UTs may have no "poorest" cell at all.
Capillary‑blood anaemianoteNFHS measures haemoglobin in capillary blood; values should not be compared with venous‑blood surveys.

Every numeric target above (FS #n) refers to the official NFHS‑5 India Fact Sheet. A national estimate is accepted when it falls within ±0.5pp of its target (±1.0 for mortality rates, ±0.1 for TFR); the one documented exception is child anaemia.