Smoking Start Age Tendency and Your Genetics
By the ExomeDNA Research Team
Age of smoking initiation refers to the age at which an individual first begins using tobacco products, a behavioral trait with a measurable heritable component. Like many complex behavioral phenotypes, the timing of smoking onset is shaped by a combination of environmental context, social factors, and inherited genetic tendencies that influence brain development, reward circuitry, and behavioral disposition. A large-scale genome-wide study by Liu et al. (2019) — covering approximately 1.2 million individuals — identified 566 genetic variants in 406 loci associated with multiple stages of tobacco use, including initiation timing. Genes in the smoking start age signal include AUTS2, BCL11B, ARID5B, IGSF21, and CDH6, converging on themes of neurodevelopment, synaptic architecture, and chromatin regulation.
What is age of smoking initiation?
Age of smoking initiation captures when, across the lifespan, a person first begins smoking regularly. It is distinct from smoking quantity, dependence severity, or cessation success — each of which has its own partially overlapping genetic architecture. MTAG (multi-trait analysis of GWAS) methodology, applied in this dataset, combines signals across related traits to increase statistical power, allowing the identification of loci that might not reach genome-wide significance in single-trait analyses.
Research base: Robust.
Studying the timing of smoking onset as a genetic phenotype reveals the biological scaffolding that makes certain individuals more or less inclined toward early tobacco experimentation. This is a behavioral tendency shaped by neurodevelopmental processes that genetics helps define, not a predetermined outcome.
The genetics of smoking onset timing
The 1.2 million-person dataset analyzed by Liu et al. (2019) provided sufficient statistical power to detect 406 genetic loci associated with tobacco use phenotypes including age of initiation (Liu et al., 2019). The scale of this meta-analysis, combined with MTAG methodology, means that the loci identified reflect robust, replicated signals rather than single-study findings.
The gene set for smoking initiation age spans neurodevelopmental transcription factors, synaptic adhesion molecules, chromatin regulators, and RNA-binding proteins. This breadth reflects the complexity of the behavioral circuitry involved: the age at which a person begins smoking is not reducible to any single biological pathway but reflects the intersection of impulse regulation, reward sensitivity, social integration, and neurodevelopmental timing.
Key genes: AUTS2, BCL11B, ARID5B, and neural circuits
AUTS2 (autism susceptibility candidate 2) is, despite its historical name, a broadly neurodevelopmental gene with roles in neuronal differentiation, synaptic function, and transcriptional regulation in the brain. It has been associated with multiple behavioral phenotypes in large-scale GWAS datasets, including alcohol use, educational attainment, and cognitive function. Its appearance in the smoking initiation timing signal suggests that the neurodevelopmental processes AUTS2 regulates — particularly the maturation of prefrontal and striatal circuits that govern impulse control and reward learning — are relevant to the timing of behavioral experimentation with tobacco.
BCL11B encodes a zinc finger transcription factor that is essential for the development of both T-cells and neurons, particularly cortical and striatal neurons. Striatal neurons form the core of the basal ganglia reward circuit, and the expression of BCL11B in developing striatum directly positions it as a gene relevant to the behavioral neuroscience of reward-seeking and initiation of substance use. Variants in BCL11B that alter its transcriptional activity during neurodevelopment may subtly reshape the reward circuitry architecture associated with behavioral initiation tendencies.
ARID5B encodes AT-rich interactive domain 5B, a transcription factor involved in chromatin remodeling and gene expression regulation in multiple tissue types including the brain. ARID5B has been associated with educational attainment and cognitive traits in genome-wide studies, suggesting a broad role in regulating neurological development relevant to behavior. Its high-confidence positioning in this signal (appearing in both the filtered gene set and in the top-scoring candidates from this analysis) indicates robust evidence for its involvement.
IGSF21 encodes immunoglobulin superfamily member 21, a synaptic adhesion molecule with a specific role in the formation of inhibitory GABAergic synapses in the brain. GABAergic inhibitory tone in the prefrontal cortex and associated circuits modulates impulsivity and the response to novelty; variants that alter inhibitory synapse formation through IGSF21 may influence these circuits in ways relevant to the timing of behavioral experimentation.
CDH6 (cadherin-6) is a neural cadherin involved in axon guidance and circuit formation during brain development, particularly in regions including the olfactory system and limbic areas. ELAVL2 is an ELAV-like RNA binding protein expressed in neurons that stabilizes the mRNA of synaptic proteins, making it a post-transcriptional regulator of how strongly synaptic genes are expressed during development.
What the research says
Liu et al. (2019) analyzed approximately 1.2 million individuals and identified 566 genetic variants in 406 loci associated with multiple tobacco use phenotypes including age of smoking initiation (Liu et al., 2019). A notable finding was the substantial overlap in genetic signals between tobacco and alcohol use traits, suggesting that some of the same neurodevelopmental and behavioral circuits shape the timing and patterns of substance experimentation broadly rather than specifically.
Stat block: In the Liu et al. (2019) meta-analysis of approximately 1.2 million individuals, 566 variants in 406 loci were identified for tobacco and alcohol use phenotypes, representing one of the largest substance use genetics datasets assembled to date and substantially expanding the genetic map for behavioral traits including smoking initiation timing (Liu et al., 2019).
The use of MTAG methodology — which combines signals from correlated traits to boost detection power — means that some of the loci identified reflect shared genetic architecture across related phenotypes such as smoking quantity and dependence, in addition to initiation age specifically.
Stat block: The genetic overlap between tobacco use traits and alcohol use traits in the Liu et al. (2019) dataset points to shared neurobiological pathways underlying substance use behaviors more broadly, with genes in neurotransmitter systems, brain development, and reward circuitry appearing across multiple correlated phenotypes (Liu et al., 2019).
How smoking initiation genetics relates to your biology
The genetic architecture of smoking start age converges on brain development, inhibitory synapse formation, reward circuit maturation, and chromatin regulation. Variants in these genes influence the architecture of neural circuits — particularly those governing impulse regulation, reward sensitivity, and response to novel stimuli — that are relevant to when and whether individuals experiment with tobacco.
Genetic variants describe biological tendencies within the full landscape of influences on behavior. Social environment, family structure, peer context, access to tobacco products, and early life experiences all interact with genetic predispositions. The presence of variants in AUTS2, BCL11B, or IGSF21 does not predetermine behavioral outcomes; it describes the biological substrate upon which those other influences operate.
Importantly, the same neural circuitry relevant to smoking initiation timing is also relevant to a range of health-supporting behaviors: impulse regulation, response to rewards, and engagement with long-term goals. Understanding your genetic architecture in this domain can inform approaches to behavioral wellness that are grounded in the biology of the circuits involved.
Working with your smoking initiation genetic profile
The neural circuits highlighted by this genetic profile — particularly GABAergic inhibitory systems, striatal reward circuitry, and prefrontal regulatory regions — respond to both behavioral and lifestyle inputs across the lifespan.
Practical starting points:
- Regular aerobic exercise has documented effects on prefrontal and striatal function, including enhanced inhibitory control and reduced impulsivity, which are directly relevant to the circuits identified in this gene set
- Sleep quality and consistency support prefrontal regulatory function; disrupted sleep disproportionately impairs the frontal circuits involved in impulse regulation
- Mindfulness-based practices and stress management have been shown to strengthen prefrontal regulatory engagement with striatal reward signals, targeting the circuitry relevant to IGSF21 and BCL11B biology
- If tobacco use is part of your history, cessation support programs that combine behavioral therapy with understanding of individual reward-circuit biology may be particularly aligned with the neurodevelopmental architecture highlighted by this genetic profile
- Consider discussing this profile with a healthcare provider alongside other substance use history, as the genetic overlap with alcohol use traits noted in the research literature may be relevant context
Frequently asked questions
Q: What does age of smoking initiation measure genetically? A: It captures the heritable component of when individuals typically begin using tobacco products. The genetic variants identified in large-scale studies reflect how the biological scaffolding of brain development, reward circuitry, and behavioral disposition interacts with environmental and social factors to shape the timing of tobacco experimentation.
Q: Why do neurodevelopmental genes appear in a smoking behavior study? A: The timing of behavioral experimentation with substances is shaped by the developmental maturation of brain circuits governing impulse control, reward sensitivity, and response to novelty. Genes like AUTS2, BCL11B, and IGSF21 are expressed in the prefrontal cortex and striatum during development, directly shaping the neural architecture relevant to these behavioral tendencies.
Q: What is IGSF21, and why is it relevant to smoking initiation? A: IGSF21 is a synaptic adhesion molecule involved in forming inhibitory GABAergic synapses in the brain. GABAergic inhibitory tone in the prefrontal cortex modulates impulsivity and novelty response. Variants in IGSF21 that alter inhibitory synapse formation may subtly reshape circuits relevant to behavioral initiation tendencies, including the timing of tobacco experimentation.
Q: Does this genetic profile mean I am more or less likely to start smoking? A: Genetic variants in this trait describe biological tendencies within the full landscape of social, environmental, and developmental influences on behavior. The research identifies genetic architecture associated with smoking initiation timing at the population level; individual outcomes depend on the interaction of those tendencies with many other factors. Genetic profiles describe biological context, not behavioral certainty.
Q: Is there overlap between this gene set and other behavioral traits? A: Yes. Liu et al. (2019) found substantial overlap between the genetic signals for tobacco use traits and alcohol use traits, and many of the genes identified — including AUTS2 and ARID5B — have also appeared in GWAS studies of educational attainment and cognitive function. This reflects shared neurobiological pathways underlying substance use behaviors and broader behavioral tendencies.
References
Liu M, et al. (2019). Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use. Nature Genetics. PMID: 30643251.
Data sources: GWAS Catalog, Open Targets, ClinVar, ClinGen, NCBI Gene, dbSNP, PheGenI.