Agroecology 3 min

83% of the genetic heritage of vanilla has been determined

PRESS RELEASE - Almost all the vanilla sold worldwide comes from a single species: Vanilla planifolia. The limited genetic variability of cultivated vanilla makes the sector particularly vulnerable to climate and health risks. In publishing the sequence for 83% of the Vanilla planifolia genome, a research consortium coordinated by CIRAD Réunion has paved the way for more effectively targeted, faster creation of new varieties.

Published on 06 May 2022

illustration 83% of the genetic heritage of vanilla has been determined
© CIRAD - R. Carayol

The sequence of virtually the entire genome of Vanilla planifolia, the species that provides one of the world’s most widely consumed flavourings, is now available to the public and to researchers. This discovery is the result of four years of research by a French consortium comprising two private firms, Eurovanille and V. MANE FILS, and six public research organizations: CIRAD, the University of La Réunion, INRAE, CNRS, Paris-Saclay University and Etablissement Vanille de Tahiti.

More than 59 000 genes identified

This vast sequencing operation began in 2017, and served to compile a catalogue of more than 59 000 genes from the world’s most widely cultivated vanilla species. Quentin Piet, a genetics PhD student at CIRAD and the University of La Réunion and lead author of the study, points out that: “Only a third of the Vanilla planifolia genome had been sequenced until now. Our work served to cover 83% of the genome and to determine gene structure in much greater detail. For instance, we now know that vanilla has sixteen pairs of chromosomes, whereas previous studies had hesitated between fourteen and sixteen”. 

The vanilla genome has inherited a very specific genetic characteristic from plants in the orchid family: partial endoreplication, which makes it difficult to sequence. “The cells of living beings often have nuclei containing several copies of their genomic DNA”, Quentin Piet explains. “In most of the beings that exhibit the phenomenon, DNA replication is total. However, in vanilla, only part of the genome is replicated in the nucleus, up to 64 times, while the rest stays as it is. In Vanilla planifolia, the non-replicated part accounts for 72% of its DNA! That part of the genome therefore finds itself in vastly in the minority within the genome, and is difficult to access with sequencing tools.” 

To at least partly overcome that difficulty, the scientists used tissues rich in nuclei whose DNA was not significantly replicated, in other words vanilla plant nodes. The plants used were propagated in vitro, and came from the vanilla collection held by CIRAD’s Vatel Biological Resource Centre (BRC). 

This unique collection was set up in 2004, and contains more than 500 vanilla varieties (500 accessions) and 250 specimens from some thirty vanilla species representative of central and Latin America, Africa and Asia. For Carine Charron, a geneticist with CIRAD, co-author of the study and manager of the collection, it has two aims: “to play a part in conserving that biodiversity, and also to be able to learn something from it, either for scientific purposes or in order to make it available to producers”.

To this end, the results of the sequencing operation have been made public and are available via the Vanilla Genome Hub online platform.

Towards new vanilla varieties?

A better understanding of the genome is vital for varietal improvement of a crop whose diversity is currently very limited, notably via the identification of genes that could benefit the sector, for instance those coding for the synthesis of vanillin, a major vanilla flavour component, or for disease resistance. 

Breeding a new vanilla variety from seed takes between seven and eight years per generation”, says Michel Grisoni, a CIRAD researcher and co-author of the study. “We are talking of a long-term process before we can offer producers new varieties. To save time, we need to choose parents with the right genes and rapidly breed the most promising progenies. This is why it is vital to know the genome inside out”.

In particular, CIRAD and its partners are working to develop a vanilla variety with greater resistance to stem rot, a worldwide disease caused by a soil fungus that can kill up to 67% of the plants in a plantation.

Being able to use the scientific progress made in this research to develop more vigorous, resistant and productive vanilla varieties would really benefit our industry, as it could have a significant impact on the quality and price of the pods produced”, say Laurent Bourgois (Eurovanille) and Joseph Zucca (V. MANE FILS).

Vanilla planifolia, the vanilla species most commonly grown worldwide

There are some 120 vanilla species worldwide, of which around twenty produce fruits with aromatic properties. Currently, almost 98% of the vanilla sold in the world comes from a single species, Vanilla planifolia. Vanilla tahitensis, the Tahitian vanilla makes up the remaining 2%.

France is the world’s third leading vanilla importing country, behind the USA and Canada, while some 80% of global output comes from Madagascar.


Piet Q., Droc G., Marande W., Sarah G., Bocs S., Klopp C., Bourge M., Siljak-Yakovlev S., Bouchez O., Lopez-Roques C., Lepers-Andrzejewski S., Bourgois L., Zucca J., Dron M., Besse P., Grisoni M., Jourda C., and Charron C. 2022. A chromosome-level, haplotype-phased genome assembly for Vanilla planifolia highlights that partial endoreplication challenges accurate whole genome assembly. Plant Communications. DOI: 10.1016/j.xplc.2022.100330



Learn more


Oak genomics proves its worth

PRESS RELEASE - Some 18 months after the full pedunculate oak genome sequence was published by a French consortium led by INRAE and CEA, some initial results based on this genomic resource have been written up in a series of articles in 16 April 2020 issue of the New Phytologist. These new results help clarify the oak's evolution, from the deep roots of its diversification through the more recent evolution of the European white oaks, and identify key genes involved in its adaptation to certain environments or resistance to pathogens.

17 April 2020


The genome of the pea assembled for the first time

PRESS RELEASE - An international team* led by researchers from INRA and CEA succeeded in assembling the first sequence of the pea genome. This study, published on September 2, 2019 in Nature Genetics, will, in addition to increasing knowledge of this genome compared to that of other legumes, help to improve traits of interest for peas, such as disease resistance, regularity of yield and nutritional value.

10 December 2019


Truffle genomes unlock secret of how its aromas are made

PRESS RELEASE - An international consortium coordinated by INRA and including the Joint Genome Institute (JGI), the CEA-Genoscope, the University of Turin, Université de Lorraine and the CNRS has sequenced the genomes of several prized species of truffle, including the Alba white truffle, the summer or Burgundy truffle and the desert truffle. This breakthrough provides new insight: not only into the ecologically important role of tree/fungi symbiosis, but most importantly into the mechanisms involved in truffle growth and the creation of their famous odours. The consortium’s findings appear in the 12 November 2018 edition of Nature Ecology and Evolution.

09 February 2020