Genome sequence of the palaeopolyploid soybean

Schmutz, J. and Cannon, S.B. and Schlueter, J. and et al, . (2010) Genome sequence of the palaeopolyploid soybean. Nature, 463 (8670). pp. 178-183.

[img] PDF - Published Version
Restricted to ICRISAT researchers only


Soybean (Glycine max) is one of the most important crop plants for seed protein and oil content, and for its capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms. We sequenced the 1.1-gigabase genome by a whole-genome shotgun approach and integrated it with physical and high-density genetic maps to create a chromosome-scale draft sequence assembly. We predict 46,430 protein-coding genes, 70% more than Arabidopsis and similar to the poplar genome which, like soybean, is an ancient polyploid (palaeopolyploid). About 78% of the predicted genes occur in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. Genome duplications occurred at approximately 59 and 13 million years ago, resulting in a highly duplicated genome with nearly 75% of the genes present in multiple copies. The two duplication events were followed by gene diversification and loss, and numerous chromosome rearrangements. An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.

Item Type: Article
Uncontrolled Keywords: SNNigam Collection
Author Affiliation: Hudson Alpha Genome Sequencing Center, 601 Genome Way, Huntsville, Alabama 35806, USA.
Subjects: Crop Improvement
Divisions: Other Crops
Depositing User: Mr B Krishnamurthy
Date Deposited: 25 Apr 2013 14:07
Last Modified: 25 Apr 2013 14:07
Official URL:

Actions (login required)

View Item View Item