Major proliferation of transposable elements shaped the genome of the soybean rust pathogen Phakopsora pachyrhizi

Gupta, Yogesh K.; Marcelino-Guimarães, Francismar C.; Lorrain, Cécile; Farmer, Andrew; Haridas, Sajeet; Ferreira, Everton Geraldo Capote; Lopes-Caitar, Valéria S.; Oliveira, Liliane Santana; Morin, Emmanuelle; Widdison, Stephanie; Cameron, Connor; Inoue, Yoshihiro; Thor, Kathrin; Robinson, Kelly; Drula, Elodie; Henrissat, Bernard; LaButti, Kurt; Bini, Aline Mara Rudsit; Paget, Eric; Singan, Vasanth; Daum, Christopher; Dorme, Cécile; van Hoek, Milan; Janssen, Antoine; Chandat, Lucie; Tarriotte, Yannick; Richardson, Jake; Melo, Bernardo do Vale Araújo; Wittenberg, Alexander H. J.; Schneiders, Harrie; Peyrard, Stephane; Zanardo, Larissa Goulart; Holtman, Valéria Cristina; Coulombier-Chauvel, Flavie; Link, Tobias I.; Balmer, Dirk; Müller, André N.; Kind, Sabine; Bohnert, Stefan; Wirtz, Louisa; Chen, Cindy; Yan, Mi; Ng, Vivian; Gautier, Pierrick; Meyer, Maurício Conrado; Voegele, Ralf Thomas; Liu, Qingli; Grigoriev, Igor V.; Conrath, Uwe; Brommonschenkel, Sérgio H.; Loehrer, Marco; Schaffrath, Ulrich; Sirven, Catherine; Scalliet, Gabriel; Duplessis, Sébastien; van Esse, H. Peter

doi:10.1038/s41467-023-37551-4

Download PDF

Article
Open access
Published: 01 April 2023

Major proliferation of transposable elements shaped the genome of the soybean rust pathogen Phakopsora pachyrhizi

Nature Communications volume 14, Article number: 1835 (2023) Cite this article

7768 Accesses
7 Citations
108 Altmetric
Metrics details

Subjects

Abstract

With >7000 species the order of rust fungi has a disproportionately large impact on agriculture, horticulture, forestry and foreign ecosystems. The infectious spores are typically dikaryotic, a feature unique to fungi in which two haploid nuclei reside in the same cell. A key example is Phakopsora pachyrhizi, the causal agent of Asian soybean rust disease, one of the world’s most economically damaging agricultural diseases. Despite P. pachyrhizi’s impact, the exceptional size and complexity of its genome prevented generation of an accurate genome assembly. Here, we sequence three independent P. pachyrhizi genomes and uncover a genome up to 1.25 Gb comprising two haplotypes with a transposable element (TE) content of ~93%. We study the incursion and dominant impact of these TEs on the genome and show how they have a key impact on various processes such as host range adaptation, stress responses and genetic plasticity.

Nuclear exchange generates population diversity in the wheat leaf rust pathogen Puccinia triticina

Article Open access 26 October 2023

A thousand-genome panel retraces the global spread and adaptation of a major fungal crop pathogen

Article Open access 24 February 2023

The Green Revolution shaped the population structure of the rice pathogen Xanthomonas oryzae pv. oryzae

Article Open access 30 October 2019

Introduction

Rust fungi are an order of >7000 species of highly specialized plant pathogens with a disproportionately large impact on agriculture, horticulture, forestry, and foreign ecosystems¹. The infectious spores are typically dikaryotic, a feature unique to fungi in which two haploid nuclei reside in the same cell. Asian soybean rust caused by the obligate biotrophic fungus Phakopsora pachyrhizi, is a prime example of the damage that can be caused by rust fungi. It is a critical challenge for food security and one of the most damaging plant pathogens of this century (Fig. 1a)². The disease is ubiquitously present in the soybean growing areas of Latin America, where 210 million metric tons of soybean are projected to be produced in 2022/23 (https://apps.fas.usda.gov/psdonline/app/index.html), and on average representing a gross production value of U.S. $ 115 billion per season (https://www.ers.usda.gov/data-products/season-average-price-forecasts.aspx). A low incidence of this devastating disease (0.05%) can already affect yields and, if not managed properly, yield losses are reported of up to 80%^3,4. Chemical control in Brazil to manage the disease started in the 2002/03 growing season⁴. In the following season, ~20 million hectares of soybeans were sprayed with fungicides to control this disease (Fig. 1a)^4,5. The cost of managing P. pachyrhizi exceeds $2 billion USD per season in Brazil alone.

**Fig. 1: Impact of *P. pachyrhizi* incidence in a soybean field, comparative genome assembly size, and TE content.**

The pathogen is highly adaptive and individually deployed resistance genes have been rapidly overcome when respective cultivars have been released^6,7. Similarly, the fungal tolerance to the main classes of site-specific fungicides is increasing, making chemical control less effective^8,9,10. Another remarkable feature for an obligate biotrophic pathogen is its wide host range, encompassing 153 species of legumes within 54 genera to date^11,12,13. Epidemiologically, this is relevant as it allows the pathogen to maintain itself in the absence of soybean on other legume hosts, such as overwintering on the invasive weed Kudzu in the United States¹⁴. Despite the importance of the pathogen, not much was known about its genetic makeup as the large genome size (an estimated 1 Gbp), coupled to a high repeat content, high levels of heterozygosity and the dikaryotic nature of the infectious urediospores of the fungus have hampered whole genome assembly efforts¹⁵.

In this work, we provide reference quality assemblies and genome annotations of three P. pachyrhizi isolates. We uncover a genome with a total assembly size of up to 1.25 Gb. Approximately, 93% of the genome consists of TEs, of which two superfamilies make up 80% of the TE content. The three P. pachyrhizi isolates collected from South America represent a single clonal lineage with high levels of heterozygosity. Studying the TEs in detail, we demonstrate that the expansion of TEs within the genome happened over the last 10 My and accelerated over the last 3 My, and did so in several bursts. Although TEs are tightly controlled during sporulation and appressoria formation, we can see a clear relaxation of repression during the in planta life stages of the pathogen. Due to the nested TEs, it is not possible at present to correlate specific TEs to specific expanded gene families. However, we can see that the P. pachyrhizi genome is expanded in genes related to amino acid metabolism and energy production, which may represent key lifestyle adaptations. Overall, our data unveil that TEs that started their proliferation during the radiation of the Leguminosae play a prominent role in the P. pachyrhizi’s genome and may have a key impact on a variety of processes such as host range adaptation, stress responses and plasticity of the genome. The high-quality genome assembly and transcriptome data presented here are a key resource for the community. It represents a critical step for further in-depth studies of this pathogen to develop new methods of control and to better understand the molecular dialogue between P. pachyrhizi and its agriculturally relevant host, Soybean.

Results and discussion

Two superfamilies of transposons dominate the P. pachyrhizi genome

The high repeat content and dikaryotic nature of the P. pachryrhizi genome poses challenges to genome assembly methods¹⁵. Recent improvements in sequencing technology and assembly methods have provided contiguous genome assemblies for several rust fungi^{16,17,18,19,20,21}. Here, we have expanded the effort and provided reference-level genome assemblies of three P. pachyrhizi isolates (K8108, MT2006, and UFV02) using long-read sequencing technologies. All three isolates were collected from different regions of South America. We have used PacBio sequencing for the K8108 and MT2006 isolates and Oxford Nanopore for the UFV02 isolate to generate three high-quality genomes (Supplementary Fig. 1). Due to longer read lengths from Oxford nanopore, the UFV02 assembly is more contiguous compared to K8108 and MT2006 and is used as a reference in the current study (Table 1). The total genome assembly size of up to 1.25 Gb comprising two haplotypes, makes the P. pachyrhizi genome one of the largest fungal genomes sequenced to date (Fig. 1b). Analysis of the TE content in the P. pachyrhizi genome indicates ~93% of the genome consist of repetitive elements, one of the highest TE contents reported for any organism to date (Fig. 1b and Supplementary Data 1). This high TE content may represent a key strategy to increase genetic variation in P. pachyrhizi²². The largest class of TEs are class 1 retrotransposons, that account for 54.0% of the genome. The class II DNA transposons content is 34.0% (Supplementary Data 1 and 2). This high percentage of class II DNA transposons appear to be present in three lineages of rust fungi, the Melampsoraceae (Melampsora larici-populina), Pucciniaceae (Puccinia graminis f. sp. triciti) and Phakopsoraceae (P. pachyrhizi) (Fig. 1b). The recently assembled large genome (haploid genome size, 1 Gb) of the rust fungus Austropuccinia psidii in the family Sphaerophragmiaceae, however seems to mainly have expanded in retrotransposons²³. This illustrates that TEs exhibit different evolutionary tracjectories in different rust taxonomical families. Over 80% of the P. pachyrhizi genome is comprised of only two superfamilies of TEs: long terminal repeat (LTR) and terminal inverted repeat (TIR) (Fig. 1b, and Supplementary Data 2). The largest single family of TE are the Gypsy retrotransposons comprising 43% of the entire genome (Fig. 2a, and Supplementary Data 2).

Table 1 P. pachyrhizi genome assembly metrics

Full size table

**Fig. 2: Transposable element superfamilies in the *P. pachyrhizi* genomes, K8108, MT2006 and UFV02.**

To understand the evolutionary dynamics of the different TE families present in the P. pachyrhizi genome, we compared the sequence similarities of TEs with their consensus sequences in the three genomes, which ranges from 65 to 100% sequence identity (Supplementary Fig. 2). Based on the concept of burst and decay evolution of TEs, the extent of sequence similarity between each TE copy to its cognate consensus is proportional to the divergence time of copies²⁴. This approach allows us to compare within-genome relative insertion ages of TE insertions using consensus of TE families, a proxy for the ancestral sequence. TEs were categorised as (1) conserved TEs (copies with more than 95% identity), (2) intermediate TEs (copies with 85 to 95% identity) and (3) divergent TEs (copies with less than 85% identity)²⁴. The average TE composition of the three isolates is 13.2−18.3% conserved, 29.4–29.9% intermediate and represent 51.7−57.3% divergent (Supplementary Fig. 3, and Supplementary Data 3-5). The average Gypsy retrotransposon composition of the three isolates is 16.5–20.7% conserved, 30.4–31.03% intermediate, and 48.8–52.5% divergent (Supplementary Fig. 3, and Supplementary Data 3-5). Similarly, average TIR composition of the three isolates is 12.2–18.4% conserved, 29.0–29.7 % intermediate and 51.8–57.8% divergent (Supplementary Fig. 3, and Supplementary Data 3-5). This suggests that i) multiple waves of TE proliferation have occurred during the history of the species, ii) the invasion of the two major TE families into the P. pachyrhizi genome is not a recent event, and iii) the presence of conserved TEs indicates ongoing bursts of expansion of TEs in the P. pachyrhizi genome. Therefore, the proportion and distribution of TEs indicate that different categories of TEs differentially shaped the genomic landscape of P. pachyrhizi during different times in its evolutionary history (Fig. 2b).

We set out to date the Gypsy and Copia TEs in P. pachyrhizi, using a TE insertion age estimation^25,26. We observe that most retrotransposon insertions were dated less than 100 million years ago (Mya). We, therefore, decided to perform a more granulated study taking 1.0 million year intervals over this period. We approximated the start of TEs expansion at around 65 Mya after which the TE content gradually accumulates (Fig. 2c). We can see a more rapid expansion of TEs in the last 10 Mya, indeed over 40% of the Gypsy and Copia TEs in the genome seem to have arisen between today and 5 Mya (Fig. 2c). The climatic oscillations during the past 3 Myr are well known as the period of differentiation for multiple species²⁷. Therefore, the genome expansion through waves of TE proliferation in P. pachyrhizi correlates with periods in which other species, including their host species the legumes started their main radiation, and differentiation due to external stressors^24,25,26,27. This suggests that TEs either play an important role in generating the variation needed to adaptation of various stressors and/or proliferation of TEs is triggered by stressful events. Although a clear causal and or mechanistical role of TEs in adaptation, like in many other systems is still lacking^28,29, it is clear TEs have had a major impact on the architecture of the P. pachyrhizi genome.

A subset of TEs is highly expressed during early in planta stages of infection

To build a high-quality resource that can facilitate future in-depth analyses, within the consortium, we combined several robust, independently generated RNAseq datasets from all three isolates that include major soybean infection-stages and in vitro germination (Fig. 3a, b). Altogether, eleven different stages are captured with seven having an overlap of two or more isolates, representing a total of 72 different transcriptome data sets (Fig. 3c). These data were used to support the prediction of gene models with the de novo annotation pipeline of JGI MycoCosm³⁰. Those proteins secreted by the pathogen that impact the outcome of an interaction between host and pathogen are called effectors and are of particular interest^31,32. We used a variety of complementary methods to identify 2,183, 2,027, and 2,125 secreted proteins (the secretome) encoded within the genome assembly of K8108, MT2006 and UFV02, respectively^{33,34,35,36,37} (Supplementary Data 6-8). This is a two-fold improvement when compared to previous transcriptomic studies^{38,39,40,41,42}. In P. pachyrhizi, depending on methodology, 36.73 − 42.30% of these secreted proteins are predicted to be effectors (Supplementary Data 6-8). We identified 437 common secreted proteins (shared by at least two isolates) that are differentially expressed at least in one time-point in planta, of which 246 are predicted to be effectors providing a robust set of proteins to investigate in follow-up functional studies (Supplementary Fig. 4, and Supplementary Data 9).

**Fig. 3: Infection cycle of *P. pachyrhizi* and gene expression on the critical infection stages.**

We performed expression analysis on the annotated TEs and observed that 6.66−11.65% of TEs are expressed in the three isolates (Supplementary Data 10 and 11). We compared the TE expression from different infection stages versus in vitro stages (Fig. 2a, and Supplementary Data 12-14) and used the in planta RNAseq data from the isolates K8108 and UFV02. A relatively small subset of TEs (0.03 – 0.25%) are expressed during the early infection stages between 10 to 72 hours post-inoculation (HPI) (Supplementary Fig. 5 and 6, and Supplementary Data 12-14). Remarkably, for this subset, we observed a 20 to 70-fold increase in the expression when compared to the spore and germinated-spore stages, with the expression levels reaching a peak at 24 HPI (Supplementary Fig. 5 and 6). To estimate the impact of the insertion age of this in planta-induced TE subset, we performed expression analysis on the conserved, intermediate, and divergent TEs. Although there is a slight overrepresentation of the conserved TEs, several intermediate TEs and divergent TEs are also highly expressed during 10–24 HPI (Supplementary Fig. 7).

To compare the expression profile of this subset of TEs to the predicted effectors, we used the 246 core effectors and compared these with 25 known and constitutively expressed housekeeping genes across three isolates. We found that both TE and effector expression peaked at 24 HPI (Fig. 3d). While expression of effectors remained higher than the 25 selected housekeeping genes during infection, expression of TEs started to be repressed after 72 HPI (Fig. 3d). This observation would corroborate the hypothesis of stress-driven TE de-repression observed in other patho-systems^43,44,45. However, it also shows that in P. pachyrhizi only a small percentage of the TEs are highly expressed during early infection stages.

In several different phytopathogenic species a distinct genomic organization or compartmentalization can be observed for effector proteins. For example, the bipartite genome architecture of Phytophthora infestans and Leptospheria maculans in which gene sparse, repeat-rich compartments allow rapid adaptive evolution of effector genes⁴⁶. Other fungi display other organizations such as virulence chromosomes^47,48 or lineage-specific regions^49,50. However, when interrogating both genomic location and genomic distribution of the predicted candidate effector genes in P. pachyrhizi, we could not detect an analogous type of organization (Supplementary Fig. 8a-c). In addition, we did not observe evidence of the specific association between TE superfamilies and secreted protein genes (Supplementary Fig. 9), as has been observed in other fungal species^{46,48,51,52,53}. Additional analyses comparing the distance between BUSCO (Benchmarking Universal Single-Copy Orthologue) genes and genes encoding secreted proteins also showed no specific association (Supplementary Fig. 8d). Therefore, despite the large genome size and high TE content of P. pachyrhizi, its genome appears to be organized in a similar fashion to other rust fungi with smaller genome sizes^17,18,23,54. The lack of detection of a specific association between TE and genes in P. pachyrhizi may be due to the level of TE invasion with 93% TE observed for this genome.

P. pachyrhizi in South America is a single lineage with high levels of heterozygosity

Rust fungi are dikaryotic, therefore variation can exist both between isolates and between the two nuclei present in each cell of a single isolate. Long-term asexual reproduction is predicted to promote divergence between alleles of loci⁵⁵, which in principle can increase indefinitely⁵⁶. Some rusts can reproduce both sexually and asexually leading to a mixed clonal/sexual reproduction. In the rust fungus P. striiformis f.sp. tritici, asexual lineages showed a higher degree of heterozygosity between two haploid nuclei when compared to the sexual lineages⁵⁷. In the case of P. pachyrhizi, there are clear indications that the population is propagating asexually in South America based on early studies using simple-sequence repeats (SSR) and internal transcribed spacer (ITS) sequences^58,59. Our data utilizing high coverage raw Illumina data corroborate these earlier studies as we observed high levels of heterozygosity; 2.47% for UFV02, 1.61% for K8108 and 1.43% in MT2006, respectively (Supplementary Fig. 1a). This was further corroborated by mapping the Illumina reads to the genome assembly. In total, 283.355, 359.939, and 458.719 SNPs were identified from K8108, MT2006 and UFV02, respectively. The average heterozygous SNPs across the genome is 2.97 SNPs per Kb in UFV02 compared to 2.58 and 3.34 SNPs per Kb in K8108 and MT2006, respectively (Supplementary Data 15).

We subsequently studied the structural variation (insertions and deletions, repeat expansion and contractions, tandem expansion and contractions) as well as the haplotype variation between the three isolates (Supplementary Data 16)⁶⁰. Remarkably, the structural variation between the haplotypes of UFV02 is 163.3 Mb, while the variation between the complete genomes of the three isolates is 8 to 13 Mb (Fig. 4a). For example, the total number of repeat expansion and contractions is 7 and 16 times higher between the haplotypes than the variation between the isolates (Fig. 4a). To look at this inter-haplotype variation in more detail, we selected contigs larger than 1 Mb to study large syntenic blocks between isolates and haplotigs. The largest of these contigs, the 1.3 Mb contig 148 from UFV02 has synteny with contig 5809 from K8108, and contigs 220 and 362 from MT2006 (Fig. 4c-e), but not with its haplotig genome counterpart within UFV02, which indicates lack of recombination between haplotypes. This corroborates earlier studies that in South America P. pachyrhizi reproduces only asexually⁶¹.

**Fig. 4: Structural variation between *P. pachyrhizi* haplotypes is higher than variation between isolates.**

Collection of the monopustule isolates K8108, MT2006, UFV02 is separated in both time and geographical location (i.e. K8108 from Colonia, Uruguay, 2015; MT2006 from Mato Grosso do Sul, Brazil, 2006; UFV02 from Minas Gerais, Brazil, 2006). To study SNP variation, we mapped the Illumina data of all three isolates to the reference assembly of UFV02. Given the high level of heterozygosity and TE content, we focused our analysis on the now annotated exome space (Supplementary Data 15a). After removal of SNPs shared between either all three or two of the isolates, we identified only three non-synonymous mutations unique for K8180, eight non-synonymous mutations for MT2006 and five unique non-synonymous mutations for UFV02. For these 16 predicted genes, we found evidence for expression in our transcriptome analyses for ten genes. This total number of non-synonymous mutations within exons between the isolates may appear counterintuitive given the time and space differences between collection of these isolates. Nonetheless, it is likely that other single pustule isolates identified from another field would yield a similar number of mutations. Approximately 6 million spores may be produced per plant in a single day resulting in 3 × 10¹² spores per hectare per day⁶². Therefore, the ability to generate variation through mutation cannot be underestimated. We observed an enrichment of mutations in the upstream and downstream regions of protein-coding genes (Supplementary Data 15b), similar to other rust fungi^63,64,65. In contrast to the low number of mutated exons, the number of uniquely expressed genes between the three isolates is relatively high when compared to the core set of differentially expressed genes (Supplementary Data 17-19). This may reflect a mechanism in which transcriptional variation is generated via modification of promotor regions which would have the advantage that coding sequences that are not beneficial in a particular situation can be “shelved” for later use. This would result in a set of differentially transcribed genes for different isolates, and a core set of genes that are transcribed in each isolate.

The P. pachyrhizi genome is expanded in genes related to amino acid metabolism and energy production

We subsequently set out to identify expanding and contracting gene families within P. pachyrhizi. To this end, a phylogenetic tree of 17 selected fungal species (Supplementary Data 20a) was built using 408 conserved orthologous markers. We estimated that P. pachyrhizi diverged from its most recent common ancestor 123.2−145.3 million years ago (Supplementary Fig. 10 and Supplementary Data 20b), a time frame that coincides with the evolution of the Pucciniales^66,67. We derived gene families including orthologues and paralogues from a diverse set of plant-interacting fungi and identified gene gains and losses (i.e. family expansions and contractions) using computational analysis of gene family evolution (CAFÉ) (Supplementary Data 20a)⁶⁸. Genomes of rust fungi including P. pachyrhizi underwent more extensive gene losses than gains, as would be anticipated for obligate biotrophic parasites (Supplementary Fig. 11). In total, we identified 2,366 contracted families and 833 expanding families within UFV02, including 792 and 669 families with PFAM domains, respectively. The most striking and significant contraction in the P. pachyrhizi genome is related to DEAH helicase which is involved in many cellular processes, e.g., RNA metabolism and ribosome biogenesis (Supplementary Data 21). In contrast, significant expansions in 12 gene families were found, including genes encoding glutamate synthase, GMC (glucose-methanol-choline) oxidoreductase and CHROMO (CHRromatin Organisation MOdifier) domain-containing proteins (Supplementary Data 22). Glutamate synthase plays a vital role in nitrogen metabolism, and its ortholog in the ascomycete Magnaporthe oryzae MoGLT1 is required for conidiation and complete virulence on rice⁶⁹. GMC oxidoreductase exhibits important auxiliary activity 3 (AA3_2) according to the Carbohydrate-Active enzymes (CAZy) database⁷⁰ and is required for the induction of asexual development in Aspergillus nidulans⁷¹. An extensive approach was used for the global annotation of CAZyme genes in P. pachyrhizi genomes, and after comparison with other fungal genomes, we also found clear expansions in glycoside hydrolases (GH) family 18 and glycosyltransferases (GT) family 1 (Supplementary Data 23). GH18 chitinases are required for fungal cell wall degradation and remodelling, as well as multiple other physiological processes, including nutrient uptake and pathogenicity^72,73.

The Phakopsoraceae to which P. pachyrhizi belongs represents a new family branch in the order Pucciniales¹. With three P. pachyrhizi genome annotation replicates available, next to the above CAFÉ-analysis, we can directly track gene family expansions and contractions in comparison to genomes previously sequenced. We, therefore, compared P. pachyrhizi to the taxonomically related families Coleosporiaceae, Melampsoraceae and Pucciniaceae, which in turn may reveal unique lifestyle adaptations (Table 2).

Table 2 Expansion of gene families in the P. pachyrhizi genome

Full size table

The largest uniquely expanded gene family (531-608 members) in P. pachyrhizi comprises sequences containing the Piwi (P-element Induced Wimpy testes in Drosophila) domain (Table 2). Typically, the Piwi domain is found in the Argonaute (AGO) complex, where its function is to cleave ssRNA when guided by dsRNA⁷⁴. Interestingly, classes of longer-than-average miRNAs known as Piwi-interacting RNAs (piRNAs) that are 26-31 nucleotides long are known in animal systems. In Drosophila, these piRNAs function in nuclear RNA silencing, where they associate specifically with repeat-associated small interfering RNA (rasiRNAs) that originate from TEs⁷⁵. As in other fungal genomes, the canonical genes coding for large AGO proteins with canonical Argonaute, PAZ and Piwi domains can be observed in the genome annotation of the three P. pachyrhizi isolates. The hundreds of expanded predicted Piwi genes consist of short sequences of less than 500 nt containing only a partial Piwi domain aligning with the C-terminal part of the Piwi domain in the AGO protein. Some of these genes are pseudogenes marked by stop codons or encoding truncated protein forms, while others exhibit a partial Piwi domain starting with a methionine and eventually exhibiting a strong prediction for an N-terminal signal peptide. These expanded short Piwi genes are surrounded by TEs, several hundreds of which, but not all, are found in close proximity to specific TE consensus identified by the REPET analysis in the three P. pachyrhizi isolates (e.g. Gypsy, CACTA and TIR; Supplementary Fig. 12). However, no systematic and significant association could be made due to the numerous nested TEs present within the genome⁷⁶. Moreover, none of the expanded short Piwi domain genes are expressed in the conditions we tested. However, in many systems, Piwis and piRNAs play crucial roles during specific developmental stages where they influence epigenetic, germ cell, stem cell, transposon silencing, and translational regulation⁷⁷. Finally, the domain present in these short Piwi genes is partial, and we do not know whether they retain any RNase activity. Therefore, we cannot validate at this stage the function of this family, which warrants further study and attention as it may represent either a new type of TE-associated regulator within P. pachyrhizi, or an expansion of a control mechanism to deal with this highly repetitive genome.

Several families related to amino acid metabolism have expanded greatly when compared to the respective families in other rust fungi, most notably Asparagine synthase (KOG0573), which has ~75 copies in P. pachyrhizi compared to two copies in Pucciniaceae and one copy in Melampsoraceae (Table 2). Similarly, expanded gene families can be observed in citrate synthase (KOG2617), malate synthase (KOG1261), NAD-dependent malate dehydrogenase (KOG1494). These enzymes are involved in energy production and conversion via the citrate cycle required to produce certain amino acids and the reducing agent NADH (Table 2). Next to the molecular dialogue with effector proteins, plant-pathogen interactions are a “tug-of-war” of resources between the host and the pathogen⁷⁸. A key resource to secure in this process is nitrogen, a raw material needed to produce proteins. Therefore, the expansion in amino acid metabolism may reflect an adaptation to become more effective at securing this resource. Alternatively, the expanded categories also may reflect the metabolic flexibility needed to facilitate the broad host range of P. pachyrhizi, which to date comprises 153 leguminous species in 56 genera¹³.

Associations with TEs are often a sign for adaptive evolution as they facilitate the genetic leaps required for rapid phenotypic diversification^44,79,80,81. Gene duplication and gene family expansion can be directly linked to transposition activity due to imprecise excision and re-insertions and carry other genetic sequences⁸². Transposition-independent mechanisms may also promote structural rearrangements leading to gene family expansions through the recombination of homologous regions between TE copies. The TEs in these expansions may potentially be inactive⁸². We, therefore, investigated whether the expansion in amino acid metabolism could reflect a more recent adaptation by studying the TEs in these genomic regions. Furthermore, as described above, a distinction can be made between more recent bursts of TE activity (high conservation of the TEs) and older TE bursts leading to degeneration of the TE sequence consensus⁸³. However, despite the presence of several copies of specific TE subfamilies (i.e. related to the same annotated TE consensus) in the vicinity of the surveyed expanded families such as amino acid metabolism, CAZymes and transporter related genes (Supplementary Fig. 13 and 14), no significant enrichment could be observed for any particular TE when compared to the overall TE content of the genome. This may reflect the challenge of making such clear associations due to the continuous transposition activity, which results in a high plasticity of the genomic landscape and a highly nested TE structure. Alternatively, it may suggest a more ancient origin of these expansions that have subsequently been masked by repetitive episodes of relaxed TE expression (Supplementary Fig. 15 and 16).

Methods

Fungal strain and propagation

P. pachyrhizi isolates, K8108, MT2006 and UFV02⁸⁴ are single uredosoral isolates collected from Uruguay (Colonia in 2015), Brazil (Mato Grosso do Sul in 2006) and Brazil (Minas Gerais in 2006), respectively. The isolates were propagated on susceptible soybean cultivars Abelina, Thorne, Toliman and Williams 82 by spraying a suspension of urediospores 1 mg ml⁻¹ in 0.01 % (vol/vol) Tween-20 in distilled water onto 21-day-old soybean plants followed by 18 h incubation in an incubation chamber at saturated humidity, and at 22 °C in the dark. Infected plants were kept at 22 °C, 16-h day/8-h night cycle and 300 µmol s⁻¹ m⁻² light. After 14 DPI (days post-inoculation), the pustules were formed, and the urediospores were harvested using a Cyclone surface sampler (Burkard Manufacturing Co. Ltd.) and stored at −80 °C. The genomic DNA extraction methods are explained in Supplementary methods.

Genomic DNA extraction and genome sequencing

The high molecular weight (HMW) genomic-DNA was extracted using a carboxyl-modified magnetic bead protocol⁸⁵ for K8108, a CTAB-based extraction for MT2006⁸⁶, and a modified CTAB protocol for UFV02⁸⁷.

For K8108, a 20-kb PacBio SMRTbell library was prepared by Genewiz (South Plainfield, NJ) with 15-kb Blue Pippin size selection being performed prior to sequencing on a PacBio Sequel system (Pacific Biosciences, Menlo Park, CA). The K8108 PacBio Sequel genomic reads yielding 69 Gbp of sequence data were error corrected using MECAT⁸⁸; following parameter optimization for contiguity and completeness, the longest corrected reads yielding 50x coverage were assembled with MECAT’s mecat2canu adaptation of the Canu assembly workflow⁸⁹, using an estimated genome size of 500 Mbp and an estimated residual error rate of 0.02. The resulting assembly had further base pair-level error correction performed using the Arrow polishing tool from PacBio SMRTTools v5.1.0.26412⁹⁰.

MT2006 genome was sequenced using the Pacific Biosciences platform. The DNA sheared to >10 kb using Covaris g-Tubes was treated with exonuclease to remove single-stranded ends and DNA damage repair mix, followed by end repair and ligation of blunt adapters using SMRTbell Template Prep Kit 1.0 (Pacific Biosciences). The library was purified with AMPure PB beads and size selected with BluePippin (Sage Science) at >6 kb cutoff size. PacBio Sequencing primer was then annealed to the SMRTbell template library, and sequencing polymerase was bound to them using Sequel Binding kit 2.0. The prepared SMRTbell template libraries were then sequenced on a Pacific Biosystem’s Sequel sequencer using v2 sequencing primer, 1 M v2 SMRT cells, and Version 2.0 sequencing chemistry with 1 × 360 and 1 × 600 sequencing movie run times. The Phakopsora pachyrhizi MG2006 v1.0 genome was sequenced with PacBio, assembled with MECAT, polished with arrow, and annotated with the JGI Annotation Pipeline.

For UFV02, the PromethION platform of Oxford nanopore technology (ONT) (Oxford, UK) was used for long-read sequencing at KeyGene N.V. (Wageningen, The Netherlands). The libraries with long DNA fragments were constructed and sequenced on the PromethION platform. The raw sequencing data of 110 Gbp was generated and was base-called using ONT Albacore v2.1 available at https://community.nanoporetech.com. The UFV02 genome assembly, the longest 15, 20, 25, 30, 34, 40 and 56x nanopore reads were assembled using the Minimap2 and Miniasm pipeline⁹¹. To improve the consensus, error correction was performed three times with Racon using all the nanopore reads⁹². The resulting assembly was polished with 50x Illumina PCR-free 150 bp paired-end reads mapped with bwa⁹³ and Pilon⁹⁴, and repeated three times. We assessed the BUSCO scores after each step to compare the improvement in the assemblies.

Genome annotation

The gene predictions and annotations were performed in the P. pachyrhizi genomes K8108, MT2006 and UFV02 in parallel using the JGI Annotation Pipeline³⁰. TE masking was done during the JGI procedure, which detects, and masks repeats and TEs. Later, the extensive TE classification performed with REPET was imported and visualized as a supplementary track onto the genome portals. RNAseq data from each isolate was used as intrinsic support information for the gene callers from the JGI pipeline. The gene prediction procedure identifies a series of gene models at each gene locus and proposes the best gene model to define a filtered gene catalogue. Translated proteins deduced from gene models are further used for functional annotation according to international reference databases. All the annotation information is collected into an open public JGI genome portal in the MycoCosm (https://mycocosm.jgi.doe.gov/Phakopsora) with dedicated tools for community-based annotation^30,95. In total, 18,216, 19,618 and 22,467 gene models were predicted from K8108, MT2006 and UFV02, respectively (Supplementary Data 24); of which 10,492, 10,266 and 9,987 genes were functionally annotated. We have performed differential expression analyses using the germinated spores as a reference point in each of the three isolates (Supplementary Fig. 17, and Supplementary Data 17-19). A total of 3,608 common differentially expressed genes (DEGs) were identified in at least one condition shared between two or more isolates (Supplementary Fig. 18, and Supplementary Data 25).

Quality assessment of the whole‐genome assemblies

The whole-genome assemblies of P. pachyrhizi were evaluated using two different approaches. First, we used BUSCO version 5.0⁹⁶ to assess the genic content based on near-universal single-copy orthologs with basidiomycetes_odb10 database, including 1764 gene models. Second, K-mer’s from different assemblies were compared using KAT version 2.4.1⁹⁷. Genome heterozygosity was estimated using GenomeScope 2.0⁹⁸.

Insertion age of LTR-retrotransposons

Full-length LTR-retrotransposons were identified from the P. pachyrhizi genomes using LTRharvest with default parameters, and this tool belongs to the GenomeTools genome analysis software v1.6.1⁹⁹. LTRs annotated as Gypsy or Copia were used for molecular dating, and selection was based on a BLASTX against Repbase v20.11¹⁰⁰. 3’ and 5’ LTR sequences were extracted and aligned with mafft v7.471¹⁰¹, and alignments were used to calculate Kimura’s 2 P distances¹⁰². The insertion age was determined using the formula T = K / 2r, with K the distance between the 2 LTRs and r the fungal substitution rate of 1.05 × 10-9 nucleotides per site per year^25,26.

Molecular dating and Phylogenetic analysis

The phylogenetic tree was generated after the alignment of 408 conserved orthologous markers identified from at least 13 out of 17 genomes using PHYling (https://github.com/stajichlab/PHYling_unified). The sequences were aligned and concatenated into a super-alignment with 408 partitions. The phylogenetic tree was built with RAxML-NG (v0.9.0) using a partitioned analysis, and 200 bootstraps replicates. Molecular dating was established with mcmctree from PAML v4.8. Calibration points were extracted to Puccinalies⁶⁷ and Sordariomycetes–Leotiomycetes¹⁰³. The 95% highest posterior density (HPD) values are calibrated to the node.

Sample preparation for RNAseq

For expression analysis, 11 different stages were evaluated, with eight stages having an overlap of two or more isolates. These stages were nominated 1-11, as illustrated in Fig. 3c. For K8108, seven in vitro, one on planta and eight in planta samples, each with three biological replicates, were generated and used to prepare RNA libraries. To get in vitro germ tubes and fungal penetration structures, a polyethylene foil (dm freezer bag, Karlsruhe, Germany) was placed in glass plates and inoculated with a spore suspension (2 mg ml⁻¹). Each biological replicate corresponded to 500 cm² foil and ~4 mg urediospores. The plates were incubated at 22 °C in the dark at saturated humidity for 0.5, 2, 4 or 8 h. After incubation, the spores were collected using a cell scraper. For the appressoria-enriched sample, urediospore concentration was doubled and the plates rinsed with sterile water after 8 h of incubation prior to collection. The material was ground with mortar and pestle in liquid nitrogen. The time 0.5 h was considered as spore (Spore, Psp - stage 1), the 2 h as a germinated spore (Germinated spore, PspG – stage 2), and the 8 h rinsed as appressoria enriched sample in vitro (stage 3). The samples of spores collected after 4 and 8 h were not used for expression analysis. To obtain on planta fungal structures, three-week-old soybean plants (Williams 82) were inoculated as mentioned above. After 8 HPI, liquid latex (semi-transparent low ammonium, Latex-24, Germaringen, Germany) was sprayed (hand spray gun with gas unit, Preval, Bridgeview, USA) until complete leaf coverage. After drying off, latex was removed. It contained the appressoria and spores from the leaf surface but no plant tissue. This sample was considered as enriched in appressoria on plant and is exclusive for K8108 isolate (stage 4). Three middle leaflets of different plants were bulked for each sample and ground in liquid nitrogen using a mortar and pestle. The inoculated leaf samples were harvested at 10, 24, 72 and 192 HPI (stages 5, 6, 8 and 10) for the in planta gene expression studies.

For MT2006, the germ tubes, and appressorium were produced on polyethylene (PE) sheets where urediospores were finely dusted with household sieves held in a double layer of sifting. The PE sheets were then sprayed with water using a chromatography vaporizer and were kept at 20 °C, 95% humidity in the dark. For germ tubes the structures were scratched from the PE sheets after 3 h (stage 2) and for appressoria after 5 h (stage 3). The formation of both germ tubes and appressoria was checked microscopically. The in vitro samples were only used when there were at least 70% germ tubes or appressoria. The structures were dried by vacuum filtration and stored in 2-ml microcentrifuge tubes at −70 °C after freezing in liquid nitrogen. The resting spores came directly from storage at −70 °C (stage 1). For the in planta samples, 21 days old soybean cultivar Thorne was sprayed with a suspension containing 0.01% Tween-20, 0.08% milk-powder and 0.05% urediospores. The inoculated plants were kept, as mentioned previously. The samples were taken using a cork borer (18 mm diameter) at 192 and 288 HPI (stages 10 and 11). Three leaf pieces were collected for each sample (three times and from three different plants) for every time-point, stored in liquid nitrogen and kept at −80 °C.

For UFV02, the spore suspension of 1×10⁶ spores ml⁻¹ concentration was prepared in 0.01% v/v Tween-20. Four weeks old soybean plants were sprayed thoroughly on the abaxial surface of the leaves, and the plants were kept at saturated humidity in the dark for 24 h. After 24 h, plants were kept at 22 °C and 16/8-h light/dark cycle. The leaf samples were collected from non-inoculated plants (0 h) and infection-stages at 12, 24, 36, 72 and 168 HPI (stages 5, 6, 7, 8 and 9). An infection assay was performed in three biological replicates, and three plants were used for each replicate. All the samples were stored in liquid nitrogen after collection and kept at −80 °C for further processing (stage 1). Spores were harvested after 14 days post-inoculation and used for the RNA extraction. The urediospores were germinated in vitro on the water surface in a square petri dish and kept for 6 h at 24 °C (stage 2). The germinated-urediospores were collected in a falcon tube and snap freeze in liquid nitrogen. The samples were freeze-dried and kept at −80 °C until further processing. The un-inoculated plants (0 h) were not used in the expression analysis.

RNA isolation and sequencing

All the samples were ground in liquid nitrogen, and the total RNA was extracted using the Direct-zol RNA Miniprep Plus Kit (ZymoResearch, Freiburg, Germany), the mirVana™ miRNA Isolation Kit (Ambion/life technologies, Calsbad, CA, USA), and TRIzol^TM reagent (Invitrogen) according to the manufacturer’s protocols for K8108, MT2006, and UFV02, respectively. The quality of RNA was assessed using the TapeStation instrument (Agilent, Santa Clara, CA) or the Agilent 2100 bioanalyzer.

The RNA libraries from K8108 were normalized to 10 mM, pooled, and sequenced at 150-bp paired-end on the HiSeq X instrument at Genewiz (South Plainfield, NJ), with ten samples per lane. The transcriptome of MT2006 was sequenced with Illumina. Stranded cDNA libraries were generated using the Illumina Truseq Stranded mRNA Library Prep Kit. mRNA was purified from 1 ug of total RNA using magnetic beads containing poly-T oligos. mRNA was fragmented and reversed transcribed using random hexamers and SSII (Invitrogen) followed by second-strand synthesis. The fragmented cDNA was treated with end-pair, A-tailing, adapter ligation, and 8 cycles of PCR. The prepared libraries were quantified using KAPA Biosystem’s next-generation sequencing library qPCR kit (Roche) and run on a Roche LightCycler 480 real-time PCR instrument. The quantified libraries were then multiplexed, and the pool of libraries was prepared for sequencing on the Illumina HiSeq sequencing platform utilising a TruSeq paired-end cluster kit, v4, and Illumina’s cBot instrument to generate a clustered flow cell for sequencing. Sequencing of the flow cell was performed on the Illumina HiSeq 2500 sequencer using HiSeq TruSeq SBS sequencing kits, v4, following a 2×150 indexed run recipe. The RNA samples of UFV02 were sequenced at the Earlham Institute (Norwich, UK) on Illumina HiSeq 2500 platform with 250-bp paired-end reads. Eight different samples (as mentioned above) in three biological replicates were used for the RNA library preparation. All 24 libraries were multiplexed and sequenced on six lanes of HiSeq 2500.

TE analysis

The TE insertions are categorised based on the sequence identity 1) TEs with less than 85% sequence identity to the consensus, called old insertions, 2) TEs with 85-95% sequence identity are intermediate, and 3) TEs with more than 95% identity represent recent insertions (Supplementary Fig. 2 and 3)²⁴. All three isolates show common patterns of consensus identity, and a majority of the TEs show an intermediate age of insertions (Supplementary Fig. 2). The retrotransposon superfamilies such as terminal-repeat retrotransposons in Miniature (TRIMs) are the most recent expansion and long interspersed nuclear element (LINE), and large retrotransposon derivative (LARD) superfamilies are the most ancient insertion in the P. pachyrhizi genome (Supplementary Fig. 3). To verify the relationship between secreted genes and TEs, we calculated the distance between these features using Bedtools¹⁰⁴ with Closest algorithm, which returns the smallest genomic distance between two features. From the results obtained, we calculated the number of TEs neighbouring each secreted gene, grouped them by each TE superfamily and built the graphs. The tools used for analysis and graphs construction were Pandas v.1.3.4 and Seaborn 0.11.2 libraries, together with Python 3.9.7.

Identification of assembly haplotigs

The haplotypes were phased using the purge-haplotig pipeline¹⁰⁵ using Illumina WGS data. The haplotigs were aligned with their corresponding primary contigs using Mummer-4.0 for UFV02¹⁰⁶. Assemblytics was subsequently used to define six major types of structural variants⁶⁰, including insertions and deletions, repeat expansion and contractions, and tandem expansion and contractions.

The assembly was compared to itself using blastn (NCBI-BLAST + 2.7.1) with max_target_seqs = 10 and culling_limit = 10. After filtering for sequences matching themselves, overlaps among the remaining high-scoring segment pairs (HSPs) of > = 500 bp and > = 95% identities were consolidated with an interval tree requiring 60% overlap, then chained using MCScanX_h¹⁰⁷ to determine collinear series of matches, requiring three or more collinear blocks and choosing as a candidate haplotig sequences having at least 40% of their length subsumed by a chain corresponding to a longer contig sequence. For downstream analyses requiring a single haplotype representation, hard masking was applied to remove overlapped regions from the haplotigs using BEDtools v2.27.0¹⁰⁴. To identify gene correspondence among the three isolates, we used Liftoff software¹⁰⁸. The genome assembly of each isolate was used as a reference to map the other two isolates’ gene catalogue with >95% coverage and identity of >95%. The correspondence was established based on the gene annotation coordinates of each reference genome and the mapping coordinates from liftoff results (Supplementary Data 26).

Read mapping, variant calling and SNP effect prediction

Illumina paired-end reads of the three isolates were trimmed with Trimmomatic v0.36¹⁰⁹ to remove adapters, barcodes, and low-quality sequences with the following parameters: illuminaclip = TruSeq3-PE-2.fa:2:30:10, slidingwindow = 4:20, minlen = 36. Then, sequence data from all three isolates were aligned to the reference assembly of P. pachyrhizi UFV02 v2.1 using BWA version 0.7.17 with the BWA-mem algorithm⁹³, with the options -M -R. Alignment files were converted to BAM files using SAMtools v1.9¹¹⁰, and duplicated reads were removed using the Picard package (https://broadinstitute.github.io/picard/). The GATK v3.8.1 software¹¹¹ was used to identify and realign poorly aligned reads around InDels using Realigner Target Creator and Indel Realigner tools, creating a merged bam file for all the three isolates. The subsequent realigned BAM file was used to calling SNPs and InDels using HaplotypeCaller in GATK and filtering steps were performed to kept only high-quality variants, as following: the thresholds setting as: “QUAL < 30.00 || MQ < 40.00 || SOR > 3.00 || QD < 2.00 || FS > 60.00 || MQRankSum < −12.500 || ReadPosRankSum < −8.00 || ReadPosRankSum > 8.00”. The resulting SNPs and InDels were annotated with snpEffect v4.1¹¹².

Infection and disease progression

P. pachyrhizi is an obligate biotrophic fungus which forms a functional appressorium to penetrate the host epidermal layer within 12 HPI (hours post-inoculation)¹¹³. The penetrated epidermal cell dies after fungus establishes the penetration hyphae (PH) and forms the primary invasive hyphae (PIH) in the mesophyll cells after 24 HPI (Fig. 3a, b). The PIH differentiates and forms a haustorial mother cell, establishing the haustorium in the spongy parenchyma cells. At 72 HPI, the fungus colonises the spongy and palisade parenchyma cells (spc and ppc)¹¹⁴ (Fig. 3a, b). At 168 HPI, the uredinium starts to develop in the palisade parenchyma. At 196 HPI, the epidermal layer is broken, and the fully developed uredinia emerge. Each pustule forms thousands of urediospores and carries on the infection (Supplementary Fig. 19).

RNA transcriptome assembly

The low-quality RNA-seq reads were processed and trimmed using Trimmonatic version 0.39¹⁰⁹ with the parameters ILLUMINACLIP:2:30:10 LEADING:3 HEADCROP:10 SLIDINGWINDOW:4:25 TRAILING:3 MINLEN:40 and read quality was assessed with FastQC version 0.11.5 (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/). The high quality reads were filtered for any potential contamination among the fungi reads using Kraken2 software and parameter –unclassified-out for soybean genome and any possible contaminant species¹¹⁵. After all filtering steps, reads from each library were mapped against the three isolates assemblies using STAR v2.7.6a¹¹⁶. Parameters for mapping were (--outSAMtype BAM SortedByCoordinate, --outFilterMultimapNmax 100, --outFilterMismatchNmax 2, --outSAMattrIHstart 0, --winAnchorMultimapNmax 200, and –outWigType bedGraph). After mapping, duplicated reads were removed using Picard v.2.23.2. Htseq was used to count reads and Deseq2 to identify the differentially expressed genes in appressorium or during the host colonization relative to expression levels in the germinated-spore condition.

To validate gene annotation dedup-BAM files were analysed using StringTie v2.1.2¹¹⁷, and the gtf files obtained were merged (-m 600 -c 5) for genes and (-m 200 -c 5) for TE (TE). The final gtf file was compared with each genome annotation file per isolate using gffcompare¹¹⁸ software to validate the annotate genes and TEs. We detected 18,132, 19,467, and 22,347 genes presenting transcriptional evidence in K8108, MT2006 and UFV02 genomes, respectively, demonstrating high sensitivity (>93.9%) and precision in a locus level (>75.4%) in all three isolates (Supplementary Fig. 20 and Supplementary Data 27). For functional annotation, genes were considered expressed when each transcriptome reads were mapped against its respective reference genome, considering the criteria of TPM (Transcripts Per Kilobase Million) values > 0 in at least two biological replicates.

The BAM-dedup files obtained as above described were applied for TE expression analyses using TEtranscript software¹¹⁹. TE read counts were normalised between replicates in different conditions using R/Bioconductor package EdgeR v.3.1^120,121. Only TEs with a minimum of one read in at least two replicates were considered in this normalisation step. Libraries were normalised with the TMM method¹²², and CPM (counts per million) were generated with the EdgeR v.3.13. To better understand the expression distribution of TEs in the K8108, MT2006 and UFV02 genomes, we constructed boxplot plots to visualize the variation of expression values (average CPM) in each of their conditions. For this, we calculated the arithmetic means, the standard deviation, and the quartile values of the TEs expression in each condition for the isolates K8108, MT2006 and UFV02.

Prediction and annotation of secreted proteins

To predict classically secreted proteins, we initially searched for proteins containing a classic signal peptide and no transmembrane signal using SignalP (versions 3 and 5)³⁶, TMHMM¹²³ and Phobius¹²⁴ programs. For the identification of additional secreted proteins without a classic peptide signal and no transmembrane signal (non-classically secreted), we used EffectorP (versions 1 and 2)^33,34 and TMHMM programs. In both approaches, we kept the proteins having a TM in the N-term region. The proteins selected by both approaches were analysed by PS-SCAN program¹²⁵ to remove putative endoplasmic reticulum proteins. All programs were performed considering default parameters. The secreted proteins predicted in the previous step were annotated using Blast¹²⁶, RPSBlast, PredGPI¹²⁷, InterProScan¹²⁸ and hmmsearch¹²⁹ programs. Similarity searches using Blast program were performed against the NCBI non-redundant (nr), FunSecKb¹³⁰, Phi-base¹³¹, and LED¹³² databases, applying an e-value of 10⁻⁵. To search for domains in sequences, we used the programs RPSBlast and hmmsearch against the Conserved Domain Database (CDD)¹³³ and PFAM database¹³⁴, respectively, using an e-value of 10⁻⁵ in both cases. Orthologue mapping was done through similarity searches with the hmmsearch program against profile HMMs obtained from eggNOG database¹³⁵. To predict the localisation of proteins in the cellular compartments, ApoplastP¹³⁶, Localizer¹³⁷, targetP¹³⁸, WoLFPSORT¹³⁹, and DeepLoc¹⁴⁰ programs were used using default parameters. To assign a final localisation for each protein, the following criteria were considered: if at least two programs found the same result, that result was considered as a predicted location. Otherwise, the term “Not classified” was assigned to the protein. To identify the motifs [Y/F/W]xC in the sequences, we used a proprietary script developed in Perl language. A summary of the prediction and annotation pipelines for the secreted proteins is illustrated in Supplementary Fig. 21 and 22.

For the prediction of putative effector proteins, we used the list of predicted secreted proteins containing a classical signal peptide. For the prediction of candidate effector proteins in each genome, we defined three different approaches. In the first one, sequences predicted as “Extracellular” or “Not Classified” by the location programs and with no annotation were selected as candidates for effector proteins. We obtained 618, 531 and 598 candidates to effector proteins in K8108, MT2006 and UFV02 with this approach. In the second approach, we selected proteins with PFAM domains present in effector proteins^{141,142,143,144,145,146,147,148,149,150,151,152}. Applying this criterion, we selected 142, 128 and 55 candidates in K8108, MT2006 and UFV02, respectively. Finally, in the third approach, we ran EffectorP program to classify the effector candidates, and we obtained 802, 851 and 899 candidates in K8108, MT2006 and UFV02 genomes, respectively (Supplementary Data 6-8).

Staining of leaf samples and microscopy

Plants were inoculated by spray inoculation, and leaves were harvested at the indicated time points. Samples were destained in 1 M KOH with 0.01% Silwet L-77 (Sigma Aldrich) for at least 12 h at 37 °C and stored in 50 mM Tris-HCl pH 7.5 at 4 °C. Fungal staining was obtained with wheat germ agglutinin (WGA) FITC conjugate (Merck L4895), samples were incubated 30 min to overnight in a 20 µg/ml solution in Tris-HCl pH 7.5. Co-staining of plant tissue with propidium iodide (Sigma-Aldrich P4864) was performed according to the manufacturer’s instructions. Images were obtained with a Leica SP5 confocal microscope (Leica Microsystems) with an excitation of 488 nm and detection at 500-550 nm and 625-643 nm, respectively. Z-stacks were opened in the 3D viewer of the LAS X software (Leica Application Suite X 3.5.7.23225), and the resulting images were exported. Clipping was performed as indicated in the pictures. Shading was performed in some cases for better visualisation.

For cryo-scanning electron microscopy, inoculated soybean leaves were cut and mounted on an aluminium stub with Tissue Tek OCT (Agar Scientific Ltd, Essex, UK) and plunged frozen in slushed liquid nitrogen to cryo-preserve the material before transfer to the cryo-stage of a PP3010 cryo-SEM preparation system (Quorum Technologies, Laughton, UK) attached to a Zeiss Gemini 300 field emission gun scanning electron microscope (Zeiss UK Ltd, Cambridge, UK). Surface frost was sublimated by warming the sample to −90 °C for 4 minutes before the sample was cooled to −140 °C and sputter coated with platinum for 50 seconds at 5 mA. The sample was loaded onto the cryo-stage of the main SEM chamber and held at −140 °C during imaging at 3 kV using an Everhart-Thornley detector. False colouring of images was performed with Adobe Photoshop 22.4.2.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Source data are provided with this paper. The raw sequencing data of MT2006, K8108 and UFV02 isolates has been deposited at NCBI under the accession numbers PRJNA368291, PRJEB46918, and PRJEB44222, respectively. Source data are provided with this paper.

References

Aime, M. C. & McTaggart, A. R. A higher-rank classification for rust fungi, with notes on genera. Fungal Syst. Evol. 7, 21–47 (2021).
Article CAS PubMed Google Scholar
Savary, S. et al. The global burden of pathogens and pests on major food crops. Nat. Ecol. Evolution 3, 430–439 (2019).
Article Google Scholar
Scherm, H., Christiano, R. S. C., Esker, P. D., Del Ponte, E. M. & Godoy, C. V. Quantitative review of fungicide efficacy trials for managing soybean rust in Brazil. Crop Prot. 28, 774–782 (2009).
Article CAS Google Scholar
Yorinori, J. T. et al. Epidemics of Soybean Rust (Phakopsora pachyrhizi) in Brazil and Paraguay from 2001 to 2003. Plant Dis. 89, 675–677 (2005).
Article CAS PubMed Google Scholar
Melo Reis, E., Deuner, E. & Zanatta, M. In vivo sensitivity of Phakopsora pachyrhizi to DMI and QoI fungicides. Summa Phytopathol. 41, 21–24 (2015).
Article Google Scholar
Akamatsu, H. et al. Pathogenic diversity of soybean rust in Argentina, Brazil, and Paraguay. J. Gen. Plant Pathol. 79, 28–40 (2013).
Article Google Scholar
Paul, C., Hartman, G. L., Marois, J. J., Wright, D. L. & Walker, D. R. First report of Phakopsora pachyrhizi adapting to soybean genotypes with Rpp1 or Rpp6 rust resistance genes in field plots in the United States. Plant Dis. 97, 1379–1379 (2013).
Article CAS PubMed Google Scholar
Godoy, C. V. et al. Asian soybean rust in Brazil: past, present, and future. Pesqui. Agropecu.ária Brasileira 51, 407–421 (2016).
Article Google Scholar
Müller, M. A., Stammler, G. & May De Mio, L. L. Multiple resistance to DMI, QoI and SDHI fungicides in field isolates of Phakopsora pachyrhizi. Crop Prot. 145, 105618 (2021).
Article Google Scholar
Barro, J. P. et al. Performance of dual and triple fungicide premixes for managing soybean rust across years and regions in Brazil: A meta-analysis. Plant Pathol. 70, 1920–1935 (2021).
Article CAS Google Scholar
Ono, Y., Buriticá, P. & Hennen, J. F. Delimitation of Phakopsora, Physopella and Cerotelium and their species on Leguminosae. Mycological Res. 96, 825–850 (1992).
Article Google Scholar
Bonde, M. R. et al. Comparative susceptibilities of legume species to infection by Phakopsora pachyrhizi. Plant Dis. 92, 30–36 (2008).
Article CAS PubMed Google Scholar
Slaminko, T. L., Miles, M. R., Frederick, R. D., Bonde, M. R. & Hartman, G. L. New legume hosts of Phakopsora pachyrhizi based on greenhouse evaluations. Plant Dis. 92, 767–771 (2008).
Article CAS PubMed Google Scholar
Harmon, C. L., Harmon, P. F., Mueller, T. A., Marois, J. J. & Hartman, G. L. First report of Phakopsora pachyrhizi telia on kudzu in the United States. Plant Dis. 90, 380–380 (2006).
Article CAS PubMed Google Scholar
Loehrer, M. et al. On the current status of Phakopsora pachyrhizi genome sequencing. Front Plant Sci. 5, 377–377 (2014).
Article PubMed PubMed Central Google Scholar
Li, F. et al. Emergence of the Ug99 lineage of the wheat stem rust pathogen through somatic hybridisation. Nat. Commun. 10, 5068 (2019).
Article ADS PubMed PubMed Central Google Scholar
Schwessinger, B. et al. A near-complete haplotype-phased genome of the dikaryotic wheat stripe rust fungus Puccinia striiformis f. sp. tritici reveals high interhaplotype diversity. mBio 9, e02275–02217 (2018).
Article CAS PubMed PubMed Central Google Scholar
Miller, M. E. et al. De Novo assembly and phasing of dikaryotic genomes from two isolates of Puccinia coronata f. sp. avenae, the causal agent of oat crown rust. mBio 9, e01650–01617 (2018).
Article CAS PubMed PubMed Central Google Scholar
Duan, H. et al. Physical separation of haplotypes in dikaryons allows benchmarking of phasing accuracy in Nanopore and HiFi assemblies with Hi-C data. Genome Biol. 23, 84 (2022).
Article CAS PubMed PubMed Central Google Scholar
Henningsen, E. C. et al. A chromosome-level, fully phased genome assembly of the oat crown rust fungus Puccinia coronata f. sp. avenae: a resource to enable comparative genomics in the cereal rusts. G3 (Bethesda) 12, jkac149 (2022).
Article CAS PubMed Google Scholar
Schwessinger, B. et al. A Chromosome Scale Assembly of an Australian Puccinia striiformis f. sp. tritici Isolate of the PstS1 Lineage. Mol. Plant Microbe Interact. 35, 293–296 (2022).
Article CAS PubMed Google Scholar
Oggenfuss, U. et al. A population-level invasion by transposable elements triggers genome expansion in a fungal pathogen. eLife 10, e69249 (2021).
Article CAS PubMed PubMed Central Google Scholar
Tobias, P.A. et al. Austropuccinia psidii, causing myrtle rust, has a gigabase-sized genome shaped by transposable elements. G3 (Bethesda) 11, jkaa015 (2020).
Article Google Scholar
Maumus, F. & Quesneville, H. Ancestral repeats have shaped epigenome and genome composition for millions of years in Arabidopsis thaliana. Nat. Commun. 5, 4104 (2014).
Article ADS CAS PubMed Google Scholar
Castanera, R. et al. Transposable elements versus the fungal genome: impact on whole-Genome architecture and transcriptional profiles. PLOS Genet. 12, e1006108 (2016).
Article PubMed PubMed Central Google Scholar
Dhillon, B., Gill, N., Hamelin, R. C. & Goodwin, S. B. The landscape of transposable elements in the finished genome of the fungal wheat pathogen Mycosphaerella graminicola. BMC Genom. 15, 1132 (2014).
Article Google Scholar
Hewitt, G. The genetic legacy of the Quaternary ice ages. Nature 405, 907–913 (2000).
Article ADS CAS PubMed Google Scholar
Catlin, N. S. & Josephs, E. B. The important contribution of transposable elements to phenotypic variation and evolution. Curr. Opin. plant Biol. 65, 102140 (2022).
Article CAS PubMed Google Scholar
Almojil, D. et al. The Structural, Functional and Evolutionary Impact of Transposable Elements in Eukaryotes. Genes 12, 918 (2021).
Article CAS PubMed PubMed Central Google Scholar
Grigoriev, I. V. et al. MycoCosm portal: gearing up for 1000 fungal genomes. Nucleic Acids Res 42, D699–D704 (2014).
Article CAS PubMed Google Scholar
Dodds, P. N. & Rathjen, J. P. Plant immunity: towards an integrated view of plant-pathogen interactions. Nat. Rev. Genet 11, 539–548 (2010).
Article CAS PubMed Google Scholar
de Jonge, R., Bolton, M. D. & Thomma, B. P. How filamentous pathogens co-opt plants: the ins and outs of fungal effectors. Curr. Opin. Plant Biol. 14, 400–406 (2011).
Article PubMed Google Scholar
Sperschneider, J., Dodds, P. N., Gardiner, D. M., Singh, K. B. & Taylor, J. M. Improved prediction of fungal effector proteins from secretomes with EffectorP 2.0. Mol. Plant Pathol. 19, 2094–2110 (2018).
Article CAS PubMed PubMed Central Google Scholar
Sperschneider, J. et al. EffectorP: predicting fungal effector proteins from secretomes using machine learning. N. Phytolog. 210, 743–761 (2016).
Article CAS Google Scholar
Käll, L., Krogh, A. & Sonnhammer, E. L. A combined transmembrane topology and signal peptide prediction method. J. Mol. Biol. 338, 1027–1036 (2004).
Article PubMed Google Scholar
Almagro Armenteros, J. J. et al. SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat. Biotechnol. 37, 420–423 (2019).
Article CAS PubMed Google Scholar
Dyrløv Bendtsen, J., Nielsen, H., von Heijne, G. & Brunak, S. Improved prediction of signal peptides: SignalP 3.0. J. Mol. Biol. 340, 783–795 (2004).
Article Google Scholar
Link, T. I. et al. The haustorial transcriptomes of Uromyces appendiculatus and Phakopsora pachyrhizi and their candidate effector families. Mol. Plant Pathol. 15, 379–393 (2014).
Article CAS PubMed Google Scholar
Kunjeti, S. G. et al. Identification of Phakopsora pachyrhizi candidate effectors with virulence activity in a distantly related pathosystem. Front Plant Sci. 7, 269–269 (2016).
Article PubMed PubMed Central Google Scholar
de Carvalho, M. C. et al. Prediction of the in planta Phakopsora pachyrhizi secretome and potential effector families. Mol. Plant Pathol. 18, 363–377 (2017).
Article PubMed Google Scholar
Qi, M. et al. Suppression or activation of immune responses by predicted secreted proteins of the soybean rust pathogen Phakopsora pachyrhizi. Mol. Plant Microbe Interact. 31, 163–174 (2018).
Article CAS PubMed Google Scholar
Elmore, M. G., Banerjee, S., Pedley, K. F., Ruck, A. & Whitham, S. A. De novo transcriptome of Phakopsora pachyrhizi uncovers putative effector repertoire during infection. Physiol. Mol. Plant Pathol. 110, 101464 (2020).
Article CAS Google Scholar
Fouché, S. et al. Stress-driven transposable element de-repression dynamics and virulence evolution in a fungal pathogen. Mol. Biol. Evol. 37, 221–239 (2019).
Article Google Scholar
Fouché, S., Oggenfuss, U., Chanclud, E. & Croll, D. A devil’s bargain with transposable elements in plant pathogens. Trends Genet. 28, 222–230 (2021).
Google Scholar
Torres, D.E., Thomma, B.P.H.J. & Seidl, M.F. Transposable elements contribute to genome dynamics and gene expression variation in the fungal plant pathogen Verticillium dahliae. Genome Biology and Evolution 13, evab135 (2021).
Article CAS PubMed PubMed Central Google Scholar
Raffaele, S. et al. Genome evolution following host jumps in the Irish potato famine pathogen lineage. Science 330, 1540–1543 (2010).
Article ADS CAS PubMed Google Scholar
van der Does, H. C. & Rep, M. Virulence genes and the evolution of host specificity in plant-pathogenic fungi. Mol. Plant Microbe Interact. 20, 1175–1182 (2007).
Article PubMed Google Scholar
Li, J., Fokkens, L., Conneely, L. J. & Rep, M. Partial pathogenicity chromosomes in Fusarium oxysporum are sufficient to cause disease and can be horizontally transferred. Environ. Microbiol 22, 4985–5004 (2020).
Article CAS PubMed PubMed Central Google Scholar
Harting, R. et al. A 20-kb lineage-specific genomic region tames virulence in pathogenic amphidiploid Verticillium longisporum. Mol. Plant Pathol. 22, 939–953 (2021).
Article CAS PubMed PubMed Central Google Scholar
Jonge, R. D. et al. Tomato immune receptor Ve1 recognizes effector of multiple fungal pathogens uncovered by genome and RNA sequencing. PNAS 109, 5110–5115 (2012).
Article ADS PubMed PubMed Central Google Scholar
Croll, D. & McDonald, B. A. The accessory genome as a cradle for adaptive evolution in pathogens. PLoS Pathog. 8, e1002608 (2012).
Article CAS PubMed PubMed Central Google Scholar
Schmidt, S. M. et al. MITEs in the promoters of effector genes allow prediction of novel virulence genes in Fusarium oxysporum. BMC Genom. 14, 119 (2013).
Article CAS Google Scholar
de Jonge, R. et al. Extensive chromosomal reshuffling drives evolution of virulence in an asexual pathogen. Genome Res 23, 1271–1282 (2013).
Article PubMed PubMed Central Google Scholar
Lorrain, C., Gonçalves dos Santos, K. C., Germain, H., Hecker, A. & Duplessis S. Advances in understanding obligate biotrophy in rust fungi. N. Phytol. 222, 1190–1206 (2019).
Article Google Scholar
Judson, O. P. & Normark, B. B. Ancient asexual scandals. Trends Ecol. Evol. 11, 41–46 (1996).
Article CAS PubMed Google Scholar
Balloux, F., Lehmann, L. & de Meeûs, T. The population genetics of clonal and partially clonal diploids. Genetics 164, 1635–1644 (2003).
Article PubMed PubMed Central Google Scholar
Schwessinger, B. et al. Distinct life histories impact dikaryotic genome evolution in the rust fungus Puccinia striiformis causing stripe rust in wheat. Genome Biol. Evol. 12, 597–617 (2020).
Article CAS PubMed PubMed Central Google Scholar
Jorge, V. R. et al. The origin and genetic diversity of the causal agent of Asian soybean rust, Phakopsora pachyrhizi, in South America. Plant Pathol. 64, 729–737 (2015).
Article CAS Google Scholar
Darben, L. M. et al. Characterization of genetic diversity and pathogenicity of Phakopsora pachyrhizi mono-uredinial isolates collected in Brazil. Eur. J. Plant Pathol. 156, 355–372 (2020).
Article CAS Google Scholar
Nattestad, M. & Schatz, M. C. Assemblytics: a web analytics tool for the detection of variants from an assembly. Bioinformatics 32, 3021–3023 (2016).
Article CAS PubMed PubMed Central Google Scholar
Goellner, K. et al. Phakopsora pachyrhizi, the causal agent of Asian soybean rust. Mol. plant Pathol. 11, 169–177 (2010).
Article CAS PubMed Google Scholar
Isard, S. A., Gage, S. H., Comtois, P. & Russo, J. M. Principles of the atmospheric pathway for invasive species applied to soybean rust. BioScience 55, 851–861 (2005).
Article Google Scholar
Zheng, W. et al. High genome heterozygosity and endemic genetic recombination in the wheat stripe rust fungus. Nat. Commun. 4, 2673 (2013).
Article ADS PubMed Google Scholar
Chen, J. et al. De novo genome assembly and comparative genomics of the barley leaf rust pathogen Puccinia hordei identifies candidates for three avirulence genes. G3 (Bethesda) 9, 3263–3271 (2019).
Article CAS PubMed Google Scholar
Cuomo, C. A. et al. Comparative analysis highlights variable genome content of wheat rusts and divergence of the mating loci. G3 (Bethesda) 7, 361–376 (2017).
Article CAS PubMed Google Scholar
McTaggart, A. R. et al. Host jumps shaped the diversity of extant rust fungi (Pucciniales). N. Phytol. 209, 1149–1158 (2016).
Article Google Scholar
Aime, M. C., Bell, C. D. & Wilson, A. W. Deconstructing the evolutionary complexity between rust fungi (Pucciniales) and their plant hosts. Stud. Mycol. 89, 143–152 (2018).
Article CAS PubMed PubMed Central Google Scholar
Han, M. V., Thomas, G. W., Lugo-Martinez, J. & Hahn, M. W. Estimating gene gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3. Mol. Biol. Evol. 30, 1987–1997 (2013).
Article CAS PubMed Google Scholar
Zhou, W. et al. Glutamate synthase MoGlt1-mediated glutamate homeostasis is important for autophagy, virulence and conidiation in the rice blast fungus. Mol. Plant Pathol. 19, 564–578 (2018).
Article CAS PubMed Google Scholar
Lombard, V., Golaconda Ramulu, H., Drula, E., Coutinho, P. M. & Henrissat, B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res 42, D490–D495 (2014).
Article CAS PubMed Google Scholar
Etxebeste, O. et al. GmcA is a putative Glucose-Methanol-Choline Oxidoreductase required for the induction of asexual development in Aspergillus nidulans. PLoS ONE 7, e40292 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Chen, W., Jiang, X. & Yang, Q. Glycoside hydrolase family 18 chitinases: The known and the unknown. Biotechnol. Adv. 43, 107553 (2020).
Article CAS PubMed Google Scholar
Langner, T. & Göhre, V. Fungal chitinases: function, regulation, and potential roles in plant/pathogen interactions. Curr. Genet 62, 243–254 (2016).
Article CAS PubMed Google Scholar
Darricarrère, N., Liu, N., Watanabe, T. & Lin, H. Function of Piwi, a nuclear Piwi/Argonaute protein, is independent of its slicer activity. PNAS 110, 1297–1302 (2013).
Article ADS PubMed PubMed Central Google Scholar
Saito, K. et al. Specific association of Piwi with rasiRNAs derived from retrotransposon and heterochromatic regions in the Drosophila genome. Genes Dev. 20, 2214–2222 (2006).
Article CAS PubMed PubMed Central Google Scholar
Bourque, G. et al. Ten things you should know about transposable elements. Genome Biol. 19, 199 (2018).
Article CAS PubMed PubMed Central Google Scholar
Thomson, T. & Lin, H. The biogenesis and function of PIWI proteins and piRNAs: progress and prospect. Annu Rev. Cell Dev. Biol. 25, 355–376 (2009).
Article CAS PubMed PubMed Central Google Scholar
Bolton, M. D. Primary metabolism and plant defense–fuel for the fire. Mol. Plant Microbe Interact. 22, 487–497 (2009).
Article CAS PubMed Google Scholar
Schrader, L. & Schmitz, J. The impact of transposable elements in adaptive evolution. Mol. Ecol. 28, 1537–1549 (2019).
Article PubMed Google Scholar
Seidl, M. F. & Thomma, B. Transposable elements direct the coevolution between plants and microbes. Trends Genet 33, 842–851 (2017).
Article CAS PubMed Google Scholar
Jordan, I. K. & Bowen, N. J. Computational analysis of transposable element sequences. Methods Mol. Biol. 260, 59–71 (2004).
CAS PubMed Google Scholar
Almeida, M. V., Vernaz, G., Putman, A. L. K. & Miska, E. A. Taming transposable elements in vertebrates: from epigenetic silencing to domestication. Trends Genet. 38, 529–553 (2022).
Article CAS PubMed Google Scholar
Lorrain, C., Feurtey, A., Möller, M., Haueisen, J. & Stukenbrock, E. Dynamics of transposable elements in recently diverged fungal pathogens: lineage-specific transposable element content and efficiency of genome defenses. G3 (Bethesda) 11, jkab068 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kawashima, C. G. et al. A pigeonpea gene confers resistance to Asian soybean rust in soybean. Nat. Biotechnol. 34, 661–665 (2016).
Article CAS PubMed Google Scholar
Mayjonade, B. et al. Extraction of high-molecular-weight genomic DNA for long-read sequencing of single molecules. BioTechniques 61, 203–205 (2016).
Article CAS PubMed Google Scholar
Persoons, A. et al. Patterns of genomic variation in the poplar rust fungus Melampsora larici-populina identify pathogenesis-related factors. Front Plant Sci. 5, 450 (2014).
Article PubMed PubMed Central Google Scholar
Schwessinger, B. & Rathjen, J.P. in Wheat Rust Diseases: Methods and Protocols. (ed. S. Periyannan) 49-57 (Springer New York, New York, NY; 2017).
Xiao, C.-L. et al. MECAT: fast mapping, error correction, and de novo assembly for single-molecule sequencing reads. Nat. Methods 14, 1072–1074 (2017).
Article CAS PubMed Google Scholar
Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27, 722–736 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chin, C.-S. et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods 10, 563–569 (2013).
Article CAS PubMed Google Scholar
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Article CAS PubMed PubMed Central Google Scholar
Vaser, R., Sovic, I., Nagarajan, N. & Sikic, M. Fast and accurate de novo genome assembly from long uncorrected reads. Genome Res 27, 737–746 (2017).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Walker, B. J. et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE 9, e112963 (2014).
Article ADS PubMed PubMed Central Google Scholar
Kuo, A., Bushnell, B. & Grigoriev, I.V. in Advances in Botanical Research, Vol. 70. (ed. F.M. Martin) 1-52 (Academic Press, 2014).
Simao, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Article CAS PubMed Google Scholar
Mapleson, D., Garcia Accinelli, G., Kettleborough, G., Wright, J. & Clavijo, B. J. KAT: a K-mer analysis toolkit to quality control NGS datasets and genome assemblies. Bioinformatics 33, 574–576 (2016).
Article PubMed Central Google Scholar
Ranallo-Benavidez, T.R., Jaron, K.S. & Schatz, M.C. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat Commun 11, 1432 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Ellinghaus, D., Kurtz, S. & Willhoeft, U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinforma. 9, 18 (2008).
Article Google Scholar
Jurka, J. Repbase Update: a database and an electronic journal of repetitive elements. Trends Genet. 16, 418–420 (2000).
Article CAS PubMed Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kimura, M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 16, 111–120 (1980).
Article ADS CAS PubMed Google Scholar
Prieto, M. & Wedin, M. Dating the diversification of the major lineages of ascomycota (fungi). PLOS ONE 8, e65576 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Article CAS PubMed PubMed Central Google Scholar
Roach, M. J., Schmidt, S. A. & Borneman, A. R. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies. BMC Bioinforma. 19, 460 (2018).
Article CAS Google Scholar
Marçais, G. et al. MUMmer4: A fast and versatile genome alignment system. PLOS Comput. Biol. 14, e1005944 (2018).
Article PubMed PubMed Central Google Scholar
Wang, Y. et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 40, e49–e49 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Shumate, A. & Salzberg, S. L. Liftoff: accurate mapping of gene annotations. Bioinformatics 37, 1639–1643 (2021).
Article CAS PubMed Central Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central Google Scholar
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20, 1297–1303 (2010).
Article CAS PubMed PubMed Central Google Scholar
Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. (Austin) 6, 80–92 (2012).
Article CAS PubMed Google Scholar
Loehrer, M. et al. In vivo assessment by Mach–Zehnder double-beam interferometry of the invasive force exerted by the Asian soybean rust fungus (Phakopsora pachyrhizi). N. Phytol. 203, 620–631 (2014).
Article CAS Google Scholar
Heller, A. Host-parasite interaction during subepidermal sporulation and pustule opening in rust fungi (Pucciniales). Protoplasma 257, 783–792 (2020).
Article CAS PubMed Google Scholar
Wood, D. E., Lu, J. & Langmead, B. Improved metagenomic analysis with Kraken 2. Genome Biol. 20, 257 (2019).
Article CAS PubMed PubMed Central Google Scholar
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2012).
Article PubMed PubMed Central Google Scholar
Kovaka, S. et al. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 20, 278 (2019).
Article CAS PubMed PubMed Central Google Scholar
Pertea, G. & Pertea, M. GFF Utilities: GffRead and GffCompare. F1000Res 9, 304 (2020).
Article Google Scholar
Jin, Y., Tam, O. H., Paniagua, E. & Hammell, M. TEtranscripts: a package for including transposable elements in differential expression analysis of RNA-seq datasets. Bioinformatics 31, 3593–3599 (2015).
Article CAS PubMed PubMed Central Google Scholar
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010).
Article CAS PubMed Google Scholar
McCarthy, D. J., Chen, Y. & Smyth, G. K. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Res 40, 4288–4297 (2012).
Article CAS PubMed PubMed Central Google Scholar
Robinson, M. D. & Oshlack, A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 11, R25 (2010).
Article PubMed PubMed Central Google Scholar
Krogh, A., Larsson, B., von Heijne, G. & Sonnhammer, E. L. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J. Mol. Biol. 305, 567–580 (2001).
Article CAS PubMed Google Scholar
Käll, L., Krogh, A. & Sonnhammer, E. L. Advantages of combined transmembrane topology and signal peptide prediction–the Phobius web server. Nucleic Acids Res 35, W429–W432 (2007).
Article PubMed PubMed Central Google Scholar
Gattiker, A., Gasteiger, E. & Bairoch, A. ScanProsite: A reference implementation of a PROSITE scanning tool. Appl. Bioinforma. 1, 107–108 (2002).
CAS Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Article CAS PubMed Google Scholar
Pierleoni, A., Martelli, P. L. & Casadio, R. PredGPI: a GPI-anchor predictor. BMC Bioinforma. 9, 392 (2008).
Article Google Scholar
Blum, M. et al. The InterPro protein families and domains database: 20 years on. Nucleic Acids Res. 49, D344–D354 (2020).
Article PubMed Central Google Scholar
Eddy, S. R. Accelerated profile HMM searches. PLoS Comput Biol. 7, e1002195 (2011).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Lum, G. & Min, X. J. FunSecKB: the fungal secretome knowledgeBase. Database (Oxf.) 2011, bar001 (2011).
Google Scholar
Urban, M. et al. PHI-base: the pathogen–host interactions database. Nucleic Acids Res. 48, D613–D620 (2019).
PubMed Central Google Scholar
Fischer, M. & Pleiss, J. The Lipase Engineering Database: a navigation and analysis tool for protein families. Nucleic Acids Res 31, 319–321 (2003).
Article CAS PubMed PubMed Central Google Scholar
Marchler-Bauer, A. et al. CDD: NCBI’s conserved domain database. Nucleic Acids Res 43, D222–D226 (2015).
Article CAS PubMed Google Scholar
Mistry, J. et al. Pfam: The protein families database in 2021. Nucleic Acids Res. 49, D412–D419 (2021).
Article CAS PubMed Google Scholar
Huerta-Cepas, J. et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 47, D309–D314 (2018).
Article PubMed Central Google Scholar
Sperschneider, J., Dodds, P. N., Singh, K. B. & Taylor, J. M. ApoplastP: prediction of effectors and plant proteins in the apoplast using machine learning. N. Phytol. 217, 1764–1778 (2018).
Article CAS Google Scholar
Sperschneider, J. et al. LOCALIZER: subcellular localization prediction of both plant and effector proteins in the plant cell. Sci. Rep. 7, 44598 (2017).
Article ADS PubMed PubMed Central Google Scholar
Almagro Armenteros, J. J. et al. Detecting sequence signals in targeting peptides using deep learning. Life Sci. Alliance 2, e201900429 (2019).
Article PubMed PubMed Central Google Scholar
Horton, P. et al. WoLF PSORT: protein localization predictor. Nucleic acids Res. 35, W585–W587 (2007).
Article PubMed PubMed Central Google Scholar
Almagro Armenteros, J. J., Sønderby, C. K., Sønderby, S. K., Nielsen, H. & Winther, O. DeepLoc: prediction of protein subcellular localization using deep learning. Bioinformatics 33, 3387–3395 (2017).
Article PubMed Google Scholar
Saunders, D. G. O. et al. Using hierarchical clustering of secreted protein families to classify and rank candidate effectors of rust fungi. PLOS ONE 7, e29847 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Pendleton, A. L. et al. Duplications and losses in gene families of rust pathogens highlight putative effectors. Front Plant Sci. 5, 299 (2014).
Article PubMed PubMed Central Google Scholar
Persoons, A. et al. Genomic signatures of a major adaptive event in the pathogenic fungus Melampsora larici-populina. Genome Biol. Evol. 14, evab279 (2022).
Article CAS PubMed Google Scholar
Duplessis, S. et al. Obligate biotrophy features unraveled by the genomic analysis of rust fungi. PNAS 108, 9166–9171 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Toome, M. et al. Genome sequencing provides insight into the reproductive biology, nutritional mode and ploidy of the fern pathogen Mixia osmundae. N. Phytol. 202, 554–564 (2013).
Article Google Scholar
Perlin, M. H. et al. Sex and parasites: genomic and transcriptomic analysis of Microbotryum lychnidis-dioicae, the biotrophic and plant-castrating anther smut fungus. BMC Genom. 16, 461 (2015).
Article Google Scholar
Kämper, J. et al. Insights from the genome of the biotrophic fungal plant pathogen Ustilago maydis. Nature 444, 97–101 (2006).
Article ADS PubMed Google Scholar
Schirawski, J. et al. Pathogenicity determinants in smut fungi revealed by genome comparison. Science 330, 1546–1548 (2010).
Article ADS CAS PubMed Google Scholar
Martin, F. et al. The genome of Laccaria bicolor provides insights into mycorrhizal symbiosis. Nature 452, 88–92 (2008).
Article ADS CAS PubMed Google Scholar
Olson, Å. et al. Insight into trade-off between wood decay and parasitism from the genome of a fungal forest pathogen. N. Phytol. 194, 1001–1013 (2012).
Article Google Scholar
Frantzeskakis, L. et al. Signatures of host specialization and a recent transposable element burst in the dynamic one-speed genome of the fungal barley powdery mildew pathogen. BMC Genom. 19, 381 (2018).
Article Google Scholar
Dean, R. A. et al. The genome sequence of the rice blast fungus Magnaporthe grisea. Nature 434, 980–986 (2005).
Article ADS CAS PubMed Google Scholar

Download references

Acknowledgements

We thank Dan MacLean, Christian Schudoma, and Ram Krishna Shrestha for bioinformatics support. Bioinformatics infrastructure was supported in part by NBI Research Computing. We thank Matthew Moscou and Michael C. Schatz for many fruitful discussions. We acknowledge Heike Popovitsch for technical support. We thank Robert Dietrich and Lucio Garcia (Syngenta RTP) for their technical support with the sequencing of K8108 genome and transcriptome.

The work (Proposal 10.46936/10.25585/60000959) conducted by the U.S. Department of Energy Joint Genome Institute. (https://ror.org/04xm1d337), a DOE Office of Science User Facility, is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231.

Sequencing and RNAseq analyses of UFV02 was supported by 2Blades.

Work on the soybean isolate K8108 in the Conrath and Schaffrath lab was supported, in part, by Syngenta Crop Protection.

EM, CL and SD were in part funded by the Labex Arbre (Programme Investissement d’Avenir, ANR-11-LABX-0002-01).

Author information

These authors contributed equally: Gabriel Scalliet, Sébastien Duplessis, H. Peter van Esse.

Authors and Affiliations

2Blades, Evanston, Illinois, USA
Yogesh K. Gupta, Everton Geraldo Capote Ferreira, Yoshihiro Inoue, Kathrin Thor, Kelly Robinson & H. Peter van Esse
The Sainsbury Laboratory, University of East Anglia, Norwich, UK
Yogesh K. Gupta, Everton Geraldo Capote Ferreira, Yoshihiro Inoue, Kathrin Thor, Kelly Robinson & H. Peter van Esse
Brazilian Agricultural Research Corporation - National Soybean Research Center (Embrapa Soja), Paraná, Brazil
Francismar C. Marcelino-Guimarães, Everton Geraldo Capote Ferreira, Valéria S. Lopes-Caitar, Liliane Santana Oliveira, Aline Mara Rudsit Bini & Maurício Conrado Meyer
Pathogen Evolutionary Ecology, ETH Zürich, Zürich, Switzerland
Cécile Lorrain
National Center for Genome Resources, Santa Fe, New Mexico, USA
Andrew Farmer & Connor Cameron
U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Sajeet Haridas, Kurt LaButti, Vasanth Singan, Christopher Daum, Cindy Chen, Mi Yan, Vivian Ng & Igor V. Grigoriev
Department of Computer Science, Federal University of Technology of Paraná (UTFPR), Paraná, Brazil
Liliane Santana Oliveira & Aline Mara Rudsit Bini
Université de Lorraine, INRAE, IAM, Nancy, France
Emmanuelle Morin & Sébastien Duplessis
Syngenta Jealott’s Hill Int. Research Centre, Bracknell Berkshire, UK
Stephanie Widdison
AFMB, Aix-Marseille Univ., INRAE, Marseille, France
Elodie Drula
Biodiversité et Biotechnologie Fongiques, INRAE, Marseille, France
Elodie Drula
Department of Biological Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
Bernard Henrissat
DTU Bioengineering, Technical University of Denmark, Kgs, Lyngby, Denmark
Bernard Henrissat
Bayer SAS, Crop Science Division, Lyon, France
Eric Paget, Cécile Dorme, Lucie Chandat, Yannick Tarriotte, Stephane Peyrard, Flavie Coulombier-Chauvel, Pierrick Gautier & Catherine Sirven
KeyGene N.V., Wageningen, The Netherlands
Milan van Hoek, Antoine Janssen, Alexander H. J. Wittenberg & Harrie Schneiders
The John Innes Centre, Norwich, UK
Jake Richardson
Departamento de Fitopatologia, Universidade Federal de Viçosa, Viçosa, Brazil
Bernardo do Vale Araújo Melo, Larissa Goulart Zanardo, Valéria Cristina Holtman & Sérgio H. Brommonschenkel
Institute of Phytomedicine, University of Hohenheim, Stuttgart, Germany
Tobias I. Link & Ralf Thomas Voegele
Syngenta Crop Protection AG, Stein, Switzerland
Dirk Balmer & Gabriel Scalliet
Department of Plant Physiology, RWTH Aachen University, Aachen, Germany
André N. Müller, Sabine Kind, Stefan Bohnert, Louisa Wirtz, Uwe Conrath, Marco Loehrer & Ulrich Schaffrath
Syngenta Crop Protection, LLC, Research Triangle Park, Durham, NC, USA
Qingli Liu
Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
Igor V. Grigoriev

Authors

Yogesh K. Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Francismar C. Marcelino-Guimarães
View author publications
You can also search for this author in PubMed Google Scholar
Cécile Lorrain
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Farmer
View author publications
You can also search for this author in PubMed Google Scholar
Sajeet Haridas
View author publications
You can also search for this author in PubMed Google Scholar
Everton Geraldo Capote Ferreira
View author publications
You can also search for this author in PubMed Google Scholar
Valéria S. Lopes-Caitar
View author publications
You can also search for this author in PubMed Google Scholar
Liliane Santana Oliveira
View author publications
You can also search for this author in PubMed Google Scholar
Emmanuelle Morin
View author publications
You can also search for this author in PubMed Google Scholar
Stephanie Widdison
View author publications
You can also search for this author in PubMed Google Scholar
Connor Cameron
View author publications
You can also search for this author in PubMed Google Scholar
Yoshihiro Inoue
View author publications
You can also search for this author in PubMed Google Scholar
Kathrin Thor
View author publications
You can also search for this author in PubMed Google Scholar
Kelly Robinson
View author publications
You can also search for this author in PubMed Google Scholar
Elodie Drula
View author publications
You can also search for this author in PubMed Google Scholar
Bernard Henrissat
View author publications
You can also search for this author in PubMed Google Scholar
Kurt LaButti
View author publications
You can also search for this author in PubMed Google Scholar
Aline Mara Rudsit Bini
View author publications
You can also search for this author in PubMed Google Scholar
Eric Paget
View author publications
You can also search for this author in PubMed Google Scholar
Vasanth Singan
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Daum
View author publications
You can also search for this author in PubMed Google Scholar
Cécile Dorme
View author publications
You can also search for this author in PubMed Google Scholar
Milan van Hoek
View author publications
You can also search for this author in PubMed Google Scholar
Antoine Janssen
View author publications
You can also search for this author in PubMed Google Scholar
Lucie Chandat
View author publications
You can also search for this author in PubMed Google Scholar
Yannick Tarriotte
View author publications
You can also search for this author in PubMed Google Scholar
Jake Richardson
View author publications
You can also search for this author in PubMed Google Scholar
Bernardo do Vale Araújo Melo
View author publications
You can also search for this author in PubMed Google Scholar
Alexander H. J. Wittenberg
View author publications
You can also search for this author in PubMed Google Scholar
Harrie Schneiders
View author publications
You can also search for this author in PubMed Google Scholar
Stephane Peyrard
View author publications
You can also search for this author in PubMed Google Scholar
Larissa Goulart Zanardo
View author publications
You can also search for this author in PubMed Google Scholar
Valéria Cristina Holtman
View author publications
You can also search for this author in PubMed Google Scholar
Flavie Coulombier-Chauvel
View author publications
You can also search for this author in PubMed Google Scholar
Tobias I. Link
View author publications
You can also search for this author in PubMed Google Scholar
Dirk Balmer
View author publications
You can also search for this author in PubMed Google Scholar
André N. Müller
View author publications
You can also search for this author in PubMed Google Scholar
Sabine Kind
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Bohnert
View author publications
You can also search for this author in PubMed Google Scholar
Louisa Wirtz
View author publications
You can also search for this author in PubMed Google Scholar
Cindy Chen
View author publications
You can also search for this author in PubMed Google Scholar
Mi Yan
View author publications
You can also search for this author in PubMed Google Scholar
Vivian Ng
View author publications
You can also search for this author in PubMed Google Scholar
Pierrick Gautier
View author publications
You can also search for this author in PubMed Google Scholar
Maurício Conrado Meyer
View author publications
You can also search for this author in PubMed Google Scholar
Ralf Thomas Voegele
View author publications
You can also search for this author in PubMed Google Scholar
Qingli Liu
View author publications
You can also search for this author in PubMed Google Scholar
Igor V. Grigoriev
View author publications
You can also search for this author in PubMed Google Scholar
Uwe Conrath
View author publications
You can also search for this author in PubMed Google Scholar
Sérgio H. Brommonschenkel
View author publications
You can also search for this author in PubMed Google Scholar
Marco Loehrer
View author publications
You can also search for this author in PubMed Google Scholar
Ulrich Schaffrath
View author publications
You can also search for this author in PubMed Google Scholar
Catherine Sirven
View author publications
You can also search for this author in PubMed Google Scholar
Gabriel Scalliet
View author publications
You can also search for this author in PubMed Google Scholar
Sébastien Duplessis
View author publications
You can also search for this author in PubMed Google Scholar
H. Peter van Esse
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.K.G, F.C.M.G., C.L., A.F., S.H., E.G.C.F, V.S.L., L.S.O., E.M., S.W., Co.C., Y.I., K.T., K.R., E.D., B.H., K.L., A.M.R.B., E.P., V.S., Ch.D., Cé.D., M.v.H, A.J., L.C., Y.T., J.R., B.d.V.A.M., A.W., H.S., S.P., L.G.Z., V.C.H., F.C., T.I.L., D.B., A.M., S.K., S.B., L.W., Ci.C, M.Y., M.C.M, Q.L., M.L., S.H.B., and S.D. performed research. Y.K.G, C.L., A.F., S.H., E.G.C.F, V.S.L., L.S.O., A.M.R.B., E.M., S.W., C.C., Y.I., E.D., B.H., A.J., A.W., B.d.V.A.M., L.G.Z., T.I.L., M.L., S.H.B., and S.D. analyzed the data. Y.K.G., F.C.M.G., M.L, S.D., and H.P.v.E. edited the manuscript. Y.K.G., F.C.M.G., V.S.L., L.S.O., M.L., U.S., S.D., and H.P.v.E. wrote the paper. F.C.M.G., V.N., P.G., R.T.V., I.V.G., U.C., G.S., C.S., S.D., and H.P.v.E. directed aspects of the project.

Corresponding author

Correspondence to H. Peter van Esse.

Ethics declarations

Competing interests

Connor Cameron, Andrew Farmer, Dirk Balmer, Stephanie Widdison, Qingli Liu and Gabriel Scalliet were employees of Syngenta or affiliates during the research project. Work on the soybean isolate K8108 in the Conrath and Schaffrath lab was supported, in part, by Syngenta Crop Protection. Yogesh Kumar Gupta, Everton Geraldo Capote Ferreira, Kelly Robinson, and H. Peter van Esse have a collaboration with Bayer crop science on Asian soybean rust.

The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks David Cook and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary data 1-27

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gupta, Y.K., Marcelino-Guimarães, F.C., Lorrain, C. et al. Major proliferation of transposable elements shaped the genome of the soybean rust pathogen Phakopsora pachyrhizi. Nat Commun 14, 1835 (2023). https://doi.org/10.1038/s41467-023-37551-4

Download citation

Received: 14 July 2022
Accepted: 22 March 2023
Published: 01 April 2023
DOI: https://doi.org/10.1038/s41467-023-37551-4

This article is cited by

Analysis of five near-complete genome assemblies of the tomato pathogen Cladosporium fulvum uncovers additional accessory chromosomes and structural variations induced by transposable elements effecting the loss of avirulence genes
- Alex Z. Zaccaron
- Ioannis Stergiopoulos
BMC Biology (2024)
An pair of an atypical NLR encoding genes confer Asian soybean rust resistance in soybean
- Qingnan Hao
- Hongli Yang
- Xinan Zhou
Nature Communications (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.