Skip to main content
  • Research Article
  • Open access
  • Published:

Long-read genome sequencing reveals the sequence characteristics of pear self-incompatibility locus

Abstract

The S-RNase-based self-incompatibility locus (S-locus) in Petunia species contains 16–20 F-box genes, which collaboratively function in the recognition and subsequent degradation of non-self S-RNases, while distinguishing them from self S-RNase. However, the number of S-locus F-box genes (SFBBs) physically interacted with non-self S-RNases remains uncertain in Pyrus species. Utilizing Pacbio long-read sequencing, we successfully assembled the genome of pear cultivar ‘Yali’ (Pyrus bretschneideri), and identified 19 SFBBs from the Pyrus S17-locus spanning approximately 1.78 Mb. Additionally, we identified 17–21 SFBBs from other Pyrus and Malus S-loci spanning a range of 1.35 to 2.64 Mb. Based on the phylogenetic analysis, it was determined that Pyrus and Malus SFBBs could be classified into 22 groups, denoted as I to XXII. At amino acid level, SFBBs within a given group exhibited average identities ranged from 88.9% to 97.9%. Notably, all 19 SFBBs from the S17-locus co-segregated with S17-RNase, with 18 of them being specifically expressed in pollen. Consequently, these 18 pollen-specifically expressed SFBBs are considered potential candidates for the pollen-S determinant. Intriguingly, out of the 18 pollen-specifically expressed SFBBs, eight demonstrated interactions with at least one non-self S-RNase, while the remaining SFBBs failed to recognize any S-RNase. These findings provide compelling evidence supporting the existence of a collaborative non-self-recognition system governing self-incompatibility in pear species.

Core

A total of 17-21 S-locus F-box brothers (SFBBs) were clarified in Pyrus and Malus S-loci. Of the 19 SFBBs in Pyrus S17-locus, 18 are specifically expressed in pollen and eight of them interact with non-self S-RNase. SFBB.VIII is a universe factor in recognizing non-self S-RNases.

Gene & accession numbers

PbrSFBB.Ia-S17 accession: ON918613, PbrSFBB.Ib-S17 accession: ON918621, PbrSFBB.II-S17 accession: ON918614, PbrSFBB.III-S17 accession: ON918615, PbrSFBB.IV-S17 accession: ON918616, PbrSFBB.V-S17 accession: ON918617, PbrSFBB.VI-S17 accession: ON918618, PbrSFBB.VII-S17 accession: ON918619, PbrSFBB.VIII-S17 accession: ON918620, PbrSFBB.X-S17 accession: ON918622, PbrSFBB.XI-S17 accession: ON918623, PbrSFBB.XII-S17 accession: ON918624, PbrSFBB.XIII-S17 accession: ON918625, PbrSFBB.XIV-S17 accession: ON918626, PbrSFBB.XV-S17 accession: ON918627, PbrSFBB.XVI-S17 accession: ON918628, PbrSFBB.XVII-S17 accession: ON918629, PbrSFBB.XVIII-S17 accession: ON918630.

Introduction

Self-incompatibility (SI) is a prevalent genetic mechanism that facilitates the rejection of self-pollen by the pistil as a means to prevent inbreeding. This mechanism is observed in around 40% of flowering plant species across at least 100 families (de Nattancourt, 2001; Igic et al. 2008). SI can be categorized into two main types: sporophytic SI, primarily found in Brassicaceae family (Hiscock and Mclnnis 2003), and gametophytic SI (GSI), extensively studied in Rosaceae, Solanaceae, and Plantaginaceae families (Fuji et al., 2016). In these three families, GSI is controlled by an S locus that comprises genes encoding an S-RNase expressed in pistils and multiple S-locus F-box (SLF/SFB) proteins expressed in pollen (Ushijima et al. 2003; Kao and Tsukamoto, 2004; Franklin-Tong 2008; Kubo et al., 2010; Sun et al. 2018). Upon entering the pollen tube, the S-RNase has the potential to impede its growth if the haploid pollen's S-haplotype is identical to any of the S-haplotypes present in the style (Chen et al. 2018). The SLF/SFB proteins, as constituents of Skp1-Culin1-F-box protein degradation complex, facilitate the degradation of non-self S-RNase through E3 ligase-mediated ubiquitination (Hua and Kao 2006).

During the GSI reaction, the SLF/SFB proteins exhibit two distinct mechanisms for recognizing S-RNase (Fuji et al., 2016). Self-recognition is observed specifically in Prunus species, where the unique pollen SLF/SFB protein can selectively identify self S-RNase, thereby impeding ubiquitination and degradation mediated by other SLFs (Li et al. 2020). Non-self-recognition is reported in Petunia species, where self S-RNase failed to be recognized by any of multiple SLFs, while non-self S-RNases are recognized by at least one SLF (Kubo et al., 2010; Song et al. 2023; Sun et al. 2018). It is worth noting that certain SLFs may not interact with any of the tested S-RNases (Sun et al. 2018), suggesting their lack of involvement in non-self-recognition. Furthermore, the non-self recognition mechanism is also suggested in pear (Pyrus pyrifolia) species, wherein pollen bearing the S4-haplotype and lacking an F-box gene (SFBB1-S4) is rejected by the style containing S1-haplotype (Kakui et al. 2011). However, the specific number of F-box proteins capable of recognizing S1-RNase and the role of SFBB1-S4 in the degradation of S1-RNase remain unclear.

The S-locus in Pyrus and Malus species is believed to encompass multiple F-box genes for each S-haplotype (Kakui et al. 2011). Through the utilization of bacterial artificial chromosome (BAC) libraries, a total of 12 and 10 F-box genes were identified from the Malus S3- and S9-locus, respectively (Minamikawa et al. 2010). Similarly, 10 and six F-box genes were identified from the Pyrus S2- and S4-locus, respectively, and designated as S-locus F-box brother genes (SFBBs; Okada et al. 2011). In addition, a total of 27 F-box genes were identified from the two Pyrus S-locus (S21 and S34; Huang et al. 2023). In Petunia species, 17 F-box genes were identified from approximately 3.1 Mb within the S2-locus (Wu et al. 2020), while 16–20 SLF genes were identified from other S-haplotypes (Williams et al. 2014; Kubo et al. 2015). However, the repetitive nature of Solanaceae S-locus sequences poses challenges in accurately assembling the S-locus using short-read sequences (Wu et al. 2020). A recent study successfully assembled the Antirrhinum genome using PacBio long-read sequencing, resulting in the identification of 32 SLF genes from two S-haplotypes spanning approximately 1.2 Mb (Zhu et al., 2023). Thus, PacBio long-read sequencing is a reliable method for determining the size of the S-locus region and identifying the S-locus F-box genes. In a previous investigation, the S34-RNase detected in Pyrus bretschneideri ‘Yali’ was renamed as S17-RNase due to identical sequences (Wang et al. 2017). In this study, the reference genome of ‘Yali’ were assembled by using PacBio long-read sequences to elucidate the size of the S17-locus region and to identify the F-box genes within the region. Furthermore, protein–protein interaction was performed to investigate the cross-recognition between SFBBs and self/non-self S-RNases. The findings from this study contribute to a better understanding of the collaborative non-self recognition system in S-RNase-based GSI.

Results

Identification of S17-locus using the genome sequencing of ‘Yali’

To develop a de novo assembly of ‘Yali’ genome, we initially generated 63.57 Gb of Illumina paired-end reads. Based on the 17-mer spectrum, the estimated genome size was 545.39 Mb with a heterozygosity rate of 1.35%. Subsequently, 34.44 Gb of long-reads (a mean length of 9,721 bp) were generated using Ilumina PacBio platform. The PacBio reads were self-corrected and assembled into genomic contigs using FALCON (Chin et al. 2016), and then polished with the Illumina paired-end reads using pilon software (Walker et al. 2014). The final assembly reached 534.45 Mb and consisted of 582 contigs, with an N50 of 1.81 Mb (Table S1). Of these contigs, 566 could be mapped to and almost cover all chromosomes in ‘Cuiguan’ genome (Fig. 1).

Fig. 1
figure 1

Mapping of ‘Yali’ contigs to the ‘Cuiguan’ genome. The columns with brown color are the chromosomes of ‘Cuiguan’ genome, while the columns with green color are the contigs of ‘Yali’ assembly. In each chromosome, the contigs are linked with the homologous regions of ‘Cuiguan’ genome by the lines with different colors. A total of 566 contigs could be mapped to the ‘Cuiguan’ genome

The completeness of the assembly was assessed using benchmarking universal single-copy orthologs (BUSCO; Simão et al. 2015), which revealed that 95.2% of the universal single-copy orthologs were present (Table S1), indicating a reliable assembly. A total of 262.46 Mb of repetitive elements accounted for 49.11% of assembly (Tables S2), and approximately 293.95 Kb of non-coding RNAs accounted for 0.06% of the assembly (Tables S3). Moreover, the total of 2438 non-coding RNAs were identified, including 745 microRNAs, 720 transfer RNAs, 155 ribosomal RNAs, and 818 small nuclear RNAs. Furthermore, a total of 45,092 protein-coding genes were predicted from the assembly (Table S1). Notably, the S17-RNase was annotated in the contig 33 (Fig. 1). These results suggest a reliable completeness of the assembly.

Identification of the F-box genes in Pyrus S17-locus

To identify the F-box genes within the S17-locus, a blasting search was performed in the contig 33 of the ‘Yali’ genomes using the conserved F-box motif (Figure S1) of Pyrus and Malus SFBBs as query sequences. This search yielded 19 SFBBs, each encoding more than 150 amino acids, within the contig 33 (Fig. 2a; Table S4), which is a larger number than the 15 SFBBs reported in a previous study that identified SFBBs within the same S-locus (Huang et al. 2023). Notably, these SFBBs were found within the region from 0.94 Mb upstream and 0.83 Mb downstream of S17-RNase (Tables S4 and S5). This finding indicates that Pyrus SFBBs are distributed within a genomic region of less than 4 Mb, probably.

Fig. 2
figure 2

Sequence analysis of the F-box genes in Pyrus and Malus S-loci. a Arrangement of F-box genes in Pyrus, Malus, and Prunus S-loci. The direction of gene transcription is indicated by arrowheads, wherein black-colored arrowheads represent the S-RNase gene, and other colored arrowheads represent S-locus F-box genes. The characters above the arrowheads denote the classification of S-locus F-box genes in Pyrus, Malus, and Prunus species. The physical distance (in kilobases, Kb) between adjacent F-box genes is indicated below the dotted line, while the chromosome locations of these F-box genes are provided in Table S4. It should be noted that the abbreviation "SLF" specifically applies to S-locus F-box genes in Solanaceae, Plantaginaceae, and Prunus. b Phylogenetic classification of S-locus F-box brother (SFBB) proteins in Pyrus and Malus. The I → XXII labels signify the different groups of classified S-locus F-box proteins, with Group I including the sub-groups Ia, Ib, and Ic. All groups and sub-groups are highlighted in distinct colors

Consequently, a subsequent blasting search was performed in the region from approximately 2 Mb upstream to 2 Mb downstream of the S-RNase in the Pyrus and Malus reference genomes that has been previously assembled using long-read sequences (Daccord et al. 2017; Linsmith et al., 2019; Zhang et al. 2019; Dong et al. 2020; Sun et al. 2020; Gao et al. 2021). The results showed that in Pyrus species, 19, 17, and 20 SFBBs were identified from P. communis S101-locus (PcS101-locus), P. pyrifolia S5-locus (PpyS5-locus), and P. betuleafolia S67-locus (PbeS67-locus), respectively (Fig. 2a; Table S4). In the case of Malus domestica, 21, 21, 20, and 20 SFBBs were identified from S2-, S3-, S5-, and S9-locus, respectively (Fig. 2a; Table S4). Furthermore, the short-read sequences from the P. bretschneideri cultivar ‘Dangshansuli’ (S7S17) genome (Shi et al., 2019; Wu et al., 2013) were mapped onto contig 33, and 20 SFBBs were identified from the remaining S7-locus (Fig. 2a; Table S4). These findings indicate that Pyrus and Malus S-loci contain 17–21 SFBBs. In contrast, Prunus salicina S33-locus, Prunus avium S9-locus, and Prunus armeniaca S24-locus displayed the identification of 14, 12, and 12 SLFs/SFBs, respectively (Fig. 2a; Table S4).

Sequence analysis of protein-coding SFBBs

Phylogenetic classification of the 177 SFBBs derived from Pyrus and Malus S-haplotypes was performed based on their amino acid sequences. The SFBBs were classified into 22 distinct groups, denoted as groups I to XXII (Fig. 2b). Notably, groups I to VIII were found to correspond to the previously reported groups 1 to 8 (Kakui et al. 2011). Among these groups, group I exhibited further subdivision into sub-groups Ia, Ib, and Ic (Fig. 2b). As shown in Fig. 2a, SFBB.XVIII is positioned closest to the SFBBs in group XXI and/or XXII at one end of the S-locus in any S-haplotype, while SFBB.XVII is found closest to SFBB.XVI at another end but is absent in Pyrus S-locus. SFBBXXI is located between SFBBVIII and SFBBXXII, though it has been occasionally deleted. The remaining SFBBs appear to be randomly arranged within Pyrus and Malus S-loci (Fig. 2a). In contrast, the F-box genes within Prunus S-locus were grouped into 11 clusters, designated as SLF1 to SLF10 and SFB (Figure S2), with relatively stable positions (Fig. 2a).

The nucleotide acid sequence of all F-box genes present in the ‘Yali’ genome was utilized for phylogenetic analysis in conjunction with the 27 SFBBs previously identified in ‘Yali’ pollen (Huang et al. 2023). The results showed that all 27 SFBBs were grouped together with the 19 SFBBs identified in the Pyrus S17-locus (Figure S3). Furthermore, the SFBBs previously identified from the Pyrus S34-locus (which had been renamed as S17-locus; Wang et al. 2017) were closest clustered with those identified in the ‘Yali’ genome (Figure S3). Remarkably, the bootstrap values reached 100 for PbrSFBB.XIX-S17 and PbrSLF16-S21, PbrSFBB.VI-S17 and PbrSLF6-S21, as well as PbrSFBB.VIII-S17 and PbrSFBB8-S21 (Figure S3). This high bootstrap value indicates that the three SFBB genes, namely PbrSLF6-S21, PbrSFBB8-S21, and PbrSLF16-S21, are potentially located within the S17-locus.

The level of sequence polymorphism observed within each group of SFBBs was relatively low. At nucleotide acid level, the average identities ranged from 92.1% to 98.4%, while at amino acid level, the average identities ranged from 88.9% to 97.9% (Tables 1 and S6). The sequence identities among SFBBs within each group were similar to those observed among Prunus SLFs, but greater than those observed among Prunus SFBs and Pyrus S-RNases (Tables S7 to S9). Notably, the inter-group identity of SFBBs was significantly lower than the intra-group identity of SFBBs (p-value = 0.0022 < 0.05). Across groups I to XXII, the average identities among SFBBs ranged from 12.3% to 84.0% at nucleotide acid level and from 0.06% to 71.8% at amino acid level (Tables 1 and S6). These values were closely resemble the sequence identities observed among S-RNases (Table 1).

Table 1 Sequence similarity of F-box genes in Pyrus, Malus, and Prunus S-haplotypes

Long-read sequence analysis was utilized to examine gene duplication events in the Pyrus and Malus genomes. The findings revealed the presence of dispersed, proximal, tandem, and transposed duplications within various S-loci, such as Pyrus S17-locus and other S-locus, as well as Malus S3-locus and Pyrus S5-, S67-, and S101-locus (Figure S4; Table S10). For instance, in the Pyrus S17-locus, the tandem duplicate SFBB.X and SFBB.XI were found adjacent to each other without any intervening genes. The proximal duplicate SFBB.X and SFBB.Ib were separated by a few genes, and the dispersed duplicate SFBB.XI and SFBB.XIII were not adjacent and did not share a homologous segment. In the Pyrus S67-locus, it was observed that the transposed duplicate SFBB.XI and SFBB.XIV may have been relocated by DNA transposons. Additionally, through synteny analysis, it was determined that deletions, inversions, and/or translocations of these SFBBs were frequently observed between any two S-locus (Figures S5 to S11).

Comparison analysis of the non-coding flanking sequences of SFBBs

In order to investigate the sequence polymorphism of the non-coding flanking sequences of SFBBs, an analysis was conducted on the 5 kb upstream and 5 kb downstream sequences of SFBBs within each S-locus and group (Figures S12 to S40). The results showed that the identities between either the upstream or downstream sequences of all SFBBs in any S-locus were lower (Figs. 3a and S12 to 19), compared to those between all SFBBs within any group (Figs. 3b and S20 to S40; Table S11). Particularly, in group XXII, the downstream sequences displayed extremely high identities (> 90%) between any two SFBBs (Figs. 3b and S40d), as well as the upstream sequences between any two Malus SFBBs (Figure S40b), which closely resembled those found in the coding sequences of SFBBs (Table 1). However, although a few pairwise SFBBs in groups III (Figure S23b,d), VIII (Figure S28b,d), XI (Figure S30b,d), XII (Figure S31b,d), XIV (Figure S33b,d), XVI (Figure S35b,d), XVII (Figure S36b,d), XVIII (Figure S37b,d), and XIX (Figure S38b,d), as well as sub-group Ia (Figure S20b,d), exhibited extremely high identities in either the upstream or downstream sequences, the average identity in the upstream or downstream sequences between SFBBs in any group (excluding XXII) was lower than in the coding sequences of SFBBs (Figs. 3b; Tables 1 and S11). These results demonstrate that genetic variation frequently occurs in the non-coding flanking sequences of Pyrus and Malus SFBBs.

Fig. 3
figure 3

Comparison analysis of the non-coding sequences in Pyrus and Malus S-loci. a Pairwise identity of the 5 kb upstream and downstream sequences of SFBBs within each group and sub-group. The specific data can be found in Figures S11 to S31. b Pairwise identity of the 5 kb upstream and downstream sequences of SFBBs in Pyrus and Malus S-loci. The corresponding data can be found in Figures S32 to S39. c Location of SFBBs, LTRs, and other genes within Pyrus and Malus S-loci. The outer region is the length of DNA sequences that contain S-locus. SFBBs, LTRs, and other genes are depicted in red, green, and blue colors, respectively. The phylogenetic tree was constructed using MEGA5 with the neighbor-joining method. Bootstrap values were calculated from 1000 replicates

The significant role of long terminal repeat (LTR) retrotransposon in genetic recombination is widely acknowledged (Kumar and Bennetzen 1999). In this study, a survey of LTR retrotransposons within the S-loci of Pyrus and Malus species revealed that the total length of LTR retrotransposons accounted for approximately 13.49% of the tested regions (Table S12). A total of 568, 459, 521, and 515 LTR retrotransposons were detected in Pyrus S17-, S5-, S67-, and S101-locus, respectively, and 459, 633, 665, and 650 LTR retrotransposons were detected in Malus S2-, S3-, S5-, and S9-locus, respectively (Tables S12 and S13). On average, the number of LTR retrotransposons was 1.64 times (p-value < 0.001) that of the genes in the tested region (Table S12). Notably, the number of LTR retrotransposons was 4.26 times (p-value < 0.001) that of the genes in the region between SFBB.XVI and SFBB.VIII, but only 1.09 times (p-value = 0.55) that of the genes in the other region (Table S12). This result indicated that the density of LTR retrotransposons in the region between SFBB.XVI and SFBB.VIII was higher (3.89-fold; p-value < 0.005) compared to the other region (Fig. 3c). The elevated density of LTR retrotransposons between SFBB.XVI and SFBB.VIII may be associated with the genetic variations, such as gene duplication, deletion, inversion, and translocation of SFBBs in Pyrus and Malus species.

Inheritance and tissue specific expression of SFBBs in Pyrus S17-locus

To investigate the inheritance pattern of SFBBs within the Pyrus S17-locus, a total of 42 individuals were selected from the progeny resulting from a cross between ‘Yali’ (S17S21) and ‘Xueqing’ (S3S16). By analyzing the segregation of S-RNase alleles, four different S-genotypes in this progeny, which aligned with the expected 1:1:1:1 ratio (χ2 = 0.286 < χ20.05, 3 = 7.815; Figure S41). Furthermore, 19 pairs of primer were employed to amplify the full-length sequences of the 19 SFBBs within the Pyrus S17-locus. The resultant amplification products were subsequently cloned and sequenced. Alignment of the sequences revealed the presence of all SFBBs exclusively in the hybrids harboring the S17-RNase allele (Fig. 4a and S42 to S60). This finding strongly indicates that the SFBBs within the Pyrus S17-locus exhibit co-segregation with S17-RNase.

Fig. 4
figure 4

The co-segregation of pollen-specifically expressed SFBBs from the Pyrus S17-locus with S17-RNase. a A diagram shows the alignments of 19 SFBBs in the ‘Yali’ genome and in the hybrids containing S17-RNase. b Integrative Genomics Viewer (IGV) snapshots show the transcript levels of S-RNase and SFBB genes from the Pyrus S17-locus in various pear tissues. The pollen-specifically expressed SFBBs are highlighted in pink color. Detailed sequence alignments can be found in Figures S42 to S60

To reveal the expression pattern of the 19 SFBBs within the Pyrus S17-locus across various tissues, the transcriptome sequence reads from ‘Dangshansuli’ were utilized (Shi et al. 2017; Qiao et al. 2018; Li et al., 2019; Wu et al. 2023). These reads were previously generated and mapped to the genome of ‘Yali’. The analysis showed that 18 SFBBs exhibited specific expression in pollen, whereas S17-RNase displayed specific expression in the style (Fig. 4b and S61). Notably, the remaining SFBB, PbSFBB.XXII-S17, exhibited expression in the style, root, stem, petal, leaf, and fruit, but not in pollen (Figs. 4b and S61). These findings suggest that the 18 SFBBs specifically expressed in pollen are the candidates of pollen-S determinant and may directly interact with non-self S-RNase.

Physical interaction between S-RNases and SFBBs

To examine the potential interaction between SFBBs and self/non-self S-RNases, yeast-two-hybrid (Y2H) and firefly luciferase complementation imaging (FLCI) assays were conducted. The Y2H assay demonstrated the absence of self-activation for any S-RNase or SFBB within yeast cells (Figure S62). Notably, none of the SFBBs expressed in the S17-locus exhibited interaction with S17-RNase (Fig. 5a; Table 2). In contrast, eight SFBBs, PbrSFBB.Ia-S17, PbrSFBB.Ib-S17, PbrSFBB.II-S17, PbrSFBB.VIII-S17, PbrSFBB.X-S17, PbrSFBB.XI-S17, PbrSFBB.XII-S17, and PbrSFBB.XVIII-S17, were found to interact with at least one S-RNase (Fig. 5a, b). This finding further supports the involvement of a collaborative non-self-recognition system in the regulation of S-RNase-based GSI in pear. However, it is worth noting that 10 SFBBs did not exhibit any interaction with the tested S-RNases (Fig. 5a). This outcome may be attributed to the limited number of non-self S-RNases utilized in this particular study.

Fig. 5
figure 5

Interaction of self and non-self S-RNases with the pollen-specifically expressed SFBBs from the Pyrus S17-locus. a Yeast-two-hybrid assay showing the interaction of 18 SFBBs with 9 S-RNases. b Firefly luciferase complementation assay were performed between the potentially interacting proteins identified by the yeast-two-hybrid assay. The intensity of the interaction is depicted using different colors, which can be evaluated using the color column ruler. c Yeast-two-hybrid assay showed that S-RNases lacking the HV region (S5-RNase* and S7-RNase*) are unable to interact with PbSFBB.VIII-S17. AD and BD represent the pGADT7 and pGBKT7 vectors, respectively. Arabic numeral indicates the serial number assigned to each S-RNase. Ia, Ib, and II → XX are the serial numbers assigned to SFBBs in the Pyrus S17-locus (SFBB-S17). SD/-Trp/-Leu denotes the SD medium lacking Trp and Leu; SD/-Trp/-Leu/-His/-Ade denotes the SD medium lacking Trp, Leu, His, and Ade; SD/-Trp/-Leu-His/-Ade + X-α-gal indicates the addition of x-a-gal to the SD medium lacking Trp, Leu, His, and Ade. P53/T-antigen represents the positive controls, while Lamin/T-antigen represents the negative control

Table 2 Summary of the interaction of the SFBBs expressed in Pyrus S17-locus with self- and non-self S-RNases

Previous studies have showed the significance of the hypervariable (HV) region of S-RNase in mediating the recognition between S-RNase and pollen-S determinant (Matton et al. 1997, 1999; Wu et al. 2013). To test whether the HV region of Pyrus S-RNase is involved in the interaction between SFBBs and non-self S-RNases, an Y2H assay was conducted after removing the HV region from S5- and S7-RNases. The results showed that PbrSFBB.VIII-S17 was unable to interact with the HV region-deleted S5- and S7-RNases within yeast cells (Fig. 5c). This outcome suggests that the HV region of Pyrus S-RNase plays a crucial role in facilitating the cross-recognition of non-self S-RNases by the interacting SFBBs.

To clarify the sub-cellular localization of the eight SFBBs physically interacting with non-self S-RNases, the SFBBs were fused with the green fluorescence protein GFP (SFBB::GFP). Arabidopsis aquaporin PIP2A fused with red fluorescence marker mCherry (PIP2A::mCherry) was used to show plasma membrane. Image analysis showed that in the control tobacco mesophyll cells co-transformed with PIP2A::mCherry and GFP empty vector, the GFP was detected in all visible tissues, while the PIP2A::mCherry was only detected in cell and nuclear membranes (Fig. 6). This result contributed to the presence of yellow fluorescence in cell and nuclear membranes in the merged images (Fig. 6). In the tobacco cells co-transformed with SFBB::GFP and PIP2A::mCherry, the GFP was detected in the inner side of yellow fluorescence in the merged images (Fig. 6), suggesting that all eight SFBBs may be localized in cytoplasm but not cell wall or cell membrane. To confirm it, protoplasts were extracted from the tobacco mesophyll cells. In the protoplast transformed with GFP empty vector, the GFP was detected in all visible tissues including cell membrane, cytoplasm, and nucleus (Fig. 6). In the protoplast transformed with SFBB::GFP, the GFP was only detected in cytoplasm (Fig. 6), suggesting that all eight SFBBs are localized in cytoplasm.

Fig. 6
figure 6

Subcellular localization of eight SFBBs physically interacting with non-self S-RNases in tobacco mesophyll cell and protoplast. PbSFBB.Ia-S17, PbSFBB.Ib-S17, PbSFBB.II-S17, PbSFBB.VIII-S17, PbSFBB.X-S17, PbSFBB.XI-S17, PbSFBB.XII-S17, and PbSFBB.XVIII-S17 are the SFBBs physically interacting with non-self S-RNases. Control is the empty vector. GFP represents the GFP-fused proteins. PIP2A::mcherry is a control for plasma membrane localization. Bright and Merge show the co-localization of GFP-fused protein and PIP2A::mcherry. Bars = 20 μm

The other genes in Pyrus S17-locus

In the genome of ‘Yali’, besides S-RNase and SFBBs, a total of 114 genes were found between PbrSFBB.XVI-S17 and PbrSFBB.XXII-S17 (Table S5). Similarly, between SFBB.XVI and PbSFBB.XXII in Pyrus S101-, Pyrus S67-, Pyrus S5-, Malus S2-, Malus S3-, Malus S5-, and Malus S9-loci, there were 79, 153, 127, 156, 136, 138, and 115 genes, respectively (Table S5). Notably, 29 of these genes were shared between the Pyrus S17-locus and other Pyrus and Malus S-loci (Fig. 7a). Additionally, these 29 genes were consistently located between SFBB.VIII and SFBB.XXII in all S-locus. Transcriptome data analysis of various tissues revealed that 11 genes, Serine/threonine-protein kinase CTR1 (evm.TU.G_33.131), 50S ribosomal protein L1 (evm.TU.G_33.130), zuotin-like (evm.TU.G_33.107), Ras-related protein Rab11D (evm.TU.G_33.106), ATP-dependent zinc metalloprotease FTSH2 (evm.TU.G_33.103), Receptor protein kinase TMK1 (evm.TU.G_33.90), U-box domain-containing protein 21 (evm.TU.G_33.89), DELLA protein GAI (evm.TU.G_33.76), Protease Do-like 7 (evm.TU.G_33.37.2), Phosphatidylinositol 4-kinase beta 1 (evm.TU.G_33.36), and an uncharacterized protein (evm.TU.G_33.130) were expressed (RPKM > 2) in style, but not in pollen (Fig. 7b; Table S14). These genes may be associated with the style growth and development. Moreover, five genes, Multiprotein-bridging factor 1a (evm.TU.G_33.125), Shaggy-related protein kinase zeta (evm.TU.G_33.119), Aspartate aminotransferase (evm.TU.G_33.115), Nuclear poly(A) polymerase 1 (evm.TU.G_33.95), and an uncharacterized protein (evm.TU.G_33.81), were expressed in both pollen and style (Fig. 7b; Table S14), indicating their potential involvement in pollen tube growth and/or self-incompatibility reaction.

Fig. 7
figure 7

Identification and expression analysis of 29 genes shared in all tested S-loci. a Synteny of 29 genes across the tested S-loci. b Transcriptome analysis showing the gene expression level in various tissues, including pollen, style, root, stem, petal, leaf, and fruit. The Y-axis represents the PRKM value, while the X-axis represents the genes shared in all tested S-loci. The order of the 29 genes in the S17-locus is consistent between (a) and (b)

Discussion

Evaluation of the length of S-locus region in Maloideae species

The S-RNase-based GSI system has been extensively investigated in plants belonging to the Solanaceae, Plantaginaceae, and Rosaceae families (Hua et al. 2008). Recent studies utilizing PacBio long-read sequencing have revealed that two Antirrhinum S-haplotypes encompass approximately 1.2 Mb and contain 32 SLF genes (Zhu et al., 2023). Similarly, using a combination of BAC library and PCR-based approaches, it has been determined that the BAC contigs containing 17 SLFs cover roughly 3.1 Mb of the Petunia S2-locus (Wu et al. 2020). By comparing the Petunia SLFs to the Solanum genomes, it has been estimated that the S-locus region spans approximately 17.9 Mb in tomato (Solanum lycopersicum) and 14.6 Mb in potato (S. tuberosum; Kubo et al. 2015; Wu et al. 2020). In Pyrus species, 10 SFBBs were identified from the BAC contigs covering 378 kb of the S2-locus, while six SFBBs were identified from the BAC contigs covering 649 kb of the S4-locus (Okada et al. 2011). The number of SFBBs in Pyrus S-locus was found to be lower compared to Petunia and Antirrhinum S-loci, indicating that several SFBBs in Pyrus S-loci remained unidentified. In this study, we used long-read sequencing to assemble the genomes of pear cultivar ‘Yali’, and we successfully identified 19 SFBBs from Pyrus S17-locus located at the end of Chromosome 17 (Fig. 1). Furthermore, leveraging previously assembled genomes of Pyrus and Malus S-loci using long-read sequences (Daccord et al. 2017; Linsmith et al., 2019; Zhang et al. 2019; Dong et al. 2020; Sun et al. 2020; Gao et al. 2021), 17–21 SFBBs were identified from these reported S-loci (Fig. 2a; Table S4), which were distributed within a range of 1.35–2.64 Mb (Fig. 3c; Table S4). As a comparison, we also identified the SLFs/SFBs from Prunus S-locus (Figure S2), which were distributed within a range of 2.82–3.07 Mb (Table S4). These findings suggest that, on average, the Pyrus and Malus S-loci may have a smaller size compared to the Prunus S-locus.

The pollen-specifically expressed SFBBs are good candidates for pollen-S determinant in pear

In the GSI mechanism, the pollen-S determinant in Prunus species of Rosaceae family is encoded by an SLF/SFB (Ushijima et al. 2003, 2004; Sonneveld et al. 2005). These Prunus SLFs/SFBs exhibit considerable sequence polymorphism among different S-haplotypes, are specifically expressed in pollen, and are tightly linked with the corresponding S-RNase (Ushijima et al. 2003; Wu et al. 2013). Conversely, the pollen-S determinant in Solanaceae and some Rosaceae species consists of multiple SLFs (Kubo et al., 2010; Kakui et al. 2011). These SLFs are also specifically expressed in pollen and are tightly linked with the corresponding S-RNase; however, the sequence polymorphism is relatively lower compared to the Prunus SLFs/SFBs (Kakui et al. 2011; Kubo et al. 2015). In this study, a total of 19 SFBBs were identified from the Pyrus S17-locus (Fig. 2a) and showed co-segregation with the S17-RNase in cross-pollinated progeny (Fig. 4a). Among these SFBBs, PbrSFBB.Ia-S17, PbrSFBB.Ib-S17, PbrSFBB.II-S17, PbrSFBB.III-S17, PbrSFBB.IV-S17, PbrSFBB.VI-S17, PbrSFBB.VII-S17, PbrSFBB.VIII-S17, PbrSFBB.X-S17, PbrSFBB.XI-S17, PbrSFBB.XII-S17, PbrSFBB.XIII-S17, PbrSFBB.XIV-S17, PbrSFBB.XV-S17, PbrSFBB.XVI-S17, PbrSFBB.XVIII-S17, PbrSFBB.XIX-S17, and PbrSFBB.XX-S17 were specifically expressed in pollen (Fig. 4b). The sequence polymorphism of each of these 18 SFBBs was also lower than that observed among Prunus SLFs/SFBs (Tables 1 and S6 to S8). Additionally, PbrSFBB.Ia-S17, PbrSFBB.Ib-S17, PbrSFBB.II-S17, PbrSFBB.VIII-S17, PbrSFBB.X-S17, PbrSFBB.XI-S17, PbrSFBB.XII-S17, and PbrSFBB.XVIII-S17 showed direct interaction with at least one non-self S-RNase, while the absence of interaction between the remaining 10 SFBBs and non-self S-RNases may be attributed to the limited number of non-self S-RNases used in this study (Fig. 5). Collectively, these findings indicate that these 18 SFBBs that are specifically expressed in pollen represent promising candidates for the pollen-S determinant in pear.

The cross-recognition of SFBBs with non-self S-RNases

In the non-self recognition system, it has been observed that S-RNases cannot be recognized by any SLF present in the same S-haplotype, but they can be recognized by at least one of multiple SLFs present in different S-haplotypes (Sun et al. 2018). A similar mechanism is also proposed in Pyrus species (Kakui et al. 2011); however, direct evidence supporting this viewpoint is lacking. In this study, both Y2H and FLCI assays demonstrated that eight out of the 19 SFBBs in the Pyrus S17-locus are involved in recognizing non-self S-RNases (Fig. 5). This finding provides support for the viewpoint of the non-self recognition system in Pyrus species. It is important to note that PbrSFBB.VIII-S17 is the only SFBB recognizing S5- and S7-RNases but does not recognize S3-RNase (Fig. 5), despite the fact that there is a high amino sequence identity of 94.2% between S3- and S5-RNases (Table S15). If there are minor changes in the amino acid residues of S5-RNase, the mutant S5-RNase may not be recognized by PbrSFBB.VIII-S17, potentially leading to cross-incompatibility. This assumption is consistent with the non-self-recognition system, as the SLFs within an S-haplotype may not recognize approximately 5% of S-RNases in a population (Kubo et al. 2015).

Among the SFBBs that interact with at least one non-self S-RNase, PbrSFBB. VIII-S17 has the ability to recognize a majority of non-self S-RNases. On the other hand, PbrSFBB.Ia-S17, PbrSFBB.Ib-S17, PbrSFBB.II-S17, PbrSFBB.X-S17, PbrSFBB.XI-S17, PbrSFBB.XII-S17, and PbrSFBB.XVIII-S17 can only recognize a limited number of non-self S-RNases (Fig. 5). It should be noted that SFBB.VIII and SFBB.XVIII are present in both Pyrus and Malus S-loci, whereas deletion events occasionally occurs in the other SFBBs (Fig. 2a). It has been determined that these gene deletions may result from ectopic recombination between two LTR retrotransposons belonging to the same family and having the same orientation on the same chromosome (Kumar and Bennetzen 1999). Consequently, the presence of enriched LTR retrotransposons in the two flanking regions of these six SFBBs may be responsible for these deletions (Table S13; Shang et al. 2017). The absence of these SFBBs may lead to cross-incompatibility.

Furthermore, we observed deletions in most genes located between SFBB.XVI and SFBB.VIII (Table S5), with the exception of SFBB.III, SFBB.IV, SFBB.XIV and SFBB.XVIII. Additionally, frequent inversions and/or translocations were detected in this genomic region (Figs. 2a and S5 to S11). It has been determined that inversion may arise from unequal recombination events between two LTR retrotransposons with the opposite orientation on the same chromosome, while translocations may result from ectopic recombination between two LTR retrotransposons on different chromosomes (Kumar and Bennetzen 1999). Therefore, the presence of enriched LTR retrotransposons in Pyrus and Malus S-loci may be linked to the re-arrangement of SFBBs. This finding aligns with previous research indicating that transposable elements contribute to the reshaping of the Petunia S-locus (Wu et al. 2020).

In conclusion, based on the genomes assembled by long-read, we revealed the 17–21 SFBBs in Pyrus and Malus S-loci spanning a range of 1.35 to 2.64 Mb. These SFBBs could be classified into 22 groups, with 18 of them being specifically expressed in pollen. Notably, eight SFBBs demonstrated interactions with at least one non-self S-RNase, while the remaining SFBBs failed to recognize any S-RNase. These findings provide compelling evidence supporting the existence of a collaborative non-self-recognition system governing self-incompatibility in pear species.

Materials and methods

Plant materials

In accordance with a previous study on S-genotypes (Wang et al. 2017), a total of 10 pear cultivars, ‘Yali’ (S17S21), ‘Huanghua’ (S1S2), ‘Cuiguan’ (S3S5), ‘Dangshansuli’ (S7S17), ‘Xinxing’ (S4S7), ‘Mantianhong’ (S4S12), ‘Jinhua’ (S3S18), ‘Hongtaiyang’ (S8S35), and ‘Nanyue’ (S2S9), were selected for this investigation. These cultivars were maintained in an orchard located at Hushu, Nanjing, China. During the spring season, styles and anthers from these cultivars, as well as leaves from these cultivars were collected. The collected leaves and styles were promptly frozen in liquid nitrogen and then stored −80℃ until further use. The anthers, on the other hand, were enclosed in a dry container to facilitate the release of pollen grains. These pollen grains were then enclosed in sulfuric acid paper bags and stored in silica gel at −20℃ until needed.

Following the cross-pollination of ‘Yali’ and ‘Xueqing’ (S3S16), a total of 270 seeds were obtained and subjected to vernalization by placing them in moist sand at 4℃. The resulting seedlings were subsequently planted in an orchard located at Jiangpu, Nanjing, China. After eliminating pseudo-hybrid plants, 113 trees reached the flowering stage. Young leaves from these trees were collected during the spring season and stored −80℃ until analysis.

DNA sequencing and genome assembly

The genomic DNA of ‘Yali’ was extracted from young leaves using the DNAsecure Plant Kit (Tiangen, Beijing, China) and subsequently fragmented. These fragments underwent size selection using AMPure PB Magnetic Beads (Pacific Biosciences), were repaired for any damages, and ligated to hairpin adapters. The resulting fragments were then used to construct a long-read sequencing library with an average insert size of 20 kb. This library was sequenced using the single-molecule real-time (SMRT) DNA sequencer on the PacBio sequel platform. Moreover, a short-read sequencing library was constructed using the Illumina TruSeq library construction kit (Illumina, San Diego, CA), and a Hi-C sequencing library was constructed using the standard protocol (Belton et al. 2012). Both libraries were sequenced on the Illumina Novaseq platform (Illumina).

To improve the quality of the PacBio reads, a self-correction step was perfomed by aligning all sequenced reads pairwise. Subsequently, these corrected reads were assembled into contigs using an overlap-layout-consensus algorithm and FALCON (https://github.com/PacificBiosciences/FALCON; Chin et al. 2016). The specific parameters used for the assembly were as follows: length_cutoff_pr = 10,000, max_diff = 95, and max_cov = 95. The resulting contigs were further refined through polishing steps using Quiver (Chi et al. 2013), which involved aligning the SMRT reads, as well as using pilon software (Walker et al. 2014) and aligning Illumina paired-end reads.

In order to evaluate the quality of the assembly, several assessments were conducted. Initially, sequence consistency was examined by aligning paired-end reads to the assembly using BWA software (http://bio-bwa.sourceforge.net/). The assembly of the ‘Yali’ genome had an average sequencing depth of 101.1-fold and a coverage of 99.83%. Additionally, over 97.5% of paired-end reads could be mapped to the assemblies. Furthermore, the completeness of the assembly was assessed using the BUSCO (http://busco.ezlab.org/).

Genome annotation

Protein-coding genes were predicted with three different strategies. The first strategy involved de novo prediction, which was conducted according to the previous reported method (Shang et al. 2020) without any modifications. For the second strategy, homolog-based prediction was employed using TBLASTN (Altschul et al. 1997), aligning the protein sequences of white pear (Pyrus x bretschneideri), apple (Malus domestica), sweet cherry (Prunus avium), Japanese apricot (Prunus mume), peach (Prunus persica), and oriental cherry (Prunus yedoensis) against the assembly. In addition, transcriptome sequencing was carried out on pollen grains, styles, leaves, and/or developing fruits. The transcriptome-based prediction and function annotation of protein-coding genes were performed following the previous reported method (Shang et al. 2020) without any modification. The coding sequencing of these protein-coding genes can be found in Files S1 and S2, and their annotations are provided in Files S3 and S4. Furthermore, non-coding RNAs and repeats were predicted using both de novo and homolog-based approaches, with the details mirroring those of the previous report (Shang et al. 2020). The annotations of non-coding RNAs and repeats are respectively listed in Tables S3 and S2, respectively.

Identification and sequence analysis of Pyrus S-locus F-box genes

The nucleotide acid sequences of S17-RNase were utilized as a reference to determine the location of Pyrus S17-locus in the ‘Yali’ genome. It was observed that S17-RNase resided within the contig 33 in the ‘Yali’ genome. Next, the F-box motif's amino acid sequences (Figure S1) were subjected to BLAST against Chromosome 17 in the contig 33 in the ‘Yali’ genome. This search was performed using BioEdit software version 7.0.9.0 (http://www.mbio. ncsu.edu/bioedit/bioedit. html) with default parameters. Subsequently, the full-length sequences of F-box genes were manually predicted using Geneious Prime version 2021.1.1 (www.geneious.com). A similar approach was followed to predict the F-box genes in other S-loci of Pyrus, Malus, and Prunus species. The genomes used in this study were provided in Table S1.

To clone the SFBBs in the Pyrus S17-locus, first-strand cDNA from ‘Yali’ pollen were employed, while the rest were obtained from the National Center for Biotechnology Information database (https://www.ncbi.nlm.nih.gov/nucleotide/), as listed in Table S16. Sequence alignment was conducted using ClustalW, and sequence similarity was calculated using BioEdit software. A phylogenetic tree was constructed using MEGA software version 5.04 (www.megasoftware.net) with the neighbor-joining method. Bootstrap values were calculated from 1000 replicates and recorded for values exceeding 0.6. The resulting phylogenetic tree was color-coded using the iTOL tool (https://itol.embl.de/). To identify conserved residues of SFBBs, WebLogo tool (http://weblogo.berkeley.edu/logo.cgi) was utilized. Furthermore, gene duplication events were predicted using the MCScanX package (Wang et al. 2012).

Yeast-two hybrid assay

Total RNAs were extracted from the styles of seven pear cultivars. The first-strand cDNA was synthesized and used as template for PCR amplification of S-RNase genes. A subset of nine S-RNases were randomly chosen for the Y2H assay. From ‘Huanghua’, Pyrus S1-RNase and S2-RNase were amplified. From ‘Cuiguan’, S3-RNase and S5-RNase were amplified. From ‘Dangshansuli’, ‘Nanyue’, ‘Mantianhong’, ‘Yali’, ‘Jinhua’, and ‘Hongtaiyang’, S7-RNase, S9-RNase, S12-RNase, S17-RNase, S18-RNase, and S35-RNase were amplified, respectively. The sequence coding signal peptide were removed from these S-RNases, and the remaining coding sequences of these S-RNases, including the HV region, were amplified from the styles of pear cultivars. The amplified product was ligated into the pGADT7 vector that was digested by EcoR I and BamH I enzymes (New England Biolabs, Illinois, USA), using the 2X MultiF Seamless Assembly Mix (ABclonal, Wuhan, China) that contains a recombinase. The full-length coding sequences of SFBBs in the Pyrus S17-locus were amplified from ‘Yali’ pollen. The amplified product was inserted into the pGBKT7 vector that was digested by EcoR I and Sal I enzymes (New England Biolabs), using the 2X MultiF Seamless Assembly Mix (ABclonal). The Y2H was performed in the yeast strain Y2HGold using the Matchmaker Gold Yeast Two-Hybrid System (Clontech, Palo Alto, CA). Moreover, to generate an S-RNase protein without the HV region, the coding sequences between the signal peptide and HV, and between HV and stop codon of the PbS5-RNase were successively inserted into the same one pGADT7 vector. The Y2H assay was conducted to assess the interaction between PbrSFBB.VIII-S17 and PbrS5-RNase without signal peptide and HV region. The primer sequences used for these experiments are provided in Table S17.

Firefly Luciferase complementation imaging (FLCI) assay

The interaction between eight S-RNases and eight SFBBs were assessed using the FLCI assay. The full-length coding sequences of Pyrus S1-, S2-, S3-, S5-, S7-, S12-, S18-, and S35-RNase genes were individually inserted into the pCAMBIA1300-nLuc vector. Similarly, the full-length coding sequences of PbrSFBB.Ia-S17, PbrSFBB.Ib-S17, PbrSFBB.II-S17, PbrSFBB.VIII-S17, PbrSFBB.X-S17, PbrSFBB.XI-S17, PbrSFBB.XII-S17, and PbrSFBB.XVIII-S17 genes were individually inserted into the pCAMBIA1300-cLuc vector. These constructs were organized into 21 groups, each representing one of the 21 interaction pairs that were examined in the Y2H assay. The constructs in each group were transiently co-expressed in tobacco leaves after being introduced into A. tumefaciens strain GV3101. Luciferase activity was checked at 3 days after infiltration using a Luminescence & Fluorescence Imaging System (PIXIS 1024B/BUV, Teledyne Princeton Instruments, USA). The primer sequences used for this experiment are provided in Table S17.

Subcellular localization

The full-length CDS of PbSFBB.Ia-S17, PbSFBB.Ib-S17, PbSFBB.II-S17, PbSFBB.VIII-S17, PbSFBB.X-S17, PbSFBB.XI-S17, PbSFBB.XII-S17, and PbSFBB.XVIII-S17 genes were individually inserted into a GFP vector to construct the CAMV-35S-drived fusion protein (Wu et al. 2023), PbSFBB.Ia-S17-GFP, PbSFBB.Ib-S17-GFP, PbSFBB.II-S17-GFP, PbSFBB.VIII-S17-GFP, PbSFBB.X-S17-GFP, PbSFBB.XI-S17-GFP, PbSFBB.XII-S17-GFP, and PbSFBB.XVIII-S17-GFP. Moreover, the full-length CDS of aquaporin PIP2A (At3g53420) were amplified from Arabidopsis and was inserted into a mCherry vector to construct the PIP2A::mCherry fusion protein as a plasma membrane control (Santiago et al., 2013). The constructs were transferred into A. tumefaciens strain GV3101 and were then transiently co-expressed in tobacco epidermal cells. The protoplasts of tobacco epidermal cells were isolated according to the method described in a previous study (Panda et al., 2024). The fluorescence was observed using a confocal microscope LSM780 (Zelss, Germany). All the primer sequences are listed in Table S17.

Data availability

Data supporting the findings of this work are available within the paper and its Supplementary Information files. The datasets are available from the corresponding author upon request. Raw genome sequencing reads and the assembly of ‘Yali’ genome are respectively deposited into the NCBI sequence read archive (SRA) with the BioProject number PRJNA852504. Sequence data of 18 S17-locus F-box brother genes are deposited into the GenBank with the accession numbers ON918613-ON918630.

Abbreviations

SFBB:

S-locus F-box brother

BUSCO:

Benchmarking universal single-copy orthologs

SLF:

S-locus F-box

SFB:

S-locus haplotype F-box

Y2H:

Yeast two-hybrid

FLCI:

Firefly luciferase complementation imaging

GFP:

Green fluorescent protein

QRT-PCR:

Quantitative real-time PCR

RPKM:

Reads per kilobase per million

SI:

Self-incompatibility

References

Download references

Acknowledgements

This work was supported by the Bioinformatics Center of Nanjing Agricultural University.

Funding

The work was supported by the National Key Research and Development Program (2022YFD1200501), the National Science Foundation of China (32172511), the Key Research and Development Program (Modern Agriculture) of Jiangsu Province (BE2022381), the Earmarked Fund for the China Agriculture Research System (CARS-28).

Author information

Authors and Affiliations

Authors

Contributions

S.Z. and C.G. designed the project. C.G. performed genome sequencing and data analysis with assistance from X. W., X.Q., Q. L., and Z. W.. Y.X. and L.W. contributed to most experiments with assistance from K. Q. and M.H.. C.G. drafted the manuscript. S. Z. revised the manuscript.

Corresponding authors

Correspondence to Chao Gu or Shaoling Zhang.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interest. Prof. Shaoling Zhang is a member of the Editorial Board for Molecular Horticulture. He was not involved in the journal’s review of, and decisions related to this manuscript.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

43897_2024_132_MOESM1_ESM.zip

Supplementary Material 1: Table S1 Comparison of ‘Yali’ genome with previously published assemblies of Pyrus and Malus species. Table S2 Annotation of the repeats in ‘Yali’ genome. Table S3 Annotation of the non-coding RNAs in ‘Dananguo’ and 'Yali' genomes. Table S4 Identification of the F-box genes in Pyrus, Malus and Prunus S-loci. Table S5 Function annotation of the predicted genes in S-loci. Table S6 Sequence similarity (%) among Pyrus and Malus SFBB genes. Table S7 Sequence similarity (%) among Prunus SFB and SLF genes. Table S8 Sequence similarity among Prunus SFB and SLF genes. Table S9 Sequence similarity (%) among Pyrus and Malus S-RNase genes. Table S10 Prediction of gene duplication events of Pyrus and Malus SFBB genes. Table S11 Sequence similarity of the non-coding flanking sequences of SFBBs in Pyrus and Malus S-loci. Table S12 Analysis of number and length of LTR retrotransposon in different S-loci. Table S13 Identification of the LTR retrotransposon in different S-loci. Table S14 RPKM values of the genes commonly existed in the tested S-loci. Table S15 Sequence similarity (%) among the reported Pyrus S-RNase genes. Table S16 The accession numbers of S-RNase and S-locus F-box genes in Pyrus, Malus, and Prunus.Table S17 Primers used in this study. Figure S1 Isolation of the conserved F-box motif in the reported S-locus F-box proteins in Pyrus and Malus. The accession numbers of these F-box proteins were listed in Table S13. Figure S2 Phylogenetic classifications of S-locus F-box genes in Prunus. The S-locus F-box (SLF/SFB) proteins in Prunus comprised by 12 groups, SLF1→SLF11 and SFB. Each group were highlighted with different colors. Figure S3 Phylogenetic analysis of the F-box genes identified from this and previous studies. Cycles with black color present the F-box genes identified from previous study (Huang et al., 2023). The rates (%) of different types of gene duplication events (dispersed, proximal, tandem and transposed) of the S-locus F-box genes were calculated for three Pyrus (PbrS17, PcS101, and PpyS5) and three Malus (MdS2, MdS3, MdS5, and MdS9) S-loci. The details of these duplication events are listed in Table S10. Figure S4 Gene duplication events of Pyrus and Malus SFBBs in whole genome. Figures S5 and S6 Synteny analysis of SFBB genes in Pyrus and Malus S-loci. Arrowhead present the transcriptional direction of a gene. The arrowheads with black color present S-RNase gene, while the arrowheads with other colors present S-locusF-box genes. The characters above arrowhead are the classification of S-locusF-box genes in Pyrus, Malus, and Prunus species. The Arabic numerals below the dotted line are the physical distance (Kb) between two adjacent F-box genes. The Chromosome location of these F-box genes were listed in Table S4. Figures S7 to S11 Synteny analysis of SFBB genes in Malus S-loci and Pyrus S17-, S101-, S67-,S7-, and Pyrus S5-loci. Arrowhead present the transcriptional direction of a gene. The arrowheads with black color present S-RNase gene, while the arrowheads with other colors present S-locusF-box genes. The characters above arrowhead are the classification of S-locusF-box genes in Pyrus, Malus, and Prunus species. The Arabic numerals below the dotted line are the physical distance (Kb) between two adjacent F-box genes. The Chromosome location of these F-box genes were listed in Table S4. Figures S12 to S14 Comparison analysis of the 5 kb non-coding flanking sequences of SFBBs in Pyrus S17-, S5-, S67-, and S101-loci. Figures S15 to S19 Comparison analysis of the 5 kb non-coding flanking sequences of SFBBs in Malus S2-, S3-, S5-, and S9-loci. Figures S20 to S40 Comparison analysis of the 5 kb non-coding flanking sequences of SFBBs in groups Ia, Ib, II to VIII, X to XIX, XXI, and XXII. (a) A snapshot showing the alignment of the 5kb upstream sequences of SFBBs. (b) Pairwise identity of the 5kb upstream sequences of SFBBs. (c) A snapshot showing the alignment of the 5kb downstream sequences of SFBBs. (d) Pairwise identity of the 5kb downstream sequences of SFBBs. Figure S41 Identification of S-genotypes of the individuals in the cross-pollinated progeny of ‘Yali’ × ‘Xueqing’. The lanes 1-42 represent the individuals. Q and Y represent the pear cultivars ‘Xueqing’ and ‘Yali’, respectively. M indicates the DNA ladder. Figures S42 to S60:The amino acid sequences of PbrSFBB.Ia-S17, PbrSFBB.Ib-S17, PbrSFBB.II-S17, PbrSFBB.III-S17, PbrSFBB.IV-S17, PbrSFBB.VI-S17, PbrSFBB.VII-S17, PbrSFBB.VIII-S17, PbrSFBB.X-S17, PbrSFBB.XI-S17, PbrSFBB.XII-S17, PbrSFBB.XIII-S17, PbrSFBB.XIV-S17, PbrSFBB.XV-S17, PbrSFBB.XVI-S17, PbrSFBB.XVIII-S17, PbrSFBB.XIX-S17, PbrSFBB.XX-S17, and PbrSFBB.XXII-S17 in any individual including S17-RNase were identical to that in ‘Yali’. YX-1, 2, 4, 5, 9, 10, 14, 19, 20, 23, 24, 28, 29, 30, 32, 34, 35, 36, 37, 38, 41, and 42 are the individuals of the cross-pollinated progeny of ‘Yali’ × ‘Xueqing’. Figure S61 Expression analysis of S-RNase and SFBB genes in Pyrus S17-locus in different tissues. Figure S62 Self-activation of S-RNase and SFBB proteins in Pyrus S17-locus in yeast cells. AD and BD present the pGADT7 and pGBKT7 vectors respectively. Positive and negative controls were assigned as P53/T-antigen and Lamin/T-antigen, respectively. SD/-Trp/-Leu indicates the SD medium lacking Trp and Leu; SD/-Trp/-Leu/-His/-Ade indicates the SD medium lacking Trp, Leu, His, and Ade; SD/-Trp/-Leu-His/-Ade + X-α-gal indicates that x-a-gal was mixed into the SD medium lacking Trp, Leu, His, and Ade. Files S1 The coding sequences of all predicted genes in ‘Yali’ genome. Files S2 Gene annotation of‘Yali’ genome.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gu, C., Xu, Y., Wu, L. et al. Long-read genome sequencing reveals the sequence characteristics of pear self-incompatibility locus. Mol Horticulture 5, 13 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s43897-024-00132-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s43897-024-00132-0

Keywords