In silico analysis was performed on missense variants, small deletion variants, insertion variants, nonsense variants, and splicing variants in our cohort using various algorithms. The majority of the in silico algorithms, such as PolyPhen2, FATHMM, PROVEAN, VEST3, MetaSVM, MetaLR, M-CAP, CADD, DANN, fathmm-MKL, GenoCanyon, and REVEL, indicated that the 12 missense variants in our cohort were damaging or probably damaging to protein structure and could cause disease. The evaluation of the conservation of these missense variants was performed using GERP, phyloP, PhastCons, and SiPhy, which confirmed their conservation. Evaluation of small deletions, insertions (including the novel variant c.336_337insT), and nonsense variants was performed using phyloP, phastCons, and MutationTaster, revealing that these variants were conserved by phyloP and phastCons and predicted to be disease causing by MutationTaster. The HSF Pro platform (HSF and MaxEnt) and SpliceAI were employed to evaluate the potential effects of the splicing variants. MaxEnt and HSF analysis indicated that variant c.53-1G>A may cause a broken wild-type acceptor site. SpliceAI predicted that this variant could cause acceptor loss within the pre-mRNA position of −1 bp, with a probability of 0.69. Similarly, variant c.327-2A>G was predicted to cause a broken wild-type acceptor site by MaxEnt and HSF. SpliceAI analysis suggested that this variant may cause acceptor loss within the pre-mRNA position of −2 bp, with a high probability of 0.99. For additional information on the results of the in silico analysis, refer to
Supplementary Tables S3 to
S5. Remarkably, Vijayasarathy et al.
51 previously discussed the variant c.52G>A, which involved the terminal nucleotide of exon 1 and was within the 5′ donor splice site of intron 1. The c.52G>A variant disrupted base pairing between the splice site and small nuclear RNA, thereby reducing normal splicing efficiency.
To investigate the hot region of variants in
RS1, we applied the in silico algorithm Rhapsody, which has the benefit of predicting the effect of substituting any amino acid residue in a protein with any of the other 19 amino acids, enabling in silico site-directed mutagenesis analysis. Consequently, we identified residue changes that may be poorly tolerated, which can provide valuable insights for future research. Through protein-wide, site-directed mutagenesis, we identified the top-scoring variants in
RS1 with a Rhapsody score of at least 0.86. Our analysis revealed that the highest scoring variants were located in eight locations in the protein, indicating that these crucial residues have the potential to cause disease when mutated (
Supplementary Fig. S4).