gnomAD Canada v1.0 · HostSeq
NM_000264.5:c.-9_-4del
NP_000255.2:p.?  ·  PTCH1
GRCh38
chr9:95,508,364 TGCCGCC>T
GRCh37
chr9:98270646 TGCCGCC>T
rsID
rs794726881
Type
MIXED · 5 prime UTR variant
Allele type
del · 11 alt
Cohort
HostSeq (10,487 genomes)
Flags
lcr, was_split
Allele frequency
1.2686%
230 / 18,130 alleles
lcr PASS
Allele count
230
adjusted · raw: 233
Allele number
18,130
adjusted · raw: 18,422
Allele frequency
1.27e-02
1.2686% MAF
Homozygotes
0
alt hom carriers
grpmax FAF95
1.04e-02
South Asian · AC=21 AN=1,348
FAF95 max
1.32e-02
European (non-Finnish)
FAF99 max
1.25e-02
European (non-Finnish)
Cohort size
10,487
whole genomes
Raw vs adjusted allele counts
Genotype set · AC · AN · AF · Hom
Adjusted (PASS genotypes only) · 230 · 18,130 · 1.27e-02 · 0
Raw (all genotypes) · 233 · 18,422 · 1.26e-02
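The AF column is the allele count divided by the allele number. A minimal Python sketch of that arithmetic, using the adjusted and raw counts listed above (the helper name `allele_frequency` is illustrative and not part of any gnomAD tooling):

```python
def allele_frequency(ac: int, an: int) -> float:
    """Return AC / AN, or 0.0 when no alleles were called (AN == 0)."""
    return ac / an if an else 0.0


adjusted = allele_frequency(230, 18_130)  # PASS genotypes only
raw = allele_frequency(233, 18_422)       # all genotypes

print(f"adjusted: {adjusted:.2e} ({adjusted:.4%})")
print(f"raw:      {raw:.2e}")
```

The adjusted value is the one reported as the headline allele frequency on this page.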
Allele frequency by ancestry
GRCh38 · HostSeq genomes · Canada
Population [code] · AC · AN · AF · Hom
African/African American [afr] · 1 · 1,012 · 0.0988% · 0
Latino/Admixed American [amr] · 6 · 824 · 0.7282% · 0
Ashkenazi Jewish [asj] · 4 · 822 · 0.4866% · 0
East Asian [eas] · 0 · 1,326 · 0
European (Finnish) [fin] · 0 · 8 · 0
Middle Eastern [mid] · 7 · 140 · 5.0000% · 0
European (non-Finnish) [nfe] · 173 · 11,524 · 1.5012% · 0
Remaining individuals [oth] · 18 · 1,126 · 1.5986% · 0
South Asian [sas] (grpmax) · 21 · 1,348 · 1.5579% · 0
Total · 230 · 18,130 · 1.2686% · 0
Filtering allele frequency (FAF)
Population [code] · FAF 95% · FAF 99%
Overall · 1.13e-02 · 1.08e-02
Latino/Admixed American [amr] · 3.17e-03 · 2.17e-03
European (non-Finnish) [nfe] · 1.32e-02 · 1.25e-02
South Asian [sas] · 1.04e-02 · 8.77e-03
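The page does not state how FAF is derived. The sketch below assumes the Poisson-based definition used by the main gnomAD project: the largest population allele frequency for which the upper bound of the one-sided confidence interval on the allele count is still below the observed AC. The helpers `poisson_cdf` and `filtering_af` are hypothetical names written for this illustration, using only the Python standard library.

```python
import math


def poisson_cdf(k: int, lam: float) -> float:
    """P(X <= k) for X ~ Poisson(lam), summed in log space for stability."""
    if lam <= 0.0:
        return 1.0
    log_lam = math.log(lam)
    return sum(
        math.exp(i * log_lam - lam - math.lgamma(i + 1)) for i in range(k + 1)
    )


def filtering_af(ac: int, an: int, ci: float = 0.95) -> float:
    """Assumed FAF definition: largest AF with P(X <= AC - 1 | AN * AF) >= ci."""
    if ac == 0 or an == 0:
        return 0.0
    lo, hi = 0.0, float(ac)  # the CDF at lambda = AC is already well below ci
    for _ in range(100):     # bisection on the expected allele count lambda
        mid = (lo + hi) / 2.0
        if poisson_cdf(ac - 1, mid) >= ci:
            lo = mid
        else:
            hi = mid
    return lo / an


# European (non-Finnish) counts from the ancestry table: AC=173, AN=11,524
print(f"{filtering_af(173, 11_524, 0.95):.2e}")
print(f"{filtering_af(173, 11_524, 0.99):.2e}")
```

Under that assumption, the European (non-Finnish) counts give roughly 1.32e-02 at 95% and 1.25e-02 at 99%, in line with the table above.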
Sex-stratified allele counts are based on inferred chromosomal sex (XX / XY) from coverage of sex chromosomes in the HostSeq cohort.
XX genotypes
138 / 10,440  ·  1.3218%
Population [code] · AC · AN · AF · Hom
African/African American [afr] · 1 · 548 · 0.182% · 0
Latino/Admixed American [amr] · 2 · 452 · 0.442% · 0
Ashkenazi Jewish [asj] · 1 · 416 · 0.240% · 0
East Asian [eas] · 0 · 742 · 0
European (Finnish) [fin] · 0 · 8 · 0
Middle Eastern [mid] · 3 · 66 · 4.545% · 0
European (non-Finnish) [nfe] · 111 · 6,980 · 1.590% · 0
Remaining individuals [oth] · 11 · 604 · 1.821% · 0
South Asian [sas] · 9 · 624 · 1.442% · 0
XY genotypes
92 / 7,690  ·  1.1964%
Population [code] · AC · AN · AF · Hom
African/African American [afr] · 0 · 464 · 0
Latino/Admixed American [amr] · 4 · 372 · 1.075% · 0
Ashkenazi Jewish [asj] · 3 · 406 · 0.739% · 0
East Asian [eas] · 0 · 584 · 0
European (Finnish) [fin] · 0 · 0
Middle Eastern [mid] · 4 · 74 · 5.405% · 0
European (non-Finnish) [nfe] · 62 · 4,544 · 1.364% · 0
Remaining individuals [oth] · 7 · 522 · 1.341% · 0
South Asian [sas] · 12 · 724 · 1.657% · 0
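If the XX and XY strata partition the adjusted genotypes, their counts should add up to the combined per-ancestry totals. A small Python sketch of that consistency check, with the (AC, AN) pairs copied from the two sex-stratified tables (the variable names are illustrative):

```python
# (AC, AN) per ancestry group, copied from the XX and XY tables.
xx = {
    "afr": (1, 548), "amr": (2, 452), "asj": (1, 416),
    "eas": (0, 742), "fin": (0, 8), "mid": (3, 66),
    "nfe": (111, 6_980), "oth": (11, 604), "sas": (9, 624),
}
xy = {
    "afr": (0, 464), "amr": (4, 372), "asj": (3, 406),
    "eas": (0, 584), "fin": (0, 0), "mid": (4, 74),
    "nfe": (62, 4_544), "oth": (7, 522), "sas": (12, 724),
}

# Sum the two strata group by group, then across all groups.
combined = {
    pop: (xx[pop][0] + xy[pop][0], xx[pop][1] + xy[pop][1]) for pop in xx
}
total_ac = sum(ac for ac, _ in combined.values())
total_an = sum(an for _, an in combined.values())

print(combined["nfe"], total_ac, total_an)
```

The summed values can then be compared against the Total row of the combined ancestry table (230 / 18,130).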
Variant quality scores
MQ
Mapping quality
249.3776
FS
Fisher strand bias · lower = better
0.0
MQRankSum
MQ rank sum test
0.0
SOR
Strand odds ratio
0.6823
ReadPosRankSum
Read position rank sum
-0.083
AS_pab_max
Max posterior allele balance
1.0
RF
Random forest score
0.9498
InbreedingCoeff
Inbreeding coefficient
-0.0128
Region flags
LCR (low complexity region) · segdup (segmental duplication) · monoallelic
Allele balance · alt carriers
Allele balance distribution for alt carriers.
Expected heterozygous AB ≈ 0.5. Values near 0 or 1 may indicate homozygosity or data quality issues.
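Allele balance is the fraction of reads at a site that support the alternate allele. A minimal sketch of how a single genotype's AB could be computed and read against the expectation above; the read counts and the 0.2 / 0.8 cutoffs are hypothetical values chosen for illustration, not thresholds documented for this dataset:

```python
def allele_balance(ref_reads: int, alt_reads: int) -> float:
    """Fraction of reads supporting the alternate allele."""
    total = ref_reads + alt_reads
    if total == 0:
        raise ValueError("no reads cover the site")
    return alt_reads / total


def describe(ab: float, low: float = 0.2, high: float = 0.8) -> str:
    """Rough interpretation of AB; the cutoffs are illustrative assumptions."""
    if ab < low:
        return "near 0: possible reference call or quality issue"
    if ab > high:
        return "near 1: possible homozygous alt or quality issue"
    return "consistent with heterozygous"


ab = allele_balance(ref_reads=14, alt_reads=16)  # hypothetical read counts
print(round(ab, 3), describe(ab))
```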
Read depth distribution (all genotypes)
Read depth distribution across all genotypes.
Genotype quality distribution
Genotype quality distribution across all genotypes.
Strand bias table (SB)
Allele · Forward · Reverse
Reference · 34591 · 33118
Alternate · 46143 · 44664
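The strand odds ratio (SOR) can be recomputed from these four counts. The sketch below assumes SOR follows the GATK StrandOddsRatio definition (a pseudocount of 1 added to each cell, then the log of a symmetric odds ratio scaled by the reference and alternate strand ratios); the function name `strand_odds_ratio` is illustrative.

```python
import math


def strand_odds_ratio(ref_fwd: int, ref_rev: int, alt_fwd: int, alt_rev: int) -> float:
    """SOR under the assumed GATK-style definition."""
    # Add a pseudocount of 1 to every cell to avoid division by zero.
    rf, rr, af, ar = (x + 1.0 for x in (ref_fwd, ref_rev, alt_fwd, alt_rev))
    # Symmetric odds ratio: R + 1/R, where R = (rf / rr) * (ar / af).
    symmetric = (rf * ar) / (rr * af) + (rr * af) / (rf * ar)
    ref_ratio = min(rf, rr) / max(rf, rr)
    alt_ratio = min(af, ar) / max(af, ar)
    return math.log(symmetric) + math.log(ref_ratio) - math.log(alt_ratio)


sor = strand_odds_ratio(34591, 33118, 46143, 44664)
print(round(sor, 4))
```

With the counts from the table, this evaluates to about 0.682, close to the SOR of 0.6823 listed under variant quality scores.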
Genotype quality · alt carriers only
GQ distribution for alt allele carriers. High GQ (≥20) indicates confident heterozygous calls.
Read depth · alt carriers only
Depth distribution for alt allele carriers.
Applied filters
PASS singleton was_split
Age at recruitment for heterozygous carriers observed in the HostSeq cohort. Age data is available only for a subset of participants.
Age distribution · heterozygous carriers
Age distribution for heterozygous carriers.
Carriers below age 30: 32 · Carriers above age 80: 10
Age distribution · homozygous carriers
Age distribution for homozygous carriers.
Dataset information
Dataset name
gnomAD Canada v1.0
Cohort
HostSeq
Data type
Whole genome sequencing
Reference genome
GRCh38
Total genomes
10,487
Alleles (this variant)
18,130
Alt allele count
230
Homozygotes
0
Cross-reference links
gnomAD v4.1 (global) gnomad.broadinstitute.org
gnomAD v2.1 (exome) gnomad.broadinstitute.org
ClinVar — NM_000264.5:c.-9_-4del ncbi.nlm.nih.gov
Variant interpretation (LYFE Sciences)
Acknowledgements & data use
Required attribution · gnomAD Canada v1.0
About this display
LYFE Sciences is an independent, unfunded variant interpretation tool. This page displays population frequency data from gnomAD Canada v1.0; I did not generate, fund, or contribute to this dataset. All data belongs to the gnomAD Canada project and the HostSeq cohort. I am presenting it in a convenient format alongside variant interpretation.
Data source
All population frequency data on this page originates from gnomAD Canada v1.0, produced from the HostSeq whole-genome sequencing cohort and made publicly available by the BC Genome Sciences Centre (BCGSC). The official gnomAD Canada browser is at gnomad.ca and the BCGSC instance at bcgsc.ca/gnomad. Please cite the original resource if you use this data in research.
Population labels
Population ancestry labels are reproduced exactly as provided by gnomAD Canada and the HostSeq cohort. These labels reflect ancestry inference using gnomAD v4 reference population PCA and are governed by the Indigenous data sovereignty principles of the Silent Genomes Project and the Indigenous Background Variant Library (IBVL).
Key references
1. Yoo S et al. HostSeq: a Canadian whole genome sequencing and clinical data resource. BMC Genom Data. 2023. doi:10.1186/s12863-023-01128-3
2. Chen S*, Francioli LC* et al. A genomic mutational constraint map using variation in 76,156 human genomes. Nature. 625, 92–100 (2024). doi:10.1038/s41586-023-06045-0