1
|
Musolf AM, Simpson CL, Moiz BA, Pikielny CW, Middlebrooks CD, Mandal D, de Andrade M, Cole MD, Gaba C, Yang P, You M, Li Y, Kupert EY, Anderson MW, Schwartz AG, Pinney SM, Amos CI, Bailey-Wilson JE. Genetic Variation and Recurrent Haplotypes on Chromosome 6q23-25 Risk Locus in Familial Lung Cancer. Cancer Res 2021; 81:3162-3173. [PMID: 33853833 DOI: 10.1158/0008-5472.can-20-3196] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 02/01/2021] [Accepted: 04/09/2021] [Indexed: 11/16/2022]
Abstract
Although lung cancer is known to be caused by environmental factors, it has also been shown to have genetic components, and the genetic etiology of lung cancer remains understudied. We previously identified a lung cancer risk locus on 6q23-25 using microsatellite data in families with a history of lung cancer. To further elucidate that signal, we performed targeted sequencing on nine of our most strongly linked families. Two-point linkage analysis of the sequencing data revealed that the signal was heterogeneous and that different families likely had different risk variants. Three specific haplotypes were shared by some of the families: 6q25.3-26 in families 42 and 44, 6q25.2-25.3 in families 47 and 59, and 6q24.2-25.1 in families 30, 33, and 35. Region-based logarithm of the odds scores and expression data identified the likely candidate genes for each haplotype overlap: ARID1B at 6q25.3, MAP3K4 at 6q26, and UTRN (6q24.1) and PHACTR2 (6q24.2). Further annotation was used to zero in on potential risk variants in those genes. All four genes are good candidate genes for lung cancer risk, having been linked to either lung cancer specifically or other cancers. However, this is the first time any of these genes has been implicated in germline risk. Functional analysis of these four genes is planned for future work. SIGNIFICANCE: This study identifies four genes associated with lung cancer risk, which could help guide future lung cancer prevention and treatment approaches.
Collapse
Affiliation(s)
- Anthony M Musolf
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland
| | - Claire L Simpson
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland.,Department of Genetics, Genomics and Informatics and Department of Ophthalmology, University of Tennessee Health Science Center, Memphis, Tennessee
| | - Bilal A Moiz
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland
| | | | - Candace D Middlebrooks
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland
| | - Diptasri Mandal
- Department of Genetics, Louisiana State University Health Science Center, New Orleans, Louisiana
| | | | - Michael D Cole
- Geisel School of Medicine, Dartmouth College, Lebanon, New Hampshire
| | - Colette Gaba
- Department of Medicine, University of Toledo Dana Cancer Center, Toledo, Ohio
| | | | - Ming You
- Medical College of Wisconsin, Milwaukee, Wisconsin
| | - Yafang Li
- Baylor College of Medicine, Houston, Texas
| | | | | | - Ann G Schwartz
- Karmanos Cancer Institute, Wayne State University, Detroit, Michigan
| | - Susan M Pinney
- Department of Environmental Health, University of Cincinnati College of Medicine, Cincinnati, Ohio
| | | | - Joan E Bailey-Wilson
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland.
| |
Collapse
|
2
|
Musolf AM, Simpson CL, Moiz BA, de Andrade M, Mandal D, Gaba C, Yang P, You M, Kupert EY, Anderson MW, Schwartz AG, Pinney SM, Amos CI, Bailey-Wilson JE. Abstract 37: Highly aggregated lung cancer families reveal a heterogeneous cause for a previous linkage signal on 6q. Cancer Res 2020. [DOI: 10.1158/1538-7445.am2020-37] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
More Americans die every year of lung cancer than any other cancer. While the environmental risks for lung cancer are well understood, the genetic risk factors for this disease are not, even though it has been shown that lung cancer risk aggregates in families. Segregation analyses have confirmed the likelihood of rare, highly penetrant variants affecting lung cancer. Family studies offer a unique opportunity to identify such rare risk variants. We have previously identified a genome-wide significant risk locus for lung cancer using multipoint linkage analysis in highly aggregated lung cancer families. In this study, we performed targeted sequencing on this 6q23-6q27 region on 75 individuals from the 9 most strongly linked families. Parametric two-point linkage analysis using an autosomal dominant model with 10% penetrance for carriers and a 1% phenocopy rate and disease allele frequency of 1% was performed for each family. While we did not find any genome-wide significant results in any of the individual families, we did observe several interesting potential rare risk variants. First, we observed that the significant signal previously observed on 6q is recapitulated in these data but the location of the peak LOD is highly heterogeneous across families and it is likely that each family has a different risk gene. While most of the highest LOD scores were in noncoding variants, we did observe exonic variants with the highest LOD scores in two families (44 and 42). Both variants were rare, with a minor allele frequency (MAF) of approximately 1% and were in LPA and GPR126 respectively. Both these genes have been implicated somatically in lung cancer, however this is the first time that they have been implicated in the germline. While the other families have their highest LOD scores in noncoding variants, some of these genes are also good potential candidate genes including PARK2 (family 30), GRM1 (family 47), and PDE10A (family 102). All three genes have been implicated in lung cancer in some capacity, and this is the first time that GRM1 and PDE10A have been implicated as germline mutations. Again, all these variants are rare (MAF ⇐ 0.01) in the general population, which makes sense for a potential highly penetrant risk variant. In this study, we have elucidated that a previously identified risk locus on 6q25 is heterogeneous in the nine most strongly linked families, with different families appearing to be carrying different risk variants. Many of the top variants in each family are rare variants that are in good potential causal genes that have been identified for the first time here as germline risk variants for lung cancer. Further functional annotation is underway for these variants and some additional linkage analyses using other penetrance matrices may also be performed.
Citation Format: Anthony M. Musolf, Claire L. Simpson, Bilal A. Moiz, Mariza de Andrade, Diptasri Mandal, Colette Gaba, Ping Yang, Ming You, Elena Y. Kupert, Marshall W. Anderson, Ann G. Schwartz, Susan M. Pinney, Christopher I. Amos, Joan E. Bailey-Wilson. Highly aggregated lung cancer families reveal a heterogeneous cause for a previous linkage signal on 6q [abstract]. In: Proceedings of the Annual Meeting of the American Association for Cancer Research 2020; 2020 Apr 27-28 and Jun 22-24. Philadelphia (PA): AACR; Cancer Res 2020;80(16 Suppl):Abstract nr 37.
Collapse
Affiliation(s)
| | | | | | | | - Diptasri Mandal
- 4Louisiana State University Health Science Center, New Orleans, LA
| | - Colette Gaba
- 5University of Toledo Dana Cancer Center, Toledo, OH
| | | | - Ming You
- 7Medical College of Wisconsin, Milwaukee, WI
| | | | | | | | | | | | | |
Collapse
|
3
|
Musolf AM, Moiz BA, Sun H, Pikielny CW, Bossé Y, Mandal D, de Andrade M, Gaba C, Yang P, Li Y, You M, Govindan R, Wilson RK, Kupert EY, Anderson MW, Schwartz AG, Pinney SM, Amos CI, Bailey-Wilson JE. Whole Exome Sequencing of Highly Aggregated Lung Cancer Families Reveals Linked Loci for Increased Cancer Risk on Chromosomes 12q, 7p, and 4q. Cancer Epidemiol Biomarkers Prev 2019; 29:434-442. [PMID: 31826912 DOI: 10.1158/1055-9965.epi-19-0887] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Revised: 10/15/2019] [Accepted: 12/04/2019] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Lung cancer kills more people than any other cancer in the United States. In addition to environmental factors, lung cancer has genetic risk factors as well, though the genetic etiology is still not well understood. We have performed whole exome sequencing on 262 individuals from 28 extended families with a family history of lung cancer. METHODS Parametric genetic linkage analysis was performed on these samples using two distinct analyses-the lung cancer only (LCO) analysis, where only patients with lung cancer were coded as affected, and the all aggregated cancers (AAC) analysis, where other cancers seen in the pedigree were coded as affected. RESULTS The AAC analysis yielded a genome-wide significant result at rs61943670 in POLR3B at 12q23.3. POLR3B has been implicated somatically in lung cancer, but this germline finding is novel and is a significant expression quantitative trait locus in lung tissue. Interesting genome-wide suggestive haplotypes were also found within individual families, particularly near SSPO at 7p36.1 in one family and a large linked haplotype spanning 4q21.3-28.3 in a different family. The 4q haplotype contains potential causal rare variants in DSPP at 4q22.1 and PTPN13 at 4q21.3. CONCLUSIONS Regions on 12q, 7p, and 4q are linked to increased cancer risk in highly aggregated lung cancer families, 12q across families and 7p and 4q within a single family. POLR3B, SSPO, DSPP, and PTPN13 are currently the best candidate genes. IMPACT Functional work on these genes is planned for future studies and if confirmed would lead to potential biomarkers for risk in cancer.
Collapse
Affiliation(s)
- Anthony M Musolf
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland
| | - Bilal A Moiz
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland
| | - Haiming Sun
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland.,Laboratory of Medical Genetics, Harbin Medical University, Harbin, China
| | | | - Yohan Bossé
- Institut universitaire de cardiologie et de pneumologie de Québec, Department of Molecular Medicine, Laval University, Québec, Québec, Canada
| | - Diptasri Mandal
- Department of Genetics, Louisiana State University Health Sciences Center, New Orleans, Louisiana
| | | | - Colette Gaba
- Department of Medicine, University of Toledo Dana Cancer Center, Toledo, Ohio
| | | | - Yafang Li
- Baylor College of Medicine, Houston, Texas
| | - Ming You
- Medical College of Wisconsin, Milwaukee, Wisconsin
| | - Ramaswamy Govindan
- Division of Oncology, Washington University School of Medicine, St. Louis, Missouri
| | - Richard K Wilson
- Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, Ohio
| | | | | | - Ann G Schwartz
- Karmanos Cancer Institute, Wayne State University, Detroit, Michigan
| | - Susan M Pinney
- Department of Environmental Health, University of Cincinnati College of Medicine, Cincinnati, Ohio
| | | | - Joan E Bailey-Wilson
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland.
| |
Collapse
|
4
|
Musolf A, Sun H, Moiz BA, Pikielny CW, Andrade MD, Mandal D, Gaba C, Yang P, Li Y, You M, Wilson RK, Kupert EY, Anderson MW, Schwartz AG, Pinney SM, Amos CI, Govindan R, Bailey-Wilson JE. Abstract 4176: Familial lung cancer exhibits multiple novel linked haplotypes within pedigrees. Cancer Res 2019. [DOI: 10.1158/1538-7445.am2019-4176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Lung cancer (LC) kills more people in the United States each year than any other cancer. While it is well known that a variety of environmental factors (particularly tobacco smoke) strongly increase the risk of LC, there are multiple associated genetic variants with small contributions to risk. High aggregation of LC within rare individual families suggest that there are high-risk genetic variants as well. However, these genetic risk factors for LC are under studied due to the rapid fatality of LC. We studied 28 highly aggregated extended high-risk familial lung cancer (HRFLC) families collected from eight different sites across the US. Whole exome sequencing was performed on 290 individuals from these families to identify potential risk variants for HRFLC using genetic linkage analysis. Quality control was performed on the sequence data, filtering on parameters such as read depth, genotype quality, missingness, and Mendelian inconsistencies. Identity-by-descent (IBD) values were also calculated to verify correct familial relationships. Quality control procedures left approximately 400,000 SNVs and indels for analysis.Parametric two-point linkage analysis was performed assuming an autosomal dominant mode of inheritance. Disease allele frequency was set to 1% with a penetrance of 80% for carriers and 1% phenocopy rate. While we did not identify any genome-wide significant variants across the 28 families, multiple suggestive variants were identified. The largest cluster of suggestive variants was located at 14q32 in the CATSPERB gene. Given the likely locus heterogeneity in LC (combined with the lack of power in some families), it is not surprising that none of the variants were significant across the families; looking at the individual family results proved more informative. Long haplotypes linked to LC risk were identified in multiple families. These long runs of positive linkages, which have little to no negative linkage evidence across them, are characteristic of true linkage signals in these types of analysis. Two of the most interesting linked regions were at 7p36.1 and 4q21.23-28.23. The 7p signal (observed in a single family) was genome-wide suggestive and located within the SSPO gene. SSPO has been implicated in breast and skin cancer (melanoma); it is a novel lung cancer signal. The 4q linkage (again observed in a single family) covers a large chunk of 4q and contains multiple potential candidate genes, however, the best candidate gene is PTPN13, a gene implicated in lung cancer but never in familial lung cancer. We are currently evaluating the individual family results of several other pedigrees and plan to perform additional analyses to confirm these linkages.
Citation Format: Anthony Musolf, Haiming Sun, Bilal A. Moiz, Claudio W. Pikielny, Mariza de Andrade, Diptasri Mandal, Colette Gaba, Ping Yang, Yafang Li, Ming You, Richard K. Wilson, Elena Y. Kupert, Marshall W. Anderson, Ann G. Schwartz, Susan M. Pinney, Christopher I. Amos, Ramaswamy Govindan, Joan E. Bailey-Wilson. Familial lung cancer exhibits multiple novel linked haplotypes within pedigrees [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2019; 2019 Mar 29-Apr 3; Atlanta, GA. Philadelphia (PA): AACR; Cancer Res 2019;79(13 Suppl):Abstract nr 4176.
Collapse
Affiliation(s)
| | | | | | | | | | - Diptasri Mandal
- 4Louisiana State University Health Sciences Center, New Orleans, LA
| | | | | | | | - Ming You
- 6Medical College of Wisconsin, Milwaukee, WI
| | | | | | | | | | | | | | | | | |
Collapse
|
5
|
Simpson CL, Musolf AM, Li Q, Portas L, Murgia F, Cordero RY, Cordero JB, Moiz BA, Holzinger ER, Middlebrooks CD, Lewis DD, Bailey-Wilson JE, Stambolian D. Exome genotyping and linkage analysis identifies two novel linked regions and replicates two others for myopia in Ashkenazi Jewish families. BMC Med Genet 2019; 20:27. [PMID: 30704416 PMCID: PMC6357511 DOI: 10.1186/s12881-019-0752-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Accepted: 01/11/2019] [Indexed: 01/10/2023]
Abstract
BACKGROUND Myopia is one of most common eye diseases in the world and affects 1 in 4 Americans. It is a complex disease caused by both environmental and genetics effects; the genetics effects are still not well understood. In this study, we performed genetic linkage analyses on Ashkenazi Jewish families with a strong familial history of myopia to elucidate any potential causal genes. METHODS Sixty-four extended Ashkenazi Jewish families were previously collected from New Jersey. Genotypes from the Illumina ExomePlus array were merged with prior microsatellite linkage data from these families. Additional custom markers were added for candidate regions reported in literature for myopia or refractive error. Myopia was defined as mean spherical equivalent (MSE) of -1D or worse and parametric two-point linkage analyses (using TwoPointLods) and multi-point linkage analyses (using SimWalk2) were performed as well as collapsed haplotype pattern (CHP) analysis in SEQLinkage and association analyses performed with FBAT and rv-TDT. RESULTS Strongest evidence of linkage was on 1p36(two-point LOD = 4.47) a region previously linked to refractive error (MYP14) but not myopia. Another genome-wide significant locus was found on 8q24.22 with a maximum two-point LOD score of 3.75. CHP analysis also detected the signal on 1p36, localized to the LINC00339 gene with a maximum HLOD of 3.47, as well as genome-wide significant signals on 7q36.1 and 11p15, which overlaps with the MYP7 locus. CONCLUSIONS We identified 2 novel linkage peaks for myopia on chromosomes 7 and 8 in these Ashkenazi Jewish families and replicated 2 more loci on chromosomes 1 and 11, one previously reported in refractive error but not myopia in these families and the other locus previously reported in the literature. Strong candidate genes have been identified within these linkage peaks in our families. Targeted sequencing in these regions will be necessary to definitively identify causal variants under these linkage peaks.
Collapse
Affiliation(s)
- Claire L Simpson
- Department of Genetics, Genomics and Informatics and Department of Ophthalmology, University of Tennessee Health Science Center, 71 S. Manassas Room 417, Memphis, TN, 38163, USA.,Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 333 Cassell Dr., Suite 1200, Baltimore, MD, 21224, USA
| | - Anthony M Musolf
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 333 Cassell Dr., Suite 1200, Baltimore, MD, 21224, USA
| | - Qing Li
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 333 Cassell Dr., Suite 1200, Baltimore, MD, 21224, USA
| | - Laura Portas
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 333 Cassell Dr., Suite 1200, Baltimore, MD, 21224, USA
| | - Federico Murgia
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 333 Cassell Dr., Suite 1200, Baltimore, MD, 21224, USA
| | - Roberto Y Cordero
- Department of Genetics, Genomics and Informatics and Department of Ophthalmology, University of Tennessee Health Science Center, 71 S. Manassas Room 417, Memphis, TN, 38163, USA
| | - Jennifer B Cordero
- Department of Genetics, Genomics and Informatics and Department of Ophthalmology, University of Tennessee Health Science Center, 71 S. Manassas Room 417, Memphis, TN, 38163, USA
| | - Bilal A Moiz
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 333 Cassell Dr., Suite 1200, Baltimore, MD, 21224, USA
| | - Emily R Holzinger
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 333 Cassell Dr., Suite 1200, Baltimore, MD, 21224, USA
| | - Candace D Middlebrooks
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 333 Cassell Dr., Suite 1200, Baltimore, MD, 21224, USA
| | - Deyana D Lewis
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 333 Cassell Dr., Suite 1200, Baltimore, MD, 21224, USA
| | - Joan E Bailey-Wilson
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 333 Cassell Dr., Suite 1200, Baltimore, MD, 21224, USA.
| | - Dwight Stambolian
- Department of Ophthalmology, University of Pennsylvania, Rm. 313, Stellar Chance Labs, 422 Curie Blvd, Philadelphia, PA, 19104, USA
| |
Collapse
|
6
|
Musolf AM, Sun H, Moiz BA, Mandal D, Andrade MD, Gaba C, Yang P, Li Y, You M, Kupert EY, Anderson MW, Schwartz AG, Pinney SM, Amos CI, Bailey-Wilson JE. Abstract 3276: Whole exome sequencing identifies significantly linked regions on multiple chromosomes in families with a history of lung cancer. Cancer Res 2018. [DOI: 10.1158/1538-7445.am2018-3276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Lung cancer is the deadliest cancer in the United States, contributing approximately 25% of all cancer deaths. Lung cancer risk is well-documented to increase in response to environmental factors, particularly tobacco smoking. As one might expect for a complex trait, there is also a significant genetic risk component in lung cancer, though it is not very well studied. We obtained whole exome sequencing (WES) genotype data from Washington University on 204 subjects from 25 extended families. These individuals were recruited from families with a history of lung cancer and previous analyses showed these 25 families be informative. The purpose of this study is to identify potential risk variants for lung cancer by performing genetic linkage analysis. Quality control was performed on the sequence data, filtering on parameters such as depth (less than 10), genotype quality (less than 10), missingness, and Mendelian inconsistencies. Identity-by-descent (IBD) values were also calculated to verify correct familial relationships. Quality control procedures left approximately 500,000 SNVs and indels for analysis.
We performed two-point parametric linkage analysis assuming an autosomal dominant mode of inheritance with a disease allele frequency of 1%, a 10% penetrance for carriers and a 1% penetrance for non-carriers. Two discrete sets of linkage analyses were performed. One was a variant-based analysis, which evaluated linkage between the phenotype and individual SNVs or indels. The second was gene-based analysis, which created a multi-allelic pseudomarker corresponding to a gene from haplotypes of rare variants (minor allele frequency <= 0.01) located within that particular gene. Two-point linkage analysis was then performed on the pseudomarkers.
While the variant-based analysis did not identify any genome-wide significant results, several were identified by the gene-based analysis. The highest HLOD scores were both greater than the genome-wide significance level of HLOD = 3.3 and were located on two regions of chromosome 1q: 1q42.13-43 and 1q21.2-21.1. In both regions, the significant HLOD scores clustered around known cancer genes. At the 1q42.13-43 region, signals centered on the cancer-implicated genes OBSCN and RYR2 (also a known mesothelioma gene), while at 1q21.2-21.1 the signals centered on five genes in the neuroblastoma breakpoint family (NBPF) genes, a cluster of recently duplicated genes. NBPF genes have previously been implicated in a variety of different cancers including lung cancer. We are currently performing additional analyses to corroborate the significant results, as well as examining the individual families to determine which families are driving the significant signals.
Citation Format: Anthony M. Musolf, Haiming Sun, Bilal A. Moiz, Diptasri Mandal, Mariza de Andrade, Colette Gaba, Ping Yang, Yafang Li, Ming You, Elena Y. Kupert, Marshall W. Anderson, Ann G. Schwartz, Susan M. Pinney, Christopher I. Amos, Joan E. Bailey-Wilson. Whole exome sequencing identifies significantly linked regions on multiple chromosomes in families with a history of lung cancer [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2018; 2018 Apr 14-18; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2018;78(13 Suppl):Abstract nr 3276.
Collapse
Affiliation(s)
| | | | | | - Diptasri Mandal
- 3Louisiana State University Health Sciences Center, New Orleans, LA
| | | | - Colette Gaba
- 5University of Toledo Dana Cancer Center, Toledo, OH
| | | | | | - Ming You
- 7Medical College of Wisconsin, Milwaukee, WI
| | | | | | - Ann G. Schwartz
- 8Karmanos Cancer Institute, Wayne State University, Detroit, MI
| | - Susan M. Pinney
- 9University of Cincinnati College of Medicine, Cincinnati, OH
| | | | | |
Collapse
|
7
|
Musolf AM, Simpson CL, Long KA, Moiz BA, Lewis DD, Middlebrooks CD, Portas L, Murgia F, Ciner EB, Bailey-Wilson JE, Stambolian D. Myopia in Chinese families shows linkage to 10q26.13. Mol Vis 2018; 24:29-42. [PMID: 29383007 PMCID: PMC5767476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2017] [Accepted: 01/12/2018] [Indexed: 11/09/2022] Open
Abstract
Purpose To determine genetic linkage between myopia and Han Chinese patients with a family history of the disease. Methods One hundred seventy-six Han Chinese patients from 34 extended families were given eye examinations, and mean spherical equivalent (MSE) in diopters (D) was calculated by adding the spherical component of the refraction to one-half the cylindrical component and taking the average of both eyes. The MSE was converted to a binary phenotype, where all patients with an MSE of -1.00 D or less were coded as affected. Unaffected individuals had an MSE greater than 0.00 D (ages 21 years and up), +1.50 (ages 11-20), or +2.00 D (ages 6-10 years). Individuals between the given upper threshold and -1.00 were coded as unknown. Patients were genotyped on an exome chip. Three types of linkage analyses were performed: single-variant two-point, multipoint, and collapsed haplotype pattern (CHP) variant two-point. Results The CHP variant two-point results identified a significant peak (heterogeneity logarithm of the odds [HLOD] = 3.73) at 10q26.13 in TACC2. The single-variant two-point and multipoint analyses showed highly suggestive linkage to the same region. The single-variant two-point results identified 25 suggestive variants at HTRA1, also at 10q26.13. Conclusions We report a significant genetic linkage between myopia and Han Chinese patients at 10q26.13. 10q26.13 contains several good candidate genes, such as TACC2 and the known age-related macular degeneration gene HTRA1. Targeted sequencing of the region is planned to identify the causal variant(s).
Collapse
Affiliation(s)
- Anthony M. Musolf
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, MD
| | - Claire L. Simpson
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, MD
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN
| | - Kyle A. Long
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, MD
| | - Bilal A. Moiz
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, MD
| | - Deyana D. Lewis
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, MD
| | - Candace D. Middlebrooks
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, MD
| | - Laura Portas
- Institute of Population Genetics, CNR, Li Punti, Sassari, Italy
| | - Federico Murgia
- Institute of Population Genetics, CNR, Li Punti, Sassari, Italy
| | - Elise B. Ciner
- The Pennsylvania College of Optometry at Salus University, Elkins Park, PA
| | - Joan E. Bailey-Wilson
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, MD
| | - Dwight Stambolian
- Department of Ophthalmology, University of Pennsylvania, Philadelphia, PA
| |
Collapse
|
8
|
Musolf AM, Simpson CL, Moiz BA, Middlebrooks C, Andrade MD, Mandal D, Gaba C, Yang P, Li Y, You M, Kupert EY, Anderson MW, Schwartz AG, Pinney SM, Amos CI, Bailey-Wilson JE. Abstract 4290: A study in locus heterogeneity: Targeted sequencing analysis of 6q reveals multiple significant loci as the source of a previous linkage peak in familial lung cancer. Cancer Res 2017. [DOI: 10.1158/1538-7445.am2017-4290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Lung cancer is the leading cancer-related cause of death in the United States. We had previously performed multipoint linkage analysis on families with a strong family history of lung cancer and found significant linkage to the 6q25 region. In order to find the source of that linkage signal, we performed targeted sequencing 6q23-6q27 on 75 individuals from the 9 most highly linked families. We performed two types of parametric linkage analysis, using an autosomal dominant mode of inheritance with 10% penetrance for carriers and a 1% phenocopy rate using a disease allele frequency of 1%. The first type was a two-point analysis using an Elston-Stewart algorithm. This approach did not lead to any genome-wide significant results but demonstrated significant heterogeneity throughout the nine families.In an effort to recover the power from the original multipoint analysis, we performed a regional linkage analysis using SEQLinkage and MERLIN. SEQLinkage built regional haplotypes that corresponded to a gene or an intergenic region - the regions were based on a customized map of our design. The regional markers were multiallelic which allowed for greater information content and were similar to the microsatellites that were used in the original multipoint analysis. The regional markers were then analyzed in a two-point linkage analysis via MERLIN. This allowed us to identify two genome-wide significant signals at PACRG-AS1 at 6q26 (HLOD = 3.4) and SAMD5 at 6q24 (3.3). The PACRG-AS1 is novel, though it is associated with the known lung cancer gene PARK2. SAMD5 may be involved in lung cancer cell proliferation. The heterogeneity of the two signals was particularly interesting. The PACRG-AS1 was driven primarily by two families. The SAMD5 marker was not being strongly driven by any of the families and appears to be a small cumulative effect across the nine families. In addition, several families have large, but non-significant LOD scores at other loci across the region. This further reinforces the locus heterogeneity within the region, and it is likely that our two significant signals here are not the only variants affecting the phenotype. We further attempted to localize the signals by running SEQLinkage using a custom map where genes are broken into exons and introns. Though this resulted in no significant markers, the highest signal was located in the intronic region of SAMD5 (HLOD = 3) and several other suggestive signals were localized to intronic regions of good candidate genes like SASH1 and ARID1B. Examination of predicted effects of the candidate regulatory variants using eQTL databases is ongoing. This is a region full of promising candidates, and it is likely that the two significant signals found here are just part of many that could be affecting lung cancer risk. We plan to do further analysis within the individual families to elucidate any more genes affecting this signal.
Citation Format: Anthony M. Musolf, Claire L. Simpson, Bilal A. Moiz, Candace Middlebrooks, Mariza de Andrade, Diptasri Mandal, Colette Gaba, Ping Yang, Yafang Li, Ming You, Elena Y. Kupert, Marshall W. Anderson, Ann G. Schwartz, Susan M. Pinney, Christopher I. Amos, Joan E. Bailey-Wilson. A study in locus heterogeneity: Targeted sequencing analysis of 6q reveals multiple significant loci as the source of a previous linkage peak in familial lung cancer [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2017; 2017 Apr 1-5; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2017;77(13 Suppl):Abstract nr 4290. doi:10.1158/1538-7445.AM2017-4290
Collapse
Affiliation(s)
| | | | | | | | | | - Diptasri Mandal
- 4Louisiana State University Health Sciences Center, New Orleans, LA
| | - Colette Gaba
- 5University of Toledo Dana Cancer Center, Toledo, OH
| | | | | | - Ming You
- 7Medical College of Wisconsin, Milwaukee, WI
| | | | | | - Ann G. Schwartz
- 8Karmanos Cancer Institute, Wayne State University, Detroit, MI
| | - Susan M. Pinney
- 9University of Cincinnati College of Medicine, Cincinnati, OH
| | | | | |
Collapse
|
9
|
Musolf AM, Simpson CL, Moiz BA, Long KA, Portas L, Murgia F, Ciner EB, Stambolian D, Bailey-Wilson JE. Caucasian Families Exhibit Significant Linkage of Myopia to Chromosome 11p. Invest Ophthalmol Vis Sci 2017; 58:3547-3554. [PMID: 28715588 PMCID: PMC5510992 DOI: 10.1167/iovs.16-21271] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2016] [Accepted: 05/29/2017] [Indexed: 11/24/2022] Open
Abstract
Purpose Myopia is a common visual disorder caused by eye overgrowth, resulting in blurry vision. It affects one in four Americans, and its prevalence is increasing. The genetic mechanisms that underpin myopia are not completely understood. Here, we use genotype data and linkage analyses to identify high-risk genetic loci that are significantly linked to myopia. Methods Individuals from 56 Caucasian families with a history of myopia were genotyped on an exome-based array, and the single nucleotide polymorphism (SNP) data were merged with microsatellite genotype data. Refractive error measures on the samples were converted into binary phenotypes consisting of affected, unaffected, or unknown myopia status. Parametric linkage analyses assuming an autosomal dominant model with 90% penetrance and 10% phenocopy rate were performed. Results Single variant two-point analyses yielded three significantly linked SNPs at 11p14.1 and 11p11.2; a further 45 SNPs at 11p were found to be suggestive. No other chromosome had any significant SNPs or more than seven suggestive linkages. Two of the significant SNPs were located in BBOX1-AS1 and one in the intergenic region between ORA47 and TRIM49B. Collapsed haplotype pattern two-point analysis and multipoint analyses also yielded multiple suggestively linked genes at 11p. Multipoint analysis also identified suggestive evidence of linkage on 20q13. Conclusions We identified three genome-wide significant linked variants on 11p for myopia in Caucasians. Although the novel specific signals still need to be replicated, 11p is a promising region that has been identified by other linkage studies with a number of potentially interesting candidate genes. We hope that the identification of these regions on 11p as potential causal regions for myopia will lead to more focus on these regions and maybe possible replication of our specific linkage peaks in other studies. We further plan targeted sequencing on 11p for our most highly linked families to more clearly understand the source of the linkage in this region.
Collapse
Affiliation(s)
- Anthony M. Musolf
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland, United States
| | - Claire L. Simpson
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland, United States
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, Tennessee, United States
| | - Bilal A. Moiz
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland, United States
| | - Kyle A. Long
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland, United States
| | - Laura Portas
- Institute of Population Genetics, CNR, Li Punti, Sassari, Italy
| | - Federico Murgia
- Institute of Population Genetics, CNR, Li Punti, Sassari, Italy
| | - Elise B. Ciner
- The Pennsylvania College of Optometry at Salus University, Elkins Park, Pennsylvania, United States
| | - Dwight Stambolian
- Department of Ophthalmology, University of Pennsylvania, Philadelphia, Pennsylvania, United States
| | - Joan E. Bailey-Wilson
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland, United States
| |
Collapse
|