author: ShelbySolomonDarnell, 2024-09-27 17:41:43 +0300
committer: ShelbySolomonDarnell, 2024-09-27 17:41:43 +0300
commit: 184339563b23627ca41bac8736f864d1c6bbfcba
tree: c25acad92779c9b54c83cd3086d3f219d9c7a0ef /gnqa/paper2_eval/data
parent: 11a7a465afcaa875d5fdebe840e60b1e7be5ee76
Improved document formatting code. Full conversion from R2R output to RAGAS input complete.
Diffstat (limited to 'gnqa/paper2_eval/data')
-rw-r--r-- gnqa/paper2_eval/data/dataset/domain_expert_aging_1 | 65
-rw-r--r-- gnqa/paper2_eval/data/dataset/domain_expert_aging_2 | 65
-rw-r--r-- gnqa/paper2_eval/data/dataset/domain_expert_aging_3 | 65
-rw-r--r-- gnqa/paper2_eval/data/dataset/domain_expert_aging_4 | 65
-rw-r--r-- gnqa/paper2_eval/data/dataset/domain_expert_aging_5 | 65
-rw-r--r-- gnqa/paper2_eval/data/dataset/domain_expert_aging_6 | 65
6 files changed, 390 insertions, 0 deletions
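
Each file added in this commit is a JSON object holding parallel arrays (`titles`, `extraction_id`, `document_id`, `contexts`) with one entry per retrieved chunk, plus a single `question` and `answer`. In the files shown, the `id` array carries one extra leading entry that looks like a chat-completion id, so it is not treated as per-chunk data here. The following is a minimal sketch of how such a file might be checked and reduced to a question/answer/contexts record. The helper names `to_eval_record` and `load_eval_record` are hypothetical, the output field names are an assumption about what an evaluator wants, and this is not the conversion code from the repository.

```python
import json

# Assumption: these are the per-chunk arrays, as they appear in the files below.
PER_CHUNK_KEYS = ("titles", "extraction_id", "document_id", "contexts")


def to_eval_record(raw: dict) -> dict:
    """Verify the per-chunk arrays line up, then keep question, answer, contexts."""
    lengths = {key: len(raw[key]) for key in PER_CHUNK_KEYS}
    if len(set(lengths.values())) != 1:
        raise ValueError(f"per-chunk arrays differ in length: {lengths}")
    return {
        "question": raw["question"],
        "answer": raw["answer"],
        "contexts": list(raw["contexts"]),
    }


def load_eval_record(path: str) -> dict:
    """Read one dataset file (hypothetical helper) and convert it."""
    with open(path, encoding="utf-8") as handle:
        return to_eval_record(json.load(handle))
```

Calling `load_eval_record("gnqa/paper2_eval/data/dataset/domain_expert_aging_1")` from the repository root would then yield one record per file. The length check is there because the bracketed citations in `answer` (for example `[3]`) index into `contexts`, so a misaligned array would silently break attribution.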
diff --git a/gnqa/paper2_eval/data/dataset/domain_expert_aging_1 b/gnqa/paper2_eval/data/dataset/domain_expert_aging_1
new file mode 100644
index 0000000..81d5073
--- /dev/null
+++ b/gnqa/paper2_eval/data/dataset/domain_expert_aging_1
@@ -0,0 +1,65 @@
+{
+ "titles": [
+ "2017 - Regular exercise participation improves genomic stability in diabetic patients an exploratory study to analyse telomere length and DNA damage.pdf",
+ "2020 - Clinical Genetics and Genomics of Aging.pdf",
+ "2008 - Telomeres and Aging.pdf",
+ "2006 - Sex-specific telomere length profiles.pdf",
+ "2018 - Sex Differences in Aging Genomic Instability.pdf",
+ "2002 - Mitochondrial dysfunction leads to telomere attrition.pdf",
+ "2006 - Sex-specific telomere length profiles.pdf",
+ "2017 - The Aging Cardiovascular System.pdf",
+ "2020 - Clinical Genetics and Genomics of Aging.pdf",
+ "2018 - Repetitive Fragile Sites Centromere Satellite DNA.pdf"
+ ],
+ "extraction_id": [
+ "0e53122e-a308-55f7-8ee8-a0857ac9c52f",
+ "efd18101-9cf2-56b5-8f86-c2aba6caa0bc",
+ "13990eb4-bef2-58ce-bf3e-0e3bc294caab",
+ "6d3bfe47-f26e-50dc-8d77-19f3797e53a0",
+ "396708f1-aa0a-571e-a8d3-7cb8404e9502",
+ "b92ede07-74a7-524a-8d2c-54b2559e8425",
+ "eb8d8e40-a484-57cb-8125-3fd5eb3f6389",
+ "6949970f-7bc7-5585-a57a-96de1b5ba6ec",
+ "d4afa45a-5efa-577b-822e-7a82c2f6508d",
+ "3b0cb0ab-421d-54d7-9816-c6a2e6f1ac68"
+ ],
+ "document_id": [
+ "dcaf7b09-2d54-5cbf-b061-e3c4e6c6c518",
+ "62b635c3-040e-512a-b016-6ef295308a1e",
+ "61d9c326-d36e-55c1-a891-335dc943e70f",
+ "09c78a17-4a1f-52c1-be4d-994fd9fd71d0",
+ "8cfb5529-7f0c-58fc-b6e4-b3ee800fb72f",
+ "d8bc729b-7513-58b7-b12e-0db1fb6d3b7d",
+ "09c78a17-4a1f-52c1-be4d-994fd9fd71d0",
+ "d3ff8471-986b-5fa0-b9c4-96eaaa8fce7c",
+ "62b635c3-040e-512a-b016-6ef295308a1e",
+ "262df0d6-ad68-544a-88ed-b4568f305858"
+ ],
+ "id": [
+ "chatcmpl-ABLwBBugt6fTuTWqXb74qvoPVubbX",
+ "bb069c10-45f1-5a83-95e3-4b7655874ba7",
+ "28e98b7e-f273-5bdd-9979-185133f311af",
+ "5f940245-af1d-5eee-84dc-942017c523d0",
+ "607cbd31-d430-5517-8212-208b25af32bf",
+ "53508a9e-d064-58a3-a4f9-0785470a1462",
+ "7fad29bd-12bf-53d0-af89-aadd38b974ff",
+ "64ef9964-1831-5a7a-8a69-5e8d0c332d37",
+ "1b453e12-a0c4-59db-a978-bbebd689e7dc",
+ "65fb74aa-f3c3-5c80-919f-329169db982f",
+ "f181e6da-58b6-5f26-87a2-355e25388673"
+ ],
+ "contexts": [
+ "repetitive nucleotide sequences at the end of each eukaryotic chromosome, which protects them from attrition and damage. Although the relationship between leukocyte telomere length (LTL) and diabetes is still questioned 8, different studies have shown that T2D individuals have shorter leukocyte telomeres than non-T2D individuals9, 10 that may be associated with disease progression11. Indeed, the decreased antioxidant capacity described in patients",
+ "Telomeres are arrays of linked nucleotide hexamer repeats that are found at the ends of chromosomes in a vast clade of organisms [14]. While the sequence of these telomeric repeats can vary between organisms, their biological function is highly conserved, which is to limit damage inflicted on genes during the replica- tion of chromosomes. Telomere length is progressively shortened with each round of genomic replication, unless it is restored through the action of a ribonucleo-",
+ "telomere length,a phenomenon attributed to higher levels of oxidativestress at the cellular level (70). More recent studies havelinked telomere length in smooth muscle cells with senes-cence and disease severity in patients with atherosclero-sis (141, 150). Leukocyte telomere length was also short ina cohort of similar patients and associated with a higherrisk of developing occult cardiovascular disease (71).More data are needed to understand and validate the useof leukocyte telomere length as a biomarker",
+ "age telomere length through accumulation of several short telo- meres (Londono-Vallejo et al., 2001; Martens et al., 2000) is responsible for senescence or whether a speci c chromosome arm limits the replication potential of human cells (Hemann et al., 2001). Individual chromosome arms were shown to have large variations in their length (Lansdorp et al., 1996; Benn, 1997; Londono-Vallejo et al., 2001), and chromosome 17p seemed to be equipped with especially short telomeres in hu-",
+ "Telomeres are specialized structures that protect the ends of linear chromosomes. They shorten during aging due to the unidirectional activity of DNA polymerase, which leaves a section of DNA unrepli-cated on the lagging strand. Telomeres also are subject to shortening by genotoxic stress, such as oxidative damage (33). Among many eukaryotes, the enzyme telomerase maintains telomere length; but telomerase activity varies over the lifespan and between cell types, tissues, and species (34). In most human",
+ "TTAGGG sequence that cap the ends of chromosomes, protect-ing them from degradation and fusion. The length of telomererepeats is primarily maintained by active telomerase, which iscomposed of Telomerase RNA (TR) and a catalytic subunitTelomerase Reverse Transcriptase (TERT) (Blackburn, 2001).Extensive evidence has shown that telomere shortening anderosion lead to chromosome end-to-end fusions and genomicinstability (Blasco et al ., 1997; Hande et al ., 1999), causing",
+ "a pivotal role in maintenance of genomic integrity and func-tion (Moyzis et al., 1988; McElligott and Wellinger, 1997; van Steensel and de Lange, 1997). It is generally accepted that telomeres shorten during DNA replication both in vitro and in vivo. In individuals, short telomeres are considered to be a sign of advanced age. Cawthon and coworkers (2003) showed that telomere shortening in hu-",
+ "Each cell division shortens telomeric DNA until, at a critical length, the cells lose capping function at thechromosomal ends, activating DNA damage check-points, cell senescence, and eventually apoptosis.Telomere shortening has particular relevance in thesetting of CVD. Leukocyte telomere length (LTL) associates signi cantly with vascular cell senescence,",
+ "nization may directly affect telomere attrition, resulting in accelerated replicative senescence and progeroid phenotypes [180]. Telomeres are regions constituted by tandem repeats of non-coding DNA sequences 5-(TTAGGG)n-3 and a protein complex called shelterin, bound to them. This structure ensures the stability of the genome and protects the chromosomes from a wrong action of the DNA repair machinery [184] by allowing the formation of a chromatin loop called T-Loop [185].",
+ "telomeres, the repetitive sequence at the end of linear chromosomes, has garnered much attention for its relation to aging. Telomere repeats serve as an internal clock for cycling cells because each round of replication results in the loss of telomeric DNA in the absence of active telomerase (reviewed in [66]). Eventually, this loss over cellular generations culminates in telomere crisis and a permanent state of"
+ ],
+ "answer": "The length of telomeres is significant for several reasons: 1. **Protection of Chromosomes**: Telomeres are repetitive nucleotide sequences at the ends of eukaryotic chromosomes that protect them from attrition and damage during replication [1], [2], [6]. They prevent the loss of important genetic information and maintain genomic stability [9]. 2. **Aging and Cellular Senescence**: Telomere length is progressively shortened with each round of genomic replication unless restored by the enzyme telomerase [2], [5]. Short telomeres are considered a sign of advanced age and are associated with cellular senescence, where cells lose their ability to divide and function properly [4], [7], [8]. 3. **Disease Association**: Shorter telomeres have been linked to various diseases. For instance, individuals with Type 2 Diabetes (T2D) have been shown to have shorter leukocyte telomeres, which may be associated with disease progression [1]. Similarly, telomere length in smooth muscle cells has been linked to disease severity in atherosclerosis, and shorter leukocyte telomeres are associated with a higher risk of developing cardiovascular disease [3], [8]. 4. **Genomic Integrity**: Telomeres play a crucial role in maintaining genomic integrity and function. Shortening and erosion of telomeres can lead to chromosome end-to-end fusions and genomic instability, which can cause various cellular dysfunctions and diseases [6], [7]. 5. **Biomarker Potential**: Due to their association with aging and disease, telomere length is being studied as a potential biomarker for various health conditions, including cardiovascular diseases and diabetes [3]. In summary, the length of telomeres is crucial for protecting chromosomes, regulating cellular aging, and maintaining genomic stability. It also has significant implications for understanding and potentially diagnosing age-related diseases and conditions.",
+ "question": "What is the significance of the length of telomeres?"
+}
\ No newline at end of file
diff --git a/gnqa/paper2_eval/data/dataset/domain_expert_aging_2 b/gnqa/paper2_eval/data/dataset/domain_expert_aging_2
new file mode 100644
index 0000000..c4e6321
--- /dev/null
+++ b/gnqa/paper2_eval/data/dataset/domain_expert_aging_2
@@ -0,0 +1,65 @@
+{
+ "titles": [
+ "2015 - A Chromosome 13 locus is associated with male-specific mortality in mice.pdf",
+ "2021 - Footprints in the Sand Deep Taxonomic Comparisons in Vertebrate Genomics to Unveil the Genetic Programs of Human Longevity.pdf",
+ "2021 - Genetic loci and metabolic states associated with murine epigenetic aging.pdf",
+ "2021 -Mozhui- Epigenetic aging.pdf",
+ "2016 - Unraveling the message insights into comparative genomics.pdf",
+ "2012 - Chromatin Remodeling, DNA Damage Repair and Aging.pdf",
+ "2021 - Gene-by-environment modulation of lifespan and weight gain in the murine BXD family.pdf",
+ "2006 - THE GENETIC REGULATION OF THE RESPONSE OF HEMATOPOIETIC STEM_PROG.pdf",
+ "2012 - Genome-Scale Studies of Aging Challenges and Opportunities.pdf",
+ "2003 - Lifelong voluntary exercise in the mouse prevents.pdf"
+ ],
+ "extraction_id": [
+ "5cc56e3b-53ab-5299-814d-014e2ed31d2f",
+ "11ca91fa-a13f-5cc5-90c8-53d1ebe76836",
+ "a9ebf1d8-5ef8-5c52-962e-110873476823",
+ "e662d80d-b529-5749-856c-ed734c6e3eaa",
+ "c6f50e80-1bc5-5b0a-b57b-4c2bfe524d96",
+ "d9a12bd9-c65e-547a-89aa-4e0231558ddc",
+ "30ba3324-6e19-58c2-9e32-508f827af3e5",
+ "c04cac81-a0b0-5d0a-b21e-2f94494bb302",
+ "9669b6fe-e9d7-55e8-a91a-c015df633daa",
+ "6a2cdf66-f3c9-5be9-b6b0-f203be169103"
+ ],
+ "document_id": [
+ "ad8f2626-87fb-520e-8cef-ee9a9cc3ab0b",
+ "0dc45abe-ab02-5b07-9916-7093b53323c0",
+ "b82bd9e1-2373-577b-a942-164565eaca6b",
+ "d23daa43-4176-54e6-b3c3-b889843e92f1",
+ "0deba7bb-c27a-5d9e-b1b2-e48a5574882c",
+ "594e5dbe-b92a-5b0c-9f65-2a10670f9517",
+ "4d082da4-fa48-5170-8147-c4fea47a5d4b",
+ "b84914bc-195d-5c48-8e89-0db719675c1f",
+ "b77aace0-fa36-5fd4-8e2a-c8932198acd1",
+ "24d4f270-f45b-5830-84f9-b1e5bcd3c070"
+ ],
+ "id": [
+ "chatcmpl-ABLwRFLcOLGvXJuXhHs6NCge9tY7Z",
+ "09da6f9e-b996-5438-91be-41d9438cb930",
+ "14bf5e8a-4095-536f-b98b-00c8cdae3a31",
+ "f8fdd2ee-710c-5d2c-8a70-bf48f4927653",
+ "e613d3df-adb0-56b0-abfd-8828020c23c3",
+ "02296a91-f1a4-5b35-a5d1-e1851797404b",
+ "90214d4d-4068-5490-9049-5604b5dcf3e2",
+ "56e03e38-0ae5-5b29-b929-662fa091e0ac",
+ "ebc5b444-a63f-5819-9d3a-ffbf96b3d367",
+ "80d01818-7573-5321-b33d-c7e291f3fe74",
+ "11af155f-85c6-5f8b-8943-5391ad678f7e"
+ ],
+ "contexts": [
+ "11. Gelman R, Watson A, Bronson R et al (1988) Murine chromo- somal regions correlated with longevity. Genetics 118(4):693704 12. Jackson AU, Galecki AT, Burke DT et al (2002) Mouse loci associated with life span exhibit sex-specic and epistatic effects. J Gerontol A Biol Sci Med Sci 57(1):B9B15 13. Foreman JE, Lionikas A, Lang DH et al (2009) Genetic archi- tecture for hole-board behaviors across substantial time intervalsin young, middle-aged and old mice. Genes Brain Behav",
+ "Long-lived rodents reveal signatures of positive selection in genes associated with lifespan. PLoS Genet. 14:e1007272. doi: 10.1371/journal.pgen.100 7272 Schchter, F., Faure-Delanef, L., Gunot, F., Rouger, H., Froguel, P., Lesueur-Ginot, L., et al. (1994). Genetic associations with human longevity at the APOE and ACE loci. Nat. Genet. 6, 2932. doi: 10.1038/ng0194-29 Schinaman, J. M., Rana, A., Ja, W. W., Clark, R. I., and Walker, D. W. (2019).",
+ "of the mouse growth hormone receptor results in severely decreased body weights, insulin, and insulin- like growth factor I levels and increased life span. Endocrinology 144:37993810. DOI: https://doi.org/10.1210/en. 2003-0374, PMID: 12933651 de Haan G, Williams RW. 2005. A genetic and genomic approach to identify longevity genes in mice. Mechanisms of Ageing and Development 126:133138. DOI: https://doi.org/10.1016/j.mad.2004.09.012, PMID: 15610771",
+ "of the mouse growth hormone receptor results in severely decreased body weights, insulin, and insulin- like growth factor I levels and increased life span. Endocrinology 144:37993810. DOI: https://doi.org/10.1210/en. 2003-0374, PMID: 12933651 de Haan G, Williams RW. 2005. A genetic and genomic approach to identify longevity genes in mice. Mechanisms of Ageing and Development 126:133138. DOI: https://doi.org/10.1016/j.mad.2004.09.012, PMID: 15610771",
+ "Mulvey L, Sinclair A, Selman C (2014) Lifespan modulation in mice and the confounding effects of genetic background. J Genet Genomics 41:497503. doi: 10.1016/j.jgg.2014.06.002 OConnor TP, Lee A, Jarvis JUM, Buffenstein R (2002) Prolonged longevity in naked mole-rats: age-related changes in metabolism, body composition and gastrointestinal function. Comp Biochem Physiol A 133:835842. doi: 10.1016/S1095-6433(02)00198-8 Opazo JC, Palma RE, Melo F, Lessa EP (2005) Adaptive evolution of",
+ "/ mice by Lmna heterozy- gosity ameliorates progeroid phenotypes and extends lifespan [143, 174, 175].",
+ "References 1. Hook Met al.Genetic cartography of longevity in humans and mice: Current landscape and horizons. Biochim. Biophys. Acta1864, 27182732 (2018). 2. Kuningas Met al.Genes encoding longevity: from model organisms to humans. Aging Cell7, 270 280 (2008). [PubMed: 18208581] 3. de Magalhes JP, Wuttke D, Wood SH, Plank M & V ora C Genome-environment interactions that modulate aging: Powerful targets for drug discovery. Pharmacol. Rev. 64, 88101 (2012). [PubMed: 22090473]",
+ "\"Murine chromosomal regions correlated with longevity.\" Genetics 118: 693-704.",
+ "expression of alpha-1,2-mannosidase I extends lifespan in Drosophila melanogaster and Caenorhabditis elegans . Aging Cell, 2009 , 8(4), 370-9. [73] Wang, H.D.; Kazemi-Esfarjani, P.; Benzer, S. Multiple-stress analysis for isolation of Drosophila longevity genes . Proc Natl Acad Sci U S A , 2004 , 101(34), 12610-5. [74] Lin, Y.J.; Seroude, L.; Benzer, S. Extended life-span and stress resistance in the Drosophila mutant methuselah . Science , 1998 , 282(5390), 943-6.",
+ "sion analysis of mouse liver genes: effect of age and of thelongevity mutant Prop1df. J Gerontol A Biol Sci Med Sci 56: B72B80, 2001. 12.Fabrizio P, Pozza F, Pletcher SD, Gendron CM, and Longo VD. Regulation of longevity and stress resistance by Sch9 in Yeast. Science 292: 288 290, 2001. 13.Haase D, Lehmann MH, Korner MM, Korfer R, Sigusch HH, and Figulla HR. Identi cation and validation of selective"
+ ],
+ "answer": "The mouse genes associated with longevity include: 1. **Mouse loci associated with life span**: These loci exhibit sex-specific and epistatic effects [1]. 2. **Mouse growth hormone receptor**: Alterations in this gene result in increased lifespan [3], [4]. 3. **Lmna heterozygosity**: This genetic modification ameliorates progeroid phenotypes and extends lifespan [6]. These references provide evidence of specific genes and loci in mice that have been linked to increased longevity.",
+ "question": "Which mouse genes have been associated with longevity?"
+}
\ No newline at end of file
diff --git a/gnqa/paper2_eval/data/dataset/domain_expert_aging_3 b/gnqa/paper2_eval/data/dataset/domain_expert_aging_3
new file mode 100644
index 0000000..1f78fa9
--- /dev/null
+++ b/gnqa/paper2_eval/data/dataset/domain_expert_aging_3
@@ -0,0 +1,65 @@
+{
+ "titles": [
+ "2009 - The Human Ageing Genomic Resources online.pdf",
+ "2011 - How pleiotropic genetics of the musculoskeletal system.pdf",
+ "2021 - Genome-wide association studies identify.pdf",
+ "2012 - Genome-Environment Interactions That Modulate.pdf",
+ "2020 - Clinical Genetics and Genomics of Aging.pdf",
+ "2001 - A genome-wide scan for linkage to human.pdf",
+ "2007 - Temporal and spatial transcriptional profiles.pdf",
+ "2012 - Genome-Environment Interactions That Modulate.pdf",
+ "2020 - Role of Helicobacter pylori infection in the manifestation of old age-related diseases.pdf",
+ "2012 - Genome-Environment Interactions That Modulate.pdf"
+ ],
+ "extraction_id": [
+ "7ada6b55-99c2-5e20-bf96-d153f927256c",
+ "b5b3c74a-90de-5b1e-9580-8031b10be7ec",
+ "cd7730b6-22dc-5256-9310-79fc348b3226",
+ "d59d7882-333d-5576-86ab-3cfa6354b946",
+ "4ea8424f-1cd8-569c-a1df-3f0f54206e70",
+ "17246c43-2e44-579b-867d-3dc7150ceedd",
+ "2e42619b-d0b2-5d33-aab8-6f04002ee807",
+ "d59d7882-333d-5576-86ab-3cfa6354b946",
+ "e6916baa-9f9d-57aa-b44d-95fb614610a8",
+ "a01ca925-4ccf-5863-a162-7bd4c754fe89"
+ ],
+ "document_id": [
+ "e43cd3b6-ad8e-5422-ba7c-ceb6e66cc529",
+ "ed31486c-a651-5894-bd96-21fbd78f2646",
+ "60c2e869-1fee-53ea-b332-26d9c2abc747",
+ "b1a1997c-e9df-5dc0-9d12-a3977d0c64ec",
+ "62b635c3-040e-512a-b016-6ef295308a1e",
+ "1431984a-82d9-51d4-a23c-5f76a02ab554",
+ "38f27ec7-08bf-5397-b2b8-bde95e0dc3f8",
+ "b1a1997c-e9df-5dc0-9d12-a3977d0c64ec",
+ "e99c68d2-4f35-5591-8072-cfdb31966e68",
+ "b1a1997c-e9df-5dc0-9d12-a3977d0c64ec"
+ ],
+ "id": [
+ "chatcmpl-ABLwW9HA9VG184zgOmenEBU2eMIMc",
+ "3117c019-7311-53ae-8ab1-927ca822c709",
+ "0ad664d2-6756-5123-b192-8a56cf6887a5",
+ "9fa00091-9661-57bd-91c7-f0bf436805a7",
+ "786d2756-4c4d-5ac0-8d3d-63f914d51664",
+ "a0672677-71ad-5603-8427-a0648eec407f",
+ "e0cce1c5-8709-5218-99b6-48a6ba242931",
+ "bf2cd208-273f-5848-b243-df8b95ea7833",
+ "413f8f54-b5cc-5089-9f5c-d9e3b8bcf594",
+ "50581d4f-396c-5d12-aec6-5f42e2ab88ef",
+ "3c369292-4b9c-5156-a80f-4b3301026f30"
+ ],
+ "contexts": [
+ "It is undisputed that genetic factors influence aging. In a remarkable",
+ "perform a study of the genetic sources of biological aging. However, to be successful, the genetic study of acomplex condition requires a heritable phenotype to be developed and validated. Genome-wide association studies offer an unbiased approach to identify newcandidate genes for human diseases. It is hypothesized that convergent results from multiple aging-related traits will point out the genes responsible for the general agingof the organism. This perspective focuses on the",
+ "population dynamics on the genetic architecture of human longevity. Aging (Albany NY). 2018;10(8):1947 63. 68. Bellenguez C, Kucukali F, Jansen I, Andrade V, Morenau-Grau S, Amin N, et al. Large meta-analysis of genome-wide association studies expands knowledge of the genetic etiology of Alzheimer disease and highlights potential translational opportunities. medRxiv. 2020. 69. Kojima T, Shimazui T, Hinotsu S, Joraku A, Oikawa T, Kawai K, et al. Decreased expression of CXXC4 promotes a",
+ "In addition to aging- and CR-related genes, another source of candidate genes and pathways for drug designare human longevity-associated genes (Barzilai andShuldiner, 2001; Browner et al., 2004; Kenyon, 2010).Dozens of genes have now been associated with humanlongevity (de Magalha es et al., 2009a), although only ahandful of genes have been shown to have consistenteffects across populations. Many longevity-associated genes are related to spe-",
+ "Clinical Genetics and Genomics of Aging",
+ "effect fundamental mechanisms of aging (14, 16). The drawbacksof such studies include the improbability of picking the right geneto study the myriad of known and unknown genes affecting theprocess of interest (17). The linkage study described heremarkedly improves the efficiency of such association studies bydefining a region likely to contain polymorphism(s) with signif-icant influence on life span. Additional association studies with these families and repli-",
+ "The multifactorial and temporal features of aging can beanalyzed efficiently by genome-wide transcriptional profiling,which has been conducted in various model organisms and hu-mans (Melov and Hubbard 2004). Aging is associated with alter-ations in transcript levels of many genes, including those in-volved in evolutionarily conserved mitochondrial and protea-somal functions (McCarroll et al. 2004), some of which havebeen shown to be directly involved in regulating lifespan in C.",
+ "overexpressed with age seem to be a response to aging,in that they have been previously found to have protec-tive functions (de Magalha es et al., 2009b). As such,these genes may help organisms manage aging andcould be targets for manipulation. Likewise, gene ex-pression analysis of CR has been conducted to identifyassociated genes (Lee et al., 1999, 2000). A number ofmolecular signatures have emerged from such studiesthat could be useful to identify candidate processes andpathways that affect aging,",
+ "Mol Genet Genomic Med. 2020;00:e1157. | 1 of 11 https://doi.org/10.1002/mgg3.1157 wileyonlinelibrary.com/journal/mgg3 1 | INTRODUCTION Aging is one of the inevitably dominant risk associated with many diseases. Several biological factors contribute to this etiology which",
+ "al., 2009; Stanfel et al., 2009). Many of these genesmodulate the response to environmental signals, such asfood availability, and act in signaling pathways that ifunderstood can be targeted (Fig. 1). The genetic regula-tion of aging is therefore an emerging field with multipleapplications in the human nutrition, cosmetic, and phar-maceutical industries. AGING GENES AS TARGETS FOR DRUG DISCOVERY 91"
+ ],
+ "answer": "Genetic factors associated with aging include: 1. **Genome-wide association studies (GWAS)**: These studies offer an unbiased approach to identify new candidate genes for human diseases, including those related to aging. Convergent results from multiple aging-related traits can point out the genes responsible for the general aging of the organism [2]. 2. **Longevity-associated genes**: Dozens of genes have been associated with human longevity, although only a handful have shown consistent effects across populations. These genes are often related to specific biological pathways and processes [4]. 3. **Genes involved in mitochondrial and proteasomal functions**: Aging is associated with alterations in the transcript levels of many genes, including those involved in evolutionarily conserved mitochondrial and proteasomal functions. Some of these genes have been shown to be directly involved in regulating lifespan [7]. 4. **Genes with protective functions**: Certain genes that are overexpressed with age seem to have protective functions, helping organisms manage aging. These genes could be targets for manipulation to potentially influence the aging process [8]. These genetic factors highlight the complex interplay of various genes and pathways in the aging process.",
+ "question": "what genetic factor are associated with aging"
+}
\ No newline at end of file
diff --git a/gnqa/paper2_eval/data/dataset/domain_expert_aging_4 b/gnqa/paper2_eval/data/dataset/domain_expert_aging_4
new file mode 100644
index 0000000..09d1513
--- /dev/null
+++ b/gnqa/paper2_eval/data/dataset/domain_expert_aging_4
@@ -0,0 +1,65 @@
+{
+ "titles": [
+ "2012 - Genome-Environment Interactions That Modulate.pdf",
+ "2009 - The Human Ageing Genomic Resources online.pdf",
+ "2012 - Genome-Environment Interactions That Modulate.pdf",
+ "2011 - Clinical aspects and molecular diagnostics of skin aging.pdf",
+ "2015 - Transcriptomic profiles of aging in purified.pdf",
+ "2004 - A Transcriptional Profile of Aging.pdf",
+ "2020 - A multidimensional systems biology.pdf",
+ "2011 - Metabolism, Genomics, and DNA Repair in the Mouse Aging Liver.pdf",
+ "2010 - MicroRNA, mRNA, and protein expression link.pdf",
+ "2012 - GeneFriends An online co-expression analysis.pdf"
+ ],
+ "extraction_id": [
+ "d59d7882-333d-5576-86ab-3cfa6354b946",
+ "52c67b46-63f2-54ae-a78e-e9d54a55f6e4",
+ "d59d7882-333d-5576-86ab-3cfa6354b946",
+ "674e1da7-73d5-5101-b5a5-4981e483123c",
+ "e5fd1ff0-8df5-577f-9f2d-31b0941d5ce5",
+ "8b47c304-ee91-5c52-8324-79fd0bd32b27",
+ "9d1656aa-32d2-5094-8232-4817655b1cbd",
+ "a6a6b5ba-3a72-55c5-91bb-abe747624348",
+ "fc9974c9-2e48-5a08-9112-0109df9ce096",
+ "1839dfa6-7080-5de4-96cb-3493ca2056d3"
+ ],
+ "document_id": [
+ "b1a1997c-e9df-5dc0-9d12-a3977d0c64ec",
+ "e43cd3b6-ad8e-5422-ba7c-ceb6e66cc529",
+ "b1a1997c-e9df-5dc0-9d12-a3977d0c64ec",
+ "e32f8f2c-d3ad-5dae-a393-9bd87c370ebe",
+ "50f357a3-e0f2-5e32-a6b2-771fb4b1e1c6",
+ "4ab656a7-9656-526b-94e1-422875409b44",
+ "d040bfe3-e409-5b5c-b8f8-f3dd4fc060e3",
+ "a94fd15d-373e-51c5-ad74-a17e4260d32a",
+ "c3ae47b0-42dd-5ab0-8fec-a41831d1bbfa",
+ "be20af52-c782-5098-893a-9a92000bf5a0"
+ ],
+ "id": [
+ "chatcmpl-ABLwhCwS1z9hZBn1zWSbHm5JcD7CF",
+ "b719fbc0-94e4-5df0-abb7-0d13fc36214c",
+ "8fd5ab85-67ed-55e6-bbfa-09436c4fdbfb",
+ "413f8f54-b5cc-5089-9f5c-d9e3b8bcf594",
+ "896169ed-4b9b-5ebc-9c9d-1cd2e6f3437c",
+ "6b4fb407-fd3f-52a3-9cfd-07dc5c891dd5",
+ "17ce11f7-55df-59bd-a801-a6f38ae9a9ef",
+ "61baeaa5-d65a-54b5-bfee-9bab8bbf1985",
+ "3414ff44-7d60-5492-9956-353ab9a94a43",
+ "b2d47567-09dc-5c77-be72-9448aa954e6b",
+ "1c3f7772-e1fa-5063-bda7-04b2f7e7b0e3"
+ ],
+ "contexts": [
+ "lar signatures of mammalian aging. Some of the genes",
+ "www.ncbi.nlm.nih.gov/homologene) of genes strongly asso-ciated with aging in model organisms. Also included are genesin which mutations result in segmental progeroid syndromes,such as the Werners syndrome gene, as well as genes criticalin pathways previously related to aging, such as the insulin/insulin-like signalling pathway (de Magalhes et al ., 2005a). The",
+ "overexpressed with age seem to be a response to aging,in that they have been previously found to have protec-tive functions (de Magalha es et al., 2009b). As such,these genes may help organisms manage aging andcould be targets for manipulation. Likewise, gene ex-pression analysis of CR has been conducted to identifyassociated genes (Lee et al., 1999, 2000). A number ofmolecular signatures have emerged from such studiesthat could be useful to identify candidate processes andpathways that affect aging,",
+ "expression profile of aging in human muscle. Physiol Genomics 2003;14:149-59. 142. Rodwell GE, Sonu R, Zahn JM. A transcriptional profile of aging inthe human kidney. PLoS Biol 2004;e427:2. 143. Hasty P, Campisi J, Hoeijmakers J, van Steeg H, Vijg J. Aging and genome maintenance: lessons from the mouse? Science 2003;299:1355-9. 144. Kyng KJ, May A, Klvraa S, Bohr VA. Gene expression profiling in Werner syndrome closely resembles that of normal aging. Proc Natl Acad Sci U S A 2003;100:12259-64.",
+ "neurodegenerative diseases. Nature. 2006;443:787 95. 50. de Magalhes JP, Curado J, Church GM. Meta-analysis of age-related gene expression profiles identifies common signatures of aging. Bioinformatics. 2009;25:875 81. 51. Zahn JM, Poosala S, Owen AB, Ingram DK, Lustig A, Carter A, et al. AGEMAP: a gene expression database for aging in mice. PLoS Genet. 2007;3:e201. 52. Liu LF, Shen WJ, Ueno M, Patel S, Kraemer FB. Characterization of age- related gene expression profiling in bone marrow and epididymal",
+ "Ly DH, Lockhart DJ, Lerner RA, Schultz PG (2000) Mitotic misregulation and human aging. Science 287: 24862492. McCarroll SA, Murphy CT, Zou S, Pletcher SD, Chin CS, et al. (2004) Comparing genomic expression patterns across species identies shared transcriptional prole in aging. Nat Genet 36: 197204. Murphy CT, McCarroll SA, Bargmann CI, Fraser A, Kamath RS, et al. (2003) Genes that act downstream of DAF-16 to inuence the lifespan of Caenorhabditis elegans Nature 424: 277283.",
+ "genes driving cellular senescence, and perform various integrative analyses. Genes inducing cellular senescence tend to be overexpressed with age in human tissues and are significantly overrepresented in anti-longevity and tumor-suppressor genes, while genes inhibiting cellular senescence overlap with pro-longevity and oncogenes. Furthermore, cellular senescence genes are strongly conserved in mammals but not in invertebrates. We also build",
+ "exhibits important alterations in global gene expressionproles with age. In mice, aging is accompanied by changesin expression of genes associated with increased inamma-tion, cellular stress, brosis, altered capacity for apoptosis,xenobiotic metabolism, normal cell-cycle control, and DNAreplication [ 5]. Lifelong calorie restriction reversed the",
+ "stance, genes associated with energy production, which decrease their expression during aging across various tissues and species (Zahn et al. 2006, 2007; de Magalha es et al. 2009), start decreasing at this transition point in our data (group 5; Fig. 2A). Hence, 25 yr of age in humans may mark the beginning of systemic change associated with certain senescence processes. Conservation of expression changes with age We observe that both developmental and aging expression pro-",
+ "p <10 -6; Table 1 shows the top 25 genes. Many of these genes have been associated with age-related diseases.Several other genes that have been shown to play a role in aging such as lysosomal-associated membrane protein-2 Lamp2 [19] (p = 5.68 -30), Fas [20] (p = 2.70-31) and growth hormone receptor Ghr [21] (p = 1.34-19) also showed a significant co-expression. Anxa2, Anxa3 and Anxa4 also show a low p-value (p < 10-25) as well as several S100 calcium binding proteins which have been"
+ ],
+ "answer": "Genes typically associated with early aging include those involved in segmental progeroid syndromes, such as the Werner syndrome gene [2]. Additionally, genes critical in pathways previously related to aging, such as the insulin/insulin-like signaling pathway, are also associated with early aging [2].",
+ "question": "which genes are typically associated with early aging?"
+}
\ No newline at end of file
diff --git a/gnqa/paper2_eval/data/dataset/domain_expert_aging_5 b/gnqa/paper2_eval/data/dataset/domain_expert_aging_5
new file mode 100644
index 0000000..c0b3b7f
--- /dev/null
+++ b/gnqa/paper2_eval/data/dataset/domain_expert_aging_5
@@ -0,0 +1,65 @@
+{
+ "titles": [
+ "2001 - A genome-wide scan for linkage to human.pdf",
+ "2023 - A transcriptome-based single-cell biological age model.pdf",
+ "2015 - A Chromosome 13 locus is associated with male-specific mortality in mice.pdf",
+ "2007 - Longevity Genomics Across Species.pdf",
+ "2020 - Clinical Genetics and Genomics of Aging.pdf",
+ "2017 - Systems genetic analysis in GeneNetwork.org.pdf",
+ "2021 - Gene-by-environment modulation of lifespan and weight gain in the murine BXD family.pdf",
+ "2016 - Systems genetics identifies Hp1bp3 as a novel modulator of cognitive aging.pdf",
+ "2015 - A Chromosome 13 locus is associated with male-specific mortality in mice.pdf",
+ "2009 - Meta-analysis of age-related gene expression profiles identifies.pdf"
+ ],
+ "extraction_id": [
+ "17246c43-2e44-579b-867d-3dc7150ceedd",
+ "0fd46f00-d3e1-54f4-9395-6c3e8294ed51",
+ "5cc56e3b-53ab-5299-814d-014e2ed31d2f",
+ "522e2616-daa1-5bf3-8673-a717dfb9b13f",
+ "5c3840bd-45a5-5928-84ab-a1f2d8536691",
+ "59121146-02b9-5479-96e2-9fb45cffc81b",
+ "396683f9-b2e3-5942-bec8-f96fa798c341",
+ "382122b9-6922-5d85-9e8c-acfa86aff085",
+ "df0b4be9-3393-5642-a722-ccafffb60df8",
+ "4d95f551-34bd-5e7a-8702-eb59de73a480"
+ ],
+ "document_id": [
+ "1431984a-82d9-51d4-a23c-5f76a02ab554",
+ "9be234b7-f37d-5cd5-8895-bfe676441b2f",
+ "ad8f2626-87fb-520e-8cef-ee9a9cc3ab0b",
+ "1ab0b63f-d97c-5f5c-98ee-0bde785fa630",
+ "62b635c3-040e-512a-b016-6ef295308a1e",
+ "41be0f9f-a5af-5586-b6cd-16e56fd89cdc",
+ "4d082da4-fa48-5170-8147-c4fea47a5d4b",
+ "8cde78ac-cb0e-5983-86ee-91074b2fe1e3",
+ "ad8f2626-87fb-520e-8cef-ee9a9cc3ab0b",
+ "5c2cf97f-a57a-5284-85a3-b8d9c5943113"
+ ],
+ "id": [
+ "chatcmpl-ABLwlxjoJ15UXMdKPBfDnYfvZNLDD",
+ "e0cce1c5-8709-5218-99b6-48a6ba242931",
+ "9f9fef49-0bda-5948-93bd-0f8f43bbefdf",
+ "09da6f9e-b996-5438-91be-41d9438cb930",
+ "ab0845d4-b4db-53db-927e-b96a52cf7667",
+ "c2299f0f-9e0b-5279-90e5-37c6bd664976",
+ "3004d1fd-c5ce-5587-bfab-471e7141952c",
+ "9082d164-59f8-58a0-ace7-8b3aa9d884e2",
+ "7abf14d2-cdfe-5c37-8217-6b63bd8fb255",
+ "380ca35e-b42b-59b4-aef7-aaf2ba3bb59d",
+ "eea576fd-d766-5ae7-9e63-045869a3f8f7"
+ ],
+ "contexts": [
+ "effect fundamental mechanisms of aging (14, 16). The drawbacksof such studies include the improbability of picking the right geneto study the myriad of known and unknown genes affecting theprocess of interest (17). The linkage study described heremarkedly improves the efficiency of such association studies bydefining a region likely to contain polymorphism(s) with signif-icant influence on life span. Additional association studies with these families and repli-",
+ "Map contains 1119 and 1459 curated human and mouse aginggenes, respectively, covering almost all scales of aging, rangingfrom molecular damage to genetic predisposition. Cross-speciescomparison revealed a modest overlap between known humanand mouse aging genes, suggesting both conservation of core sen- escence pathways and fundamental differences in aging between mice and humans (Fig. 2E). Aging-associated genes can alternatively be identified in a",
+ "11. Gelman R, Watson A, Bronson R et al (1988) Murine chromo- somal regions correlated with longevity. Genetics 118(4):693704 12. Jackson AU, Galecki AT, Burke DT et al (2002) Mouse loci associated with life span exhibit sex-specic and epistatic effects. J Gerontol A Biol Sci Med Sci 57(1):B9B15 13. Foreman JE, Lionikas A, Lang DH et al (2009) Genetic archi- tecture for hole-board behaviors across substantial time intervalsin young, middle-aged and old mice. Genes Brain Behav",
+ "Along with longevity, a select group of potential aging-related biomarkers will be assayed for each of these mouse models. In addition, it should be possible to assay several of these mouse lines for resistance to specific age-associated diseases, such as diabetes and neurological disorders, by crossing them into the appropriate transgenic disease back- ground. CONCLUSION Our understanding of the basic mechanisms of aging have benefited greatly from the use of simple model systems",
+ "198 the study of age-related diseases for various reasons: (a) mice are closely related to humans, with nearly 99% of human orthologous in mice; (b) their relatively short lifespan and small size allow surveillance of the aging process within a pertinent time frame and make their housing less expensive; (c) the feasibility of performing genetic manipulations facilitates the engineering of transgenic strains (gain- and loss-of function mice) that model premature aging disorders. In this section, we",
+ "Hsu HC, Lu L, Yi N, Van Zant G, Williams RW, Mountz JD. Quantitative trait locus (QTL) mapping in aging systems. Methods in Molecular Biology (Clifton, NJ ). 2007; 371:321348. Hunter KW, Crawford NPS. The future of mouse QTL mapping to diagnose disease in mice in the age of whole-genome association studies. Annual Review of Genetics. 2008; 42:131141. Ito R, Robbins TW, Everitt BJ. Differential control over cocaine-seeking behavior by nucleus",
+ "multiscalar integration of traits. Cell150, 12871299 (2012). [PubMed: 22939713] 33. De Haan G & Van Zant G Genetic analysis of hemopoietic cell cycling in mice suggests its involvement in organismal life span. FASEB J. Off. Publ. Fed. Am. Soc. Exp. Biol. 13, 707713 (1999). 34. Gelman R, Watson A, Bronson R & Yunis E Murine chromosomal regions correlated with longevity. Genetics 118, 693704 (1988). [PubMed: 3163317] 35. Houtkooper RHet al.The metabolic footprint of aging in mice. Sci. Rep1, (2011).",
+ "mice to identify genetic factors involved in the regulation of cognitive aging that may have gone undetected in either complex human studies or murine studies utilizing only a single genetic background. Aging is a leading risk factor for age-associated de- mentias such as AD, and our work and others suggest that geneticfactors and mechanisms underlying biological processes during midlife play a key role in determining an individual s susceptibility",
+ "span and have yielded insights into potential biological pathways and processes related to aging. Despite these successes, several problems are inherent in human longevity studies including potentially high degrees ofenvironmental heterogeneity, genetic diversity, and lack of birth matched controls, among others [ 8]. Inbred mouse strains represent a powerful alternative for identifying genes underlying complex trait genes such as longevity [ 9]. Initial mapping approaches include quanti-",
+ "Recently, the Atlas of Gene Expression in Mouse Aging Project (AGEMAP) reported gene expression proles with age for 8932genes in 16 mouse tissues (Zahn et al ., 2007). We chose not to"
+ ],
+ "answer": "To generate a linkage or association mapping study in mice to understand aging, you can follow these steps: 1. **Select Appropriate Mouse Strains**: Choose inbred mouse strains that are known to have variations in lifespan and aging-related traits. Inbred strains are powerful for identifying genes underlying complex traits such as longevity due to their genetic uniformity [9]. 2. **Identify Aging-Related Traits**: Determine the specific aging-related traits you want to study. This could include longevity, resistance to age-associated diseases, or other biomarkers of aging [4]. 3. **Cross Mouse Strains**: Perform crosses between different inbred strains to generate F1 and F2 generations. This helps in creating genetic diversity necessary for mapping studies [5]. 4. **Phenotyping**: Assay the F2 generation (or other relevant generations) for the aging-related traits of interest. This could involve measuring lifespan, resistance to diseases, or other biomarkers [4]. 5. **Genotyping**: Genotype the mice using molecular markers such as SNPs (Single Nucleotide Polymorphisms) to identify genetic variations across the genome [6]. 6. **Quantitative Trait Locus (QTL) Mapping**: Use QTL mapping techniques to identify chromosomal regions associated with the aging-related traits. This involves statistical analysis to correlate genetic markers with phenotypic traits [6]. 7. **Association Studies**: Conduct association studies to further refine the regions identified by QTL mapping. This involves looking for specific polymorphisms within the identified regions that are significantly associated with the traits [1]. 8. **Replication and Validation**: Validate the findings by replicating the study in different mouse populations or using additional genetic crosses. This helps to confirm the association between the identified genetic regions and aging traits [1]. By following these steps, you can generate a linkage or association mapping study in mice to understand the genetic factors influencing aging.",
+ "question": "How do I generate a linkage or association mapping study in mice to understand aging?"
+} \ No newline at end of file
diff --git a/gnqa/paper2_eval/data/dataset/domain_expert_aging_6 b/gnqa/paper2_eval/data/dataset/domain_expert_aging_6
new file mode 100644
index 0000000..8e291e1
--- /dev/null
+++ b/gnqa/paper2_eval/data/dataset/domain_expert_aging_6
@@ -0,0 +1,65 @@
+{
+ "titles": [
+ "2010 - A Meta-analysis of Four Genome-Wide Association Studies.pdf",
+ "2014 - Whole-Genome Sequencing of the World?s Oldest People.pdf",
+ "2011 - Genome-wide association study identifies a single major locus contributing to survival into old age the APOE locus revisited.pdf",
+ "2017 - Four Genome-Wide Association Studies Identify New.pdf",
+ "2012 - Whole genome sequences of a male and female supercentenarian, ages greater than 114 years.pdf",
+ "2013 - Genome Instability and Aging.pdf",
+ "2012 - Genome-wide miRNA signatures of human longevity.pdf",
+ "2012 - Whole genome sequences of a male and female supercentenarian, ages greater than 114 years.pdf",
+ "2011 - Genome-wide association study identifies a single major locus contributing to survival into old age the APOE locus revisited.pdf",
+ "2017 - Genome-wide meta-analysis associates HLA.pdf"
+ ],
+ "extraction_id": [
+ "8bc54e5b-f45f-54f9-9591-1e26dd80b50d",
+ "c918522d-c0bf-5b7a-9ced-a69d485b2cb6",
+ "a4aa5d3a-81e8-582c-aee6-3ebdd329de86",
+ "b539194c-50bb-55e5-83b2-e779f63ed363",
+ "402ab5b5-e6fa-58fe-8f32-7c235be7a746",
+ "f33756b1-7d64-5ab9-bcd6-717deaf05339",
+ "e79b0811-a0f3-5f44-8004-89fe59aa8a3e",
+ "402ab5b5-e6fa-58fe-8f32-7c235be7a746",
+ "a4aa5d3a-81e8-582c-aee6-3ebdd329de86",
+ "9c6a9e93-5dc5-571d-b3c2-b600ed95e102"
+ ],
+ "document_id": [
+ "8e452186-a71c-5b62-81b2-7681c87c8e1d",
+ "d2a5ec28-873a-5ff3-9cf4-dbec3b52dd21",
+ "05208abc-5ac0-5d4d-b600-2caf59ce75b7",
+ "c10653f6-b3d7-5b92-9271-ab8fcc7905a7",
+ "408cdcd5-ab70-520a-b2c4-d9028b0a8d6f",
+ "71e08916-8cc8-5d96-8c06-4461b972b54d",
+ "18407659-c241-5f37-8ad2-ab59f6a7e288",
+ "408cdcd5-ab70-520a-b2c4-d9028b0a8d6f",
+ "05208abc-5ac0-5d4d-b600-2caf59ce75b7",
+ "3a565ba9-ee5b-5596-b870-ce8c055cb1f1"
+ ],
+ "id": [
+ "chatcmpl-ABLwzkPUEqxCEqW5L5wugbbowvYPv",
+ "c2234f77-2268-57d0-a227-e931fc4802c1",
+ "fb0af8f1-5b2a-5ba1-8a53-ee543a9267bf",
+ "754929a6-af78-569a-969c-e750d174b952",
+ "4a6d2b9b-9496-5d90-a24a-43c643c4916b",
+ "1f4437a7-cee1-5dc2-80e1-9924248857d0",
+ "91010ff1-43a7-53f6-966d-601913e3b26b",
+ "63ebd662-9aca-5b8a-b3e3-89860a45da42",
+ "53a8e33f-da6f-5550-bf18-e45f2779f7a9",
+ "57227bee-d562-52c9-86dc-f9e2fcea1792",
+ "b1b9f731-236c-5b4b-8cc6-fcf1e06d866a"
+ ],
+ "contexts": [
+ "GENOME-WIDE ASSOCIATION STUDY OF LONGEVITY 479 INCREASES in longevity of the general population world - wide are an unprecedented phenomenon with significant health and social impact. Although environmental factors have led to an increase in life span, there is ample evidence that genetic factors are involved in extreme longevity both in humans (17) and in other organisms (8). The protective genetic factors that lead to longevity are likely to involve",
+ "that any genetic variant that contributes strongly to extremelongevity would also be rare. One possibility is that a specificmutation could alter the protein-coding region in a gene andconfer a significant increase in longevity. Such a mutation couldact in a dominant or recessive fashion, and might be shared by asignificant fraction of the supercentenarian genomes but not bycontrol genomes. We created a computational pipeline todetermine whether our supercentenarian genomes are enrichedfor such a variant",
+ "ever, natural human and animal longevity is presumed to be acomplex trait (Finch & Tanzi, 1997). In humans, both candidategene and genome-wide genetic association approaches havebeen applied in an attempt to identify longevity loci. The fre-quency of genetic variants has been typically compared between nonagenarian cases and young controls, revealing",
+ "genetic makeup of extreme longevity is based on a combination of common and rare variants, with common vari-ants that create the background to survive to relatively common old ages, and specific combinations of uncommon and rare variants that add an additional survival advantage to even older ages. Our analy-sis showed that LAVs discovered through a casecontrol study are not necessarily the variants that make someone live to extreme old age, and additional survival analysis is needed to characterize and",
+ "genetic determination of human exceptional longevity, they arethe rst step toward the generation of a comprehensive referencepanel of exceptionally long-lived individuals. The data also provideinteresting insights into genetic backgrounds that are conduciveto exceptional longevity and allow us to test different models of exceptional longevity. www.frontiersin.org January 2012 | Volume 2 | Article 90 | 1",
+ "tremely long lived individuals. Longevity has a genetic component, with an estimated heritability of average life expectancy of approximately 25% (105, 106). Family studies of centenarians, thosewho live to 100 years or more, suggest that the relationship between genetics and longevity is stronger in the oldest-old adults (107, 108), supporting the utility of long-lived individuals as a model system for studying genetic variations that predispose people to longevity.",
+ "because of genetic variation that becomes particularly important for sur- vival at advanced age (Hjelmborg et al. , 2006). Epidemiological studies have revealed that long-lived individuals (LLI), that is, people surviving to the 95th percentile of the respective birth cohort-specic age distribu- tions (Gudmundsson et al. , 2000), frequently show a favorable (healthy) course of the aging process, with the absence or a delayed onset of age-",
+ "Studies of centenarians have provided strong evidence to sup-port the hypothesis that a genetic contribution to human excep-tional longevity is decisive, although only a small number ofgenetic variants with modest effects have been irrefutably linkedto this phenotype ( Schachter et al., 1994; Barzilai et al., 2003 ; Christensen et al., 2006 ;Wheeler and Kim, 2011 ). The tech- nology of next generation sequencing provides a tool to gen-erate data that may eventually provide an answer ( Metzker, 2009).",
+ "genetic contribution to human lifespan variation was estimated at 2530% in twin studies (Gudmundsson et al. , 2000; Skytthe et al. , 2003; Hjelmborg et al. , 2006). The most prominent genetic inuence is observed in families in which the capacity toattain a long lifespan clusters (Perls et al. , 2000; Schoenmaker et al. , 2006). Exceptional longevity can be reached with a low degree of age-related disability (Christensen et al. , 2008; Terry et al. , 2008), raising the question whether protective mecha-",
+ "age, usually de ned by a threshold, such as 90 years). Up to 25% of the variability in human lifespan has been estimated to be genetic1, but genetic variation at only three loci (near APOE , FOXO3A and CHRNA3/5 )25have so far been demonstrated to be robustly associated with lifespan. Prospective genomic studies of lifespan have been hampered by the fact that subject participation is often only recent, allowing insuf cient follow-up time for a well-powered analysis of"
+ ],
+ "answer": "Yes, there are specific genetic variations that can contribute to someone living longer. Research has shown that certain genes and their variations can influence longevity. For example, variations near the genes APOE, FOXO3A, and CHRNA3/5 have been robustly associated with lifespan [10]. These genetic variations can affect how our bodies age and how well we can avoid or delay age-related diseases. In simpler terms, think of these genes as parts of a car. Just like how certain parts can make a car run more efficiently and last longer, these genetic variations can help our bodies function better and live longer. However, it's important to note that living a long life is usually a combination of both genetic factors and lifestyle choices, such as diet, exercise, and avoiding harmful habits.",
+ "question": "is there a specific genetic variation that can cause someone to live longer? please make your answer accessible to a non-expert"
+} \ No newline at end of file