Human-Genetic Ancestry Inference and False Positives in Forensic Familial Searching

In forensic familial search methods, a query DNA profile is tested against a database to determine if the query profile represents a close relative of a database entrant. One challenge for familial search is that the calculations may require specification of allele frequencies for the unknown popula...

Full description

Bibliographic Details
Main Authors: Alyssa Lyn Fortier, Jaehee Kim, Noah A. Rosenberg
Format: Article
Language:English
Published: Oxford University Press 2020-08-01
Series:G3: Genes, Genomes, Genetics
Subjects:
Online Access:http://g3journal.org/lookup/doi/10.1534/g3.120.401473
id doaj-62ea0b1068714274a7042c46178f4228
record_format Article
spelling doaj-62ea0b1068714274a7042c46178f42282021-07-02T14:18:08ZengOxford University PressG3: Genes, Genomes, Genetics2160-18362020-08-011082893290210.1534/g3.120.40147329Human-Genetic Ancestry Inference and False Positives in Forensic Familial SearchingAlyssa Lyn FortierJaehee KimNoah A. RosenbergIn forensic familial search methods, a query DNA profile is tested against a database to determine if the query profile represents a close relative of a database entrant. One challenge for familial search is that the calculations may require specification of allele frequencies for the unknown population from which the query profile has originated. The choice of allele frequencies affects the rate at which non-relatives are erroneously classified as relatives, and allele-frequency misspecification can substantially inflate false positive rates compared to use of allele frequencies drawn from the same population as the query profile. Here, we use ancestry inference on the query profile to circumvent the high false positive rates that result from highly misspecified allele frequencies. In particular, we perform ancestry inference on the query profile and make use of allele frequencies based on its inferred genetic ancestry. In a test for sibling matches on profiles that represent unrelated individuals, we demonstrate that false positive rates for familial search with use of ancestry inference to specify the allele frequencies are similar to those seen when allele frequencies align with the population of origin of a profile. Because ancestry inference is possible to perform on query profiles, the extreme allele-frequency misspecifications that produce the highest false positive rates can be avoided. We discuss the implications of the results in the context of concerns about the forensic use of familial searching.http://g3journal.org/lookup/doi/10.1534/g3.120.401473ancestryfalse positivesfamilial identificationpopulation geneticsrelatedness
collection DOAJ
language English
format Article
sources DOAJ
author Alyssa Lyn Fortier
Jaehee Kim
Noah A. Rosenberg
spellingShingle Alyssa Lyn Fortier
Jaehee Kim
Noah A. Rosenberg
Human-Genetic Ancestry Inference and False Positives in Forensic Familial Searching
G3: Genes, Genomes, Genetics
ancestry
false positives
familial identification
population genetics
relatedness
author_facet Alyssa Lyn Fortier
Jaehee Kim
Noah A. Rosenberg
author_sort Alyssa Lyn Fortier
title Human-Genetic Ancestry Inference and False Positives in Forensic Familial Searching
title_short Human-Genetic Ancestry Inference and False Positives in Forensic Familial Searching
title_full Human-Genetic Ancestry Inference and False Positives in Forensic Familial Searching
title_fullStr Human-Genetic Ancestry Inference and False Positives in Forensic Familial Searching
title_full_unstemmed Human-Genetic Ancestry Inference and False Positives in Forensic Familial Searching
title_sort human-genetic ancestry inference and false positives in forensic familial searching
publisher Oxford University Press
series G3: Genes, Genomes, Genetics
issn 2160-1836
publishDate 2020-08-01
description In forensic familial search methods, a query DNA profile is tested against a database to determine if the query profile represents a close relative of a database entrant. One challenge for familial search is that the calculations may require specification of allele frequencies for the unknown population from which the query profile has originated. The choice of allele frequencies affects the rate at which non-relatives are erroneously classified as relatives, and allele-frequency misspecification can substantially inflate false positive rates compared to use of allele frequencies drawn from the same population as the query profile. Here, we use ancestry inference on the query profile to circumvent the high false positive rates that result from highly misspecified allele frequencies. In particular, we perform ancestry inference on the query profile and make use of allele frequencies based on its inferred genetic ancestry. In a test for sibling matches on profiles that represent unrelated individuals, we demonstrate that false positive rates for familial search with use of ancestry inference to specify the allele frequencies are similar to those seen when allele frequencies align with the population of origin of a profile. Because ancestry inference is possible to perform on query profiles, the extreme allele-frequency misspecifications that produce the highest false positive rates can be avoided. We discuss the implications of the results in the context of concerns about the forensic use of familial searching.
topic ancestry
false positives
familial identification
population genetics
relatedness
url http://g3journal.org/lookup/doi/10.1534/g3.120.401473
work_keys_str_mv AT alyssalynfortier humangeneticancestryinferenceandfalsepositivesinforensicfamilialsearching
AT jaeheekim humangeneticancestryinferenceandfalsepositivesinforensicfamilialsearching
AT noaharosenberg humangeneticancestryinferenceandfalsepositivesinforensicfamilialsearching
_version_ 1724163303043760128