Whole-genome sequencing of a mid-20th-century femur from central Israel in an open missing-person case
Whole-genome sequencing of a mid-20th-century femur found in a cave in central Israel revealed that the remains belonged to a person of Arab ancestry, thereby refuting the initial hypothesis that the individual was of Ashkenazi Jewish origin.
Original authors:Vol, E., Waldman, S., Lomes, A., Brielle, E. S., Appel, N., Dolin, B., Asif, S., Nagar, Y., Marco, E., Bergman, N., Khaner, O., Raviv, D., Oliel, J., Lewis, R. Y., Carmi, S.
This is an AI-generated explanation of a preprint that has not been peer-reviewed. It is not medical advice. Do not make health decisions based on this content. Read full disclaimer
This paper describes a forensic investigation into a cold case involving human remains found in a cave in central Israel. The bone, a femur, was thought to belong to an Ashkenazi Jewish person who had been missing since 1948.
To identify the person, the researchers used whole-genome sequencing. This technology attempts to read the entire genetic code from a sample. However, working with old bone is difficult because the DNA is often highly degraded. In this case, the researchers faced a significant challenge: out of nearly 500 million genetic sequences they read, only 0.5% actually matched the human genome. This resulted in a very low "depth of coverage," meaning they could only see a tiny fraction of the person's genetic information.
Despite this limited data, the researchers performed quality control and determined the sample came from a male. They then used statistical methods to analyze the genetic patterns to determine the person's ancestry.
The results of these analyses suggested that the genome belonged to a person of Arab ancestry. This finding refuted the original hypothesis that the remains belonged to someone of Ashkenazi Jewish origin. The paper presents this as the first known instance of using whole-genome sequencing on a mid-20th-century bone sample from the Middle East to resolve a missing-person case.
Technical Summary: Whole-Genome Sequencing of a Mid-20th-Century Femur
Problem
The study addresses a forensic challenge common in "cold cases": identifying the identity or ancestry of skeletal remains from the mid-20th century to generate investigative leads. Specifically, the researchers investigated a femur discovered in a cave in Central Israel. The initial hypothesis proposed by investigators was that the remains belonged to an individual of Ashkenazi Jewish ancestry who had been missing since 1948. The central problem was to determine whether the genetic profile of the sample supported or refuted this specific ancestral hypothesis.
Methodology
The researchers employed advanced genomic techniques designed to handle potentially degraded ancient or historical DNA:
Sample Type: A femur bone from the mid-20th century.
DNA Extraction & Library Preparation: To maximize the recovery of highly fragmented DNA, the team utilized single-stranded library preparation. This method is specifically chosen for degraded samples as it is more efficient at capturing short or damaged DNA fragments than traditional double-stranded methods.
Sequencing: The team performed Whole-Genome Sequencing (WGS), generating a massive dataset of nearly 500 million reads.
Bioinformatic Analysis:
Mapping: Reads were mapped to the human reference genome.
Quality Control (QC): Standard filtering to ensure data integrity.
Sex Inference: Determination of the biological sex of the individual.
Ancestry Assignment: The researchers used Principal Component Analysis (PCA) and ADMIXTURE analyses—standard population genetic tools—to compare the sample's genetic markers against known global and regional reference populations.
Key Contributions
Methodological Milestone: The paper presents what is, to the authors' knowledge, the first instance of applying whole-genome sequencing to a mid-20th-century bone sample from the Middle East.
Forensic Application of WGS: It demonstrates the utility (and the limitations) of using high-throughput sequencing to resolve historical missing-person cases where traditional forensic DNA profiling might be insufficient.
Results
Low Endogenous DNA Content: Despite the high number of reads (500 million), the sequencing revealed significant contamination or degradation. Only 0.5% of the reads mapped to the human genome, resulting in a very low depth of coverage of 0.07x.
Ancestry Refutation: Despite the extremely low coverage, the bioinformatic analysis was sufficient to perform ancestry assignment. The results definitively indicated that the individual possessed Arab ancestry, directly refuting the hypothesis that the remains were of Ashkenazi Jewish origin.
Significance
This study highlights both the power and the technical hurdles of "paleogenomic" approaches in modern forensics.
Investigative Impact: It proves that even with extremely low-quality DNA (0.07x coverage), WGS can provide definitive ancestral information that can pivot a criminal or historical investigation.
Technical Cautionary Tale: The discrepancy between the massive number of reads (500 million) and the tiny fraction of human DNA (0.5%) serves as a technical reminder of the challenges of environmental contamination and DNA degradation in archaeological and forensic contexts.
Regional Importance: It establishes a precedent for using genomic technologies to resolve historical mysteries in the Middle East, a region with complex migratory and ancestral histories.