The most prevalent method of DNA fingerprinting used today is based on PCR and uses Short Tandem Repeats (STRs). This method uses highly polymorphic regions that have short repeated sequences of DNA (the most common is 4 bases repeated, but there are other lengths in use, including 3 and 5 bases). Because different people have different numbers of repeat units, these regions of DNA can be used to discriminate between individuals. These STR locia (locations) are targeted with sequence-specific primers and are amplified using PCR. The DNA fragments that result are then separated and detected using electrophoresis. There are two common methods of separation and detection, capillary electrophoresis (CE) and gel electrophoresis.
The polymorphisms displayed at each STR region are by themselves very common, typically each polymorphism will be shared by around 5 - 20% of individuals. When looking at multiple loci, it is the unique combinations of these polymorphisms to an individual that makes this method discriminating as an identification tool. The more STR regions that are tested in an individual the more discriminating the test becomes.
From country to country different STR based DNA profiling systems are in use. In North America CODIS is prevalent, while in the UK the SGM+ system, which is compatible with The National DNA Database in use. Whichever system is used, many of the STR regions under test are the same. These DNA profiling systems are based around multiplex reactions, whereby many STR regions will be under test at the same time.
Capillary electrophoresis works by electrokinetically (movement through the application of an electric field) injecting the DNA fragments into a thin glass tube (the capillary) filled with polymer. The DNA is pulled through the tube by the application of an electric field, separating the fragments such that the smaller fragments travel faster through the capillary. The fragments are then detected using fluorescent dyes that were attached to the primers used in PCR. This allows multiple fragments to be amplified and run simultaneously, something known as multiplexing. Sizes are assigned using labeled DNA size standards that are added to each sample, and the number of repeats are determined by comparing the size to an allelic ladder, a sample that contains all of the common possible repeat sizes. Although this method is expensive, larger capacity machines with higher throughput are being used to lower the cost/sample and reduce backlogs that exist in many government crime facilities.
Gel electrophoresis acts using similar principles as CE, but instead of using a capillary, a large polyacrylamide gel is used to separate the DNA fragments. An electric field is applied, as in CE, but instead of running all of the samples by a detector, the smallest fragments are run close to the bottom of the gel and the entire gel is scanned into a computer. This produces an image showing all of the bands corresponding to different repeat sizes and the allelic ladder. This approach does not require the use of size standards, since the allelic ladder is run alongside the samples and serves this purpose. Visualization can either be through the use of fluorescently tagged dyes in the primers or by silver staining the gel prior to scanning. Although it is cost effective and can be rather high throughput, silver staining kits for STRs are being discontinued. In addition, many labs are phasing out gels in favor of CE as the cost of machines becomes more manageable.
The true power of STRs is in its statistical power of discrimination. In the U.S.A., there are 13 loci (DNA locations) that are currently used for discrimination. Because these loci are independently assorted (having a certain number of repeats at one locus doesn't change the likelihood of having any number of repeats at any other locus), the power rule of statistics can be applied. This means that if someone has the DNA type of ABC, where the three loci were independent, we can say that the probability of having that DNA type is the probability of having type A times the probability of having type B times the probability of having type C. This has resulted in the ability to generate match probabilities of 1 in a quintillion (1 with 18 zeros after it) or more.