Genome Foundations

The genome is the entire set of genetic instructions found in a cell. In humans, the genome consists of 23 pairs of chromosomes, found in the nucleus, as well as a small chromosome found in the cells' mitochondria. Each set of 23 chromosomes contains approximately 3.1 billion bases of DNA sequence.4

Allele

An allele is one of two or more versions of a gene. An individual inherits two alleles for each gene, one from each parent. If the two alleles are the same, the individual is homozygous for that gene. If the alleles are different, the individual is heterozygous. Though the term allele was originally used to describe variation among genes, it now also refers to variation among non-coding DNA sequences.4

Base Pair

A base pair is two chemical bases bonded to one another forming a "rung of the DNA ladder." The DNA molecule consists of two strands that wind around each other like a twisted ladder. Each strand has a backbone made of alternating sugar (deoxyribose) and phosphate groups. Attached to each sugar is one of four bases--adenine (A), cytosine (C), guanine (G), or thymine (T). The two strands are held together by hydrogen bonds between the bases, with adenine forming a base pair with thymine, and cytosine forming a base pair with guanine.4
Base Pair Illustration

Chromosome

A chromosome is an organized package of DNA found in the nucleus of the cell. Different organisms have different numbers of chromosomes. Humans have 23 pairs of chromosomes--22 pairs of numbered chromosomes, called autosomes, and one pair of sex chromosomes, X and Y. Each parent contributes one chromosome to each pair so that offspring get half of their chromosomes from their mother and half from their father.4   
Chromosome Illustration

Deoxyribonucleic Acid (DNA)

DNA is the chemical name for the molecule that carries genetic instructions in all living things. The DNA molecule consists of two strands that wind around one another to form a shape known as a double helix. Each strand has a backbone made of alternating sugar (deoxyribose) and phosphate groups. Attached to each sugar is one of four bases--adenine (A), cytosine (C), guanine (G), and thymine (T). The two strands are held together by bonds between the bases; adenine bonds with thymine, and cytosine bonds with guanine. The sequence of the bases along the backbones serves as instructions for assembling protein and RNA molecules.
DNA Graphic

Double Helix

Double helix is the description of the structure of a DNA molecule. A DNA molecule consists of two strands that wind around each other like a twisted ladder. Each strand has a backbone made of alternating groups of sugar (deoxyribose) and phosphate groups. Attached to each sugar is one of four bases: adenine (A), cytosine (C), guanine (G), or thymine (T). The two strands are held together by bonds between the bases, adenine forming a base pair with thymine, and cytosine forming a base pair with guanine.4
Double Helix Illustration

Epigenetics

The study of heritable changes that do not affect the DNA sequence but influence gene expression.3

Epigenome

The term epigenome is derived from the Greek word epi which literally means "above" the genome. The epigenome consists of chemical compounds that modify, or mark, the genome in a way that tells it what to do, where to do it, and when to do it. Different cells have different epigenetic marks. These epigenetic marks, which are not part of the DNA itself, can be passed on from cell to cell as cells divide, and from one generation to the next.4

Exon

An exon is the portion of a gene that codes for amino acids. In the cells of plants and animals, most gene sequences are broken up by one or more DNA sequences called introns. The parts of the gene sequence that are expressed in the protein are called exons, because they are expressed, while the parts of the gene sequence that are not expressed in the protein are called introns, because they come in between--or interfere with--the exons.4
Exon Illustration

Gene

The basic unit of heredity that occupies a specific location on a chromosome. Each consists of nucleotides arranged in a linear manner. Most genes code for a specific protein or segment of protein leading to a particular characteristic or function.3
Gene Illustration

Gene Expression

Gene expression is the process by which the information encoded in a gene is used to direct the assembly of a protein molecule. The cell reads the sequence of the gene in groups of three bases. Each group of three bases (codon) corresponds to one of 20 different amino acids used to build the protein.4
Gene Expression Illustration

Genetics                    

The study of genes and heredity. Heredity is the passing of genetic information and traits (such as eye color and an increased chance of getting a certain disease) from parents to offspring.2

Genotype                  

A genotype is an individual's collection of genes. The term also can refer to the two alleles inherited for a particular gene. The genotype is expressed when the information encoded in the genes' DNA is used to make protein and RNA molecules. The expression of the genotype contributes to the individual's observable traits, called the phenotype.4
Genotype Illustration

Germ Line                 

A germ line is the sex cells (eggs and sperm) that are used by sexually reproducing organisms to pass on genes from generation to generation. Egg and sperm cells are called germ cells, in contrast to the other cells of the body that are called somatic cells.4  
Germ Line Illustration

Genomics 

Genomics refers to the study of the entire genome, essentially all the genes that can be found in an organism.4    

Example: If you wanted to study the genomics of an organism or a person you could sequence all of their genes and all of their DNA and look for changes and make comparisons with other individual's genomes.4

Haplotype

A haplotype is a set of DNA variations, or polymorphisms, that tend to be inherited together. A haplotype can refer to a combination of alleles or to a set of single nucleotide polymorphisms (SNPs) found on the same chromosome. Information about haplotypes is being collected by the International HapMap Project and is used to investigate the influence of genes on disease.4

Heterozygous

Heterozygous refers to having inherited different forms of a particular gene from each parent. A heterozygous genotype stands in contrast to a homozygous genotype, where an individual inherits identical forms of a particular gene from each parent.4 
Heterozygous Illustration

Homozygous

Homozygous is a genetic condition where an individual inherits the same alleles for a particular gene from both parents.4 
Homozygous Illustration

Intron

An intron is a portion of a gene that does not code for amino acids. In the cells of plants and animals, most gene sequences are broken up by one or more introns. The parts of the gene sequence that are expressed in the protein are called exons, because they are expressed, while the parts of the gene sequence that are not expressed in the protein are called introns, because they come in between the exons.4 
Intron Illustration

Locus

A locus is the specific physical location of a gene or other DNA sequence on a chromosome, like a genetic street address. The plural of locus is "loci".4

Loss of Heterozygosity (LOH)

If there is one normal and one abnormal allele at a particular locus, as might be seen in an inherited autosomal dominant cancer susceptibility syndrome, loss of the normal allele produces a locus with no normal function. When the loss of heterozygosity involves the normal allele, it creates a cell that is more likely to show malignant growth if the altered gene is a tumor suppressor gene.3           

Microbiome

A microbiome is all of the genetic material found within an individual microbe such as a bacterium, fungal cell, or virus. It also may refer to the collection of genetic material found in a community of microbes that live together.4  
Microbiome Illustration

Microsatellite

Microsatellite sequences are repetitive DNA sequences usually several base pairs in length. They are used as genetic markers to follow the inheritance of genes in families.4 

Promoter

A promoter is a sequence of DNA needed to turn a gene on or off. The process of transcription is initiated at the promoter. Usually found near the beginning of a gene, the promoter has a binding site for the enzyme used to make a messenger RNA (mRNA) molecule.4
Promoter Illustration

Pseudogene

A pseudogene is a DNA sequence that resembles a gene but has been mutated into an inactive form over the course of evolution. A pseudogene shares an evolutionary history with a functional gene and can provide insight into their shared ancestry.4

Ribonucleic Acid (RNA)

Ribonucleic acid (RNA) is a molecule similar to DNA. Unlike DNA, RNA is single-stranded. An RNA strand has a backbone made of alternating sugar (ribose) and phosphate groups. Attached to each sugar is one of four bases--adenine (A), uracil (U), cytosine (C), or guanine (G). Different types of RNA exist in the cell: messenger RNA (mRNA), ribosomal RNA (rRNA), and transfer RNA (tRNA). More recently, some small RNAs have been found to be involved in regulating gene expression.4
RNA Illustration

Telomere

A telomere is the end of a chromosome. Telomeres are made of repetitive sequences of non-coding DNA that protect the chromosome from damage. Each time a cell divides, the telomeres become shorter. Eventually, the telomeres become so short that the cell can no longer divide.4
Telomere Illustration

Wild-type gene

A term used to describe a gene when it is found in its natural, unchanged form.2


Subcategory: Clinical Genomics

Cascade Genetic Testing

The process of extending genetic testing to individuals at risk for inheriting a pathogenic variant previously identified in a biologic relative. This process is repeated as more pathogenic variant carriers are identified within the family. This process is sometimes referred to as cascade screening, although cascade genetic testing is the preferred term.3

Penetrance

Penetrance refers to the likelihood that a clinical condition will occur when a particular genotype is present. For adult-onset diseases, penetrance is usually described by the individual carrier's age, sex, and organ site.2

Example: The penetrance for breast cancer in female carriers of BRCA1 pathogenic variants is often quoted by age 50 years and by age 70 years.2

Pathognomonic

Findings that are distinctive or characteristic of a particular disease or condition and make the diagnosis.2

Precision Oncology

A form of medicine that uses information about a person’s own genes or proteins to prevent, diagnose, or treat disease. In cancer, precision medicine oncology uses specific information about a person’s tumor to help make a diagnosis, plan treatment, find out how well treatment is working, or make a prognosis. Also called personalized medicine, precision medicine, or precision health.2

Example: Examples of precision oncology include using targeted therapies to treat specific types of cancer cells, such as HER2-positive breast cancer cells, or using tumor marker testing to help diagnose cancer.2

Pharmacogenomics

The study of the relationship between genetic variations and how our body responds to medications.6

Phenocopy

A phenotypic trait or disease that resembles the trait expressed by a particular genotype, but in an individual who is not a carrier of that genotype.3

Example: A woman with early onset breast cancer who does not have a variant in any gene known to be associated with breast cancer.

Phenotype

The observable characteristics in an individual resulting from the expression of genes; the clinical presentation of an individual with a particular genotype.3

Example: An individual on colonoscopy found to have significant polyposis, which is a physical finding associated with variants in gene such as MUTYH and APC.