Phenotypic Mutation 'andalusia' (pdf version)
Alleleandalusia
Mutation Type makesense
Chromosome9
Coordinate111,100,478 bp (GRCm39)
Base Change A ⇒ G (forward strand)
Gene Mlh1
Gene Name mutL homolog 1
Synonym(s) 1110035C23Rik, colon cancer, nonpolyposis type 2
Chromosomal Location 111,057,296-111,100,854 bp (-) (GRCm39)
MGI Phenotype FUNCTION: [Summary is not available for the mouse gene. This summary is for the human ortholog.] This gene was identified as a locus frequently mutated in hereditary nonpolyposis colon cancer (HNPCC). It is a human homolog of the E. coli DNA mismatch repair gene mutL, consistent with the characteristic alterations in microsatellite sequences (RER+phenotype) found in HNPCC. Alternative splicing results in multiple transcript variants encoding distinct isoforms. Additional transcript variants have been described, but their full-length natures have not been determined.[provided by RefSeq, Nov 2009]
PHENOTYPE: Homozygotes for targeted null mutations exhibit reduced pairing in meiotic prophase I and produce no mature germ cells. Mutants also display increased microsatellite instability and a predisposition for developing intestinal and other tumors. [provided by MGI curators]
Accession Number

NCBI RefSeq: NM_026810, NM_001324522; MGI:101938

MappedYes 
Amino Acid Change Stop codon changed to Arginine
Institutional SourceBeutler Lab
Gene Model predicted gene model for protein(s): [ENSMUSP00000035079] [ENSMUSP00000143786]
AlphaFold Q9JK91
SMART Domains Protein: ENSMUSP00000035079
Gene: ENSMUSG00000032498
AA Change: C39R

DomainStartEndE-ValueType
HATPase_c 23 158 4.57e-1 SMART
DNA_mis_repair 216 335 1.08e-44 SMART
low complexity region 363 375 N/A INTRINSIC
low complexity region 429 454 N/A INTRINSIC
Pfam:Mlh1_C 504 760 8.3e-100 PFAM
Predicted Effect possibly damaging

PolyPhen 2 Score 0.935 (Sensitivity: 0.80; Specificity: 0.94)
(Using ENSMUST00000035079)
Predicted Effect probably null
Meta Mutation Damage Score 0.9755 question?
Is this an essential gene? Non Essential (E-score: 0.000) question?
Phenotypic Category Autosomal Recessive
Candidate Explorer Status loading ...
Single pedigree
Linkage Analysis Data
Penetrance  
Alleles Listed at MGI

All Mutations and Alleles(14) : Endonuclease-mediated(1) Gene trapped(6) Targeted(7)

Lab Alleles
AlleleSourceChrCoordTypePredicted EffectPPH Score
IGL01306:Mlh1 APN 9 111081980 missense possibly damaging 0.84
IGL02530:Mlh1 APN 9 111058943 missense probably benign 0.09
IGL02811:Mlh1 APN 9 111100582 missense probably benign 0.04
IGL02892:Mlh1 APN 9 111082037 missense probably benign 0.00
IGL03394:Mlh1 APN 9 111097311 missense probably damaging 1.00
andalusia2 UTSW 9 111100591 start codon destroyed probably null 0.93
andalusia3 UTSW 9 111058906 critical splice donor site probably null
ANU23:Mlh1 UTSW 9 111081980 missense possibly damaging 0.84
PIT4495001:Mlh1 UTSW 9 111076328 missense probably benign 0.00
R0496:Mlh1 UTSW 9 111070624 missense probably benign
R0723:Mlh1 UTSW 9 111100540 missense probably damaging 1.00
R1395:Mlh1 UTSW 9 111076445 missense probably damaging 1.00
R1694:Mlh1 UTSW 9 111057543 missense probably damaging 1.00
R1762:Mlh1 UTSW 9 111058997 missense probably damaging 1.00
R1865:Mlh1 UTSW 9 111086092 intron probably benign
R1885:Mlh1 UTSW 9 111087624 missense probably benign 0.18
R1992:Mlh1 UTSW 9 111057631 missense probably damaging 0.96
R2186:Mlh1 UTSW 9 111087634 unclassified probably benign
R2680:Mlh1 UTSW 9 111065085 critical splice acceptor site probably null
R4693:Mlh1 UTSW 9 111084726 missense probably damaging 1.00
R4784:Mlh1 UTSW 9 111068866 missense probably benign
R5007:Mlh1 UTSW 9 111100478 makesense probably null
R5130:Mlh1 UTSW 9 111058906 critical splice donor site probably null
R5166:Mlh1 UTSW 9 111070581 missense probably benign 0.04
R5265:Mlh1 UTSW 9 111100591 start codon destroyed probably null 0.93
R5481:Mlh1 UTSW 9 111058905 splice site probably null
R5483:Mlh1 UTSW 9 111060126 missense possibly damaging 0.82
R5602:Mlh1 UTSW 9 111081946 missense probably damaging 0.97
R5658:Mlh1 UTSW 9 111076448 missense probably damaging 0.99
R5890:Mlh1 UTSW 9 111057563 missense possibly damaging 0.88
R6810:Mlh1 UTSW 9 111070626 missense possibly damaging 0.52
R7607:Mlh1 UTSW 9 111058958 missense probably damaging 1.00
R7753:Mlh1 UTSW 9 111081931 critical splice donor site probably null
R7894:Mlh1 UTSW 9 111059145 splice site probably null
R7912:Mlh1 UTSW 9 111090581 missense possibly damaging 0.69
R7995:Mlh1 UTSW 9 111064989 missense probably damaging 1.00
R8097:Mlh1 UTSW 9 111085160 critical splice donor site probably null
R8280:Mlh1 UTSW 9 111078286 critical splice donor site probably null
R8804:Mlh1 UTSW 9 111093972 missense probably damaging 1.00
R9562:Mlh1 UTSW 9 111060013 missense
Mode of Inheritance Autosomal Recessive
Local Stock
Repository
Last Updated 2019-09-04 9:40 PM by Anne Murray
Record Created 2017-05-22 8:55 AM by Bruce Beutler
Record Posted 2018-08-02
Phenotypic Description

Figure 1. Homozygous andalusia mice exhibit diminished T-dependent IgG responses to recombinant Semliki Forest virus (rSFV)-encoded β-galactosidase (rSFV-β-gal). IgG levels were determined by ELISA. Normalized data are shown. Abbreviations: WT, wild-type; REF, homozygous reference mice; HET, heterozygous variant mice; VAR, homozygous variant mice. Mean (μ) and standard deviation (σ) are indicated.

The andalusia phenotype was identified among N-ethyl-N-nitrosourea (ENU)-mutagenized G3 mice of the pedigree R5007, some of which showed diminished T-dependent antibody responses to recombinant Semliki Forest virus (rSFV)-encoded β-galactosidase (rSFV-β-gal) (Figure 1). 

Nature of Mutation

Figure 2. Linkage mapping of the diminished T-dependent IgG responses to rSFV-β-gal using a recessive model of inheritance. Manhattan plot shows -log10 P values (Y-axis) plotted against the chromosome positions of 83 mutations (X-axis) identified in the G1 male of pedigree R5007. Normalized phenotype data are shown for single locus linkage analysis without consideration of G2 dam identity. Horizontal pink and red lines represent thresholds of P = 0.05, and the threshold for P = 0.05 after applying Bonferroni correction, respectively.

Whole exome HiSeq sequencing of the G1 grandsire identified 83 mutations. The diminished T-dependent antibody response to rSFV-β-gal phenotype was linked by continuous variable mapping to mutations in two genes on chromosome 9: Pcbp4 and Mlh1. The mutation in Mlh1 was presumed causative as a second allele was discovered in a separate pedigree (R5265; see the record for andalusia2) that phenocopied andalusia. The Mlh1 mutation is a T to C transition at base pair 111,271,410 (v38) on chromosome 9, or base pair 377 in the GenBank genomic region NC_000075 encoding Mlh1. Linkage was found with a recessive model of inheritance, wherein five variant homozygotes departed phenotypically from 14 homozygous reference mice and 24 heterozygous mice with a P value of 1.335 x 10-6 (Figure 2).  A substantial semidominant effect was also observed (P = 6.901 x 10-5). 

The mutation corresponds to residue 199 in the mRNA sequence NM_026810 within exon 1 of 19 total exons.

 

184 GAGATGATAGAAAACTGTTTAGATGCAAAATCT

34  -E--M--I--E--N--C--L--D--A--K--S-

The mutated nucleotide is indicated in red. The mutation results in a cysteine to arginine substitution at position 39 (C39R) in the MLH1 protein, and is strongly predicted by PolyPhen-2 to be damaging (score = 0.935).

Illustration of Mutations in
Gene & Protein
Protein Prediction
Figure 3. Domain organization of MLH1. The andalusia mutation results in a cysteine to arginine substitution at position 39. Mutations found in the MLH1 protein are shown in red. Click on each mutation for more information.
Figure 4. Crystal structure of human MLH1. β-strands are shown in cyan, and α-helices are shown in pink. UCSF Chimera model is based on PDB 4P7A. Click on the 3D structure to view it rotate.

MLH1 (mutL homolog 1 [E. coli]) is a member of the GHKL (gyrase, Hsp90, histidine kinase, MutL) ATPase/kinase superfamily of proteins (1). MLH1 has an ATPase domain, MutS homolog (MSH2, MSH3, MSH6) interaction domain, EXO1 interaction domain, PMS2/MLH3/PMS1 interaction domain, and a CTH (C-terminal homology) motif (Figure 3). The significance of the interactions between MLH1 and its interacting partners will be described in more detail in the Background section, below. The CTH is required for the spellchecker function of MLH1 (2).

The X-ray crystal structure of a the human MLH1 N-terminus (amino acids 1 to 340) has been solved [Figure 4; PDB:4P7A; (3)]. The structure included the ATPase domain and a ‘transducer’ domain linked by two helices. The ATPase domain forms an ATPase Bergerat fold, which is composed of a four-stranded, antiparallel β-sheet (β1–β3 and β5) and three α-helices (αB–αD) (4). Between helices αC and αD (residues 74–85 and 98–101) is an ATP-binding loop (3). The ATP-binding loop contains a conserved GFRGE(A/G)L motif found in mismatch repair proteins (5). The transducer domain is a small α/β barrel with a ribosomal protein S5 domain 2-like fold and a left-handed α-helical crossover (αI) between β10 and β11. 

The andalusia mutation results in a cysteine to arginine substitution at position 39 (C39R) in the MLH1 protein; Cys39 is within the ATPase domain.

Expression/Localization

Mlh1 is ubiquitously expressed.

Background
Figure 5. Model for mammalian DNA mismatch repair (MMR). The MutSα complex recognizes single nucleotide mismatches and 1-bp insertion/deletion loops. MutSα binds to DNA as a sliding clamp. MutLα then binds to hMutSα to guide an exonuclease to remove several bases from the newly synthesized DNA strand, with subsequent re-synthesis of DNA with the correct base pairing.When the MutS complexes bind DNA, they exchange ADP for ATP. For the MMR proteins to be released from DNA, ATP is hydrolyzed to ADP. Figure and legend are adapted from Jang and Chung, 2010.
Figure 6. MLH1 functions in meiotic recombination. The two interacting DNA duplexes (chromatids) are shown as red and blue double strands. A double-strand break formed in the red duplex is resected by a 5′ to 3′ exonuclease to generate long 3′ single-strand overhangs. One single strand invades the homologous duplex, forming a heteroduplex and displacing a D loop, which is extended by DNA synthesis. The resulting D loop can either follow the synthesis-dependent strand annealing (SDSA) or double-strand break repair (DSBR) pathway. In the SDSA pathway, the D loop intermediate is disassembled by displacement of the newly synthesized DNA strand, which then anneals with the second single-strand overhang. Repair of the break is completed by DNA synthesis and ligation. The SDSA pathway generates only non-crossover products. In the DSBR pathway, the D loop anneals to the second 3′ end overhang, serving as a template for additional DNA synthesis. Ligation of the newly synthesized strands to the recessed ends leads to the formation of a Double Holliday junction with heteroduplex DNA flanking the double-strand break site. MutLγ (MLH1-MLH3) localizes to sites of crossing over in the meiotic chromosomes. This recombination intermediate can be resolved by cutting the outside strands or the inside strands of each junction. The resolution always involves opposite sense cutting (two outside strands and two inside strands); only one of the two alternatives is shown here. This modified version of the DSBR pathway would generate only crossover products. Legend was modified from Dooner, H.K. (2002).

The DNA mismatch repair (MMR) pathway removes base mismatches and insertion/deletion mispairs that occur during DNA replication and recombination (Figure 5). During MMR, a MutS heterodimer [MSH2-MSH6 (see the record for medea) (MutSα) or MSH2–MSH3 (MutSβ)] binds to DNA mismatches (6). MutSα preferentially recognizes single base (G/T) mismatches and one- or two-nucleotide insertion/deletion mispairs (7), while MutSβ preferentially recognizes insertion/deletion mispairs that contain two or more extra bases. Upon binding, the MutS undergoes an ADP to ATP exchange and a conformational change, followed by recruitment of MutLα (a MLH1-PMS2 heterodimer), MutLβ (a MLH1-PMS1 heterodimer), or MutLγ (a MLH1-MLH3 heterodimer). MutLα primarily functions in MMR, the function of MutLβ is unknown, and MutLγ primarily functions in meiotic recombination (8;9). The MutL complexes cleave the defective strand near the mismatch site. The MutS-MutL complex then recruits an exonuclease, subsequently leading to strand-specific excision; PCNA coordinates with the exonuclease to excise the mismatch-containing region. The removed DNA fragment is resynthesized by DNA polymerase δ and the repair process is completed by DNA ligase.

MLH1 (in complex with MLH3 [i.e., MutLγ]) functions in meiotic recombination (Figure 6). Meiotic recombination is initiated by a DNA double-strand break. After the DNA break, there is pairing of a homologous chromosome and strand invasion to initiate repair. Gap repair can proceed by crossover or non-crossover events. Crossover recombination putatively occurs by the Double Holliday Junction model, and non-crossover recombination putatively occurs by the Synthesis-dependent Strand Annealing model. MutLγ localizes to sites of crossing over in the meiotic chromosomes (10), and is required for oocytes to progress through metaphase II of meiosis (11).

Mutations in MLH1 are linked to hereditary nonpolyposis colorectal cancer type 2 (HNPCC2; OMIM: # 609310; (12)), mismatch repair cancer syndrome (OMIM: #276300; (13)), and Muir-Torre syndrome (OMIM: #158320; (14)). Muir-Torre syndrome is an autosomal dominant disorder characterized by development of sebaceous gland tumors and skin cancers, including keratoacanthomas and basal cell carcinomas. Patients can have several internal malignancies, including colorectal, endometrial, urologic, and upper gastrointestinal neoplasms. 

Male and female Mlh1-deficient (Mlh1-/-) mice exhibited infertility (10;15) and premature death. The Mlh1-/- mice showed reduced levels of chiasmata (10;16). The chromosomes in Mlh1-/- sperm separate prematurely during spermatogenesis. In addition, the first division of meiosis is frequently arrested (10). Mlh1-/- mice exhibit increased incidences of intestinal adenocarcinomas and adenomas, uterus tumor incidence, skin tumor incidence, and lymphoma incidence (15;17-20).

Primers PCR Primer
andalusia_pcr_F: TTCGGCCGTGTGCATAATG
andalusia_pcr_R: TAAAAGCGCTTGACTGGCATTC

Sequencing Primer
andalusia_seq_F: GCCGTGTGCATAATGGGAAACC
andalusia_seq_R: CATTCATGCTGCCCAATCAG
Genotyping

PCR program

1) 94°C 2:00
2) 94°C 0:30
3) 55°C 0:30
4) 72°C 1:00
5) repeat steps (2-4) 40x
6) 72°C 10:00
7) 4°C hold


The following sequence of 426 nucleotides is amplified (chromosome 9, - strand):


1   taaaagcgct tgactggcat tcatgctgcc caatcagcac ttgccgctgg gaagacggcg
61  caggactgca gtcggccgaa gctgaaggaa gaacttgagc gtgaggagct cgagtgattg
121 gctgactggg aactcgggcg ccaatatggc gtttgtagca ggagttattc ggcgtctgga
181 cgagacggta gtgaaccgca tagcggcggg ggaagtcatt cagcggccgg ccaatgctat
241 caaagagatg atagaaaact ggtacggagg gagcggagcc gagttccccg actgagagcc
301 ggggcgggcc gacccacctg ctgcagtcgg ccaccgtgcg gggcaccgag cacggggacc
361 gtgggggcat cgggcacagg gaccctgcgc gtgccaggct ggtttcccat tatgcacacg
421 gccgaa


Primer binding sites are underlined and the sequencing primers are highlighted; the mutated nucleotide is shown in red.

References
Science Writers Anne Murray
Illustrators Diantha La Vine
AuthorsJin Huk Choi and Bruce Beutler