Research Interests |
Protein-nucleic acid interactions play many important cellular roles, including regulating DNA replication, controlling gene expression and DNA house-keeping, such as maintaining a super-coiled state, performing damage and mismatch repair, allowing recombination, restricting foreign DNA, and the processes of DNA ligating, methylating and degrading. Our research goals are to provide better understanding for sequence-dependent and -independent recognition between proteins and DNA/RNA at atomic level. We would like to find out why some proteins recognize only specific sequences, but some recognize DNA without sequence preference. We work on several endonucleases which cleave DNA either site-specifically or randomly. Using X-ray diffraction methods, we elucidate the three-dimensional structures of these proteins and protein/DNA complexes to show the underlying structural basis for the selectivity principle of these proteins. Biochemistry and mutational approaches are also used to characterize the enzymatic function of these endonucleases. Two main projects are briefly described below ¡@ 1. The H-N-H family of endonucleases The H-N-H motif was first recognized by the sequence similarity in several intron-encoded homing endonucleases and bacteria toxins. The numbers of proteins discovered containing the H-N-H motif has increased substantially in the past several years; more than one hundred proteins are listed in databases (information found at the Pfam Protein Families database). The biggest subgroups in the H-N-H family are group I and group II homing endonucleases, which initialize the process of transferring a mobile intervening sequence into a homologous allele that lacks the sequence. These homing endonucleases recognize and cleave DNA site-specifically at target alleles. The second biggest subgroup of proteins in the H-N-H family are bacteria toxins, such as Escherichia coli colicins (ColE7 and ColE9) and Pseudomonas aeruginosa pyocins (S1 and S2). All of these toxins share a highly homologous C-terminal nuclease domain capable of hydrolyzing DNA nonspecifically in target cells. ¡@ |
HNH consensus E HH P GG NL H H Bacteriocins Colicin E7 SGKRTSFELHHEKPISQNGGVYDMDNISVVTPKRHIDIHRGK-576 Colicin E2 VGGRERFELHHDKPISQDGGVYDMNNIRVTTPKRHIDIHRGK-581 Colicin E8 VGGRRSFELHHDKPISQDGGVYDMDNLRITTPKRHIDIHRGQ Colicin E9 VGGRVKYELHHDKPISQGGEVYDMDNIRVTTPKRHIDIHRGK-582 Pyocin S1 AGGRIKIEIHHKVRVADGGGVYNMGNLVAVTPKRHIEIHKGG-617 Pyocin S2 AGGRIKIEIHHKVRIADGGGVYNMGNLVAVTPKRHIEIHKGG-770 Group I homing I-HmuI EGYEEGLVVDHKD...GNKDNNLSTNLRWVTQKINVENQMSR-77 endonuclease I-HmuII GGYEESLVVDHID...RNRHNNHFSNLRWVSRKENSSNISAD-104 I-HmuIII EGYGEDLVVDHID...QDRDNNHCSNLRWVSRKENSNNISAD-103 yosQ YDIPKGMFVNHID...GNKLNNHVRNLEIVTPKENTLHAMKI-102 I-TevIII DSDGRTDEIHHKD...GNRENNDLDNLMCLSIQEHYDIHLAQ-55 ORF253 VT-DKNKYIDHIN...GNPLDNRRNNLRVVSHQENMMNKKTY-192 Group II homing Avi CPMC9 WNIHHIIKRHMGGGDEL.DNLVLLHPNCHRQLH endonuclease Cpc1 CSHC9 IEIDHIIP.KSQGGKDVYDNLQALHRHCHDVKTATD-568 Cpc2 CSEC9 MEVHHIDQNR..GNNKL.SNLTLVHRHCHDIIH PetD INSIP.YELHHILP.KRFGGKDTPNNMVLLCKSPCHQLVSSSI-574 Restriction or McrA CENC14LEVHHVIP.LSSGGADTTDNCVALCPNCHRELHYSK-259 Repair enzyme S.g. mtMSH ICGAPADAVHHIKP6LCNRKLNRRSNLVPVCSSCHLDIHRNK-953 |
Sequence alignment of several H-N-H family proteins in the H-N-H motif region. The dots (.) represent gaps and numbers represent the insertion of amino acids. The first row shows the consensus sequence of the H-N-H motif with the most conserved Asn and His residues displayed in bald with underline. The H-N-H proteins are classified into four sub-groups: (1) Bacterial toxins, including colicins and pyocins; (2) Group I homing endonucleases, including I-HmuI, I-HmuII, I-HmuIII, yosQ, I-TevIII and ORF253; (3) Group II homing endonucleases, including Avi, Cpc and PetD; (4) Restriction or repair enzymes, including McrA and S. g. mtMSH. |
||||||||||||||||||||
¡@
|