1 Introduction
Tnteractions between proteins and DNAs play a crucial role in a wide variety of biological processes, such as, DNA replication, recombination, repair, gene transcription and expression [1], [2], [3]. Hence, the accurate prediction of protein-DNA binding residues contributes to elaborate the interaction mechanism of them, and facilitate our understanding of these biological processes. Traditionally, protein-DNA binding residues can be identified by experimental techniques, such as electrophoretic mobility shift assays (EMSAs) [4], [5], Fast ChIP [6], and X-ray crystallography [7]. However, these techniques are time-consuming and laborious. With the rapid advance of protein sequencing technology, a large amount of unannotated protein-DNA complexes is sequenced and deposited. Therefore, there is an urgent need to develop computational methods that can rapidly and reliably identify DNA-binding residues from protein sequences.