Due to the enormous amount of information available on the Internet, extracting and classifying it has become one of the most important tasks. This principle is valid also while searching for scientific publications. This paper describes a system able to retrieve scientific publications from the Web throughout a text categorization process. To this end, a generic multiagent architecture has been customized according to the requirements imposed by the specific task. Experiments have been performed on publications extracted from BMC Bioinformatics and PubMed digital archives.