http://www.gdb.org/Dan/proteins/pir.html
PIR is a searchable protein sequence database. The data is organized into entries that contain all the information associated with a particular sequence, including the title, the biological source, references, associated text, and the sequence itself. Some entries contain information from several closely related sequences but only one sequence is explicitly represented in each entry.