Abstract:
Protein design involves searching over a large combinatorial sequence space. Evaluating the fitness of new protein sequences often requires wet-lab experiments that are c...Show MoreMetadata
Abstract:
Protein design involves searching over a large combinatorial sequence space. Evaluating the fitness of new protein sequences often requires wet-lab experiments that are costly and time consuming. In this paper we propose a neural bandits algorithm that utilizes a modified upper-confidence bound algorithm for accelerating the search for optimal designs. The algorithm makes adaptive queries as guided by the kernelized neural bandits. The algorithm is tested on two public protein fitness datasets, the GB1 and WW domain. For both datasets, our algorithm consistently identifies top-fitness protein sequences. Notably, this approach finds a diverse and rich class of high fitness proteins using substantially fewer design queries compared to a range of alternative methods.
Date of Conference: 09-11 March 2022
Date Added to IEEE Xplore: 14 April 2022
ISBN Information: