Identifying complex motifs in massive omics data with a variable-convolutional layer in deep neural network
Jing-Yi Li*, Shen Jin*, Xinming Tu, Yang Ding†, and Ge Gao†
Briefings in Bioinformatics, 2021
bbab233
Motif identification is among the most common and essential computational tasks for bioinformatics and genomics. Here we proposed a novel convolutional layer for deep neural network, named variable convolutional (vConv) layer, for effective motif identification in high-throughput omics data by learning kernel length from data adaptively. Empirical evaluations on DNA-protein binding and DNase footprinting cases well demonstrated that vConv-based networks have superior performance to their convolutional counterparts regardless of model complexity. Meanwhile, vConv could be readily integrated into multi-layer neural networks as an ’in-place replacement’ of canonical convolutional layer. All source codes are freely available on GitHub for academic usage.