May be a little OT, but of general interest.
The Facebook AI Research (FAIR) Speech team is sharing the first fully convolutional speech recognition system. It uses convolutional neural networks (CNNs) for acoustic modeling and language modeling, and is reproducible. The team says that wav2letter++ is composed only of convolutional layers, which yields performance that’s competitive with recurrent architectures.