[twitter] snapshot of cutting edge public transformer model research
I am very lucky to have one of the content feed AIs out there give me content that is not random and/or gross. This twitter feed gives me high-quality artificial intelligence research papers, tutorials, etc, new ones almost every day. I hope it keeps up. I see the AK user a lot in this feed. arXiv.org ShapeFormer: Transformer-based Shape Completion via Sparse Representation We present ShapeFormer, a transformer-based network that produces a distribution of object completions, conditioned o... Read more at Twitter https://twitter.com/i/redirect?url=https%3A%2F%2Ftwitter.com%2Fi%2Ftopics%2F... ---------- AK sharedAK shared arXiv.org PONI: Potential Functions for ObjectGoal Navigation with... State-of-the-art approaches to ObjectGoal navigation rely on reinforcement learning and typically require significant... Read more at Twitter https://twitter.com/i/redirect?url=https%3A%2F%2Ftwitter.com%2Fi%2Ftopics%2F... ---------- AK sharedAK shared arXiv.org SPIRAL: Self-supervised Perturbation-Invariant Representation... We introduce a new approach for speech pre-training named SPIRAL which works by learning denoising representation of ... Read more at Twitter https://twitter.com/i/redirect?url=https%3A%2F%2Ftwitter.com%2Fi%2Ftopics%2F... ---------- AK sharedAK shared arXiv.org Improving the fusion of acoustic and text representations in RNN-T The recurrent neural network transducer (RNN-T) has recently become the mainstream end-to-end approach for streaming ... Read more at Twitter https://twitter.com/i/redirect?url=https%3A%2F%2Ftwitter.com%2Fi%2Ftopics%2F... ---------- AK sharedAK shared arXiv.org Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention With recent advancements in voice cloning, the performance of speech synthesis for a target speaker has been rendered... Read more at Twitter https://twitter.com/i/redirect?url=https%3A%2F%2Ftwitter.com%2Fi%2Ftopics%2F... ---------- AK sharedAK shared arXiv.org Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition Audio-visual automatic speech recognition (AV-ASR) extends the speech recognition by introducing the video modality. ... Read more at Twitter https://twitter.com/i/redirect?url=https%3A%2F%2Ftwitter.com%2Fi%2Ftopics%2F...
participants (1)
-
fuzzyTew