Topics
The topic for each seminar is listed below:
- February 10: Intro
  - Short seminar, no paper
- February 17: Learning Transferable Visual Models From Natural Language Supervision (CLIP)
  - https://arxiv.org/abs/2103.00020
  - Submit your overview here!
- February 24: Independence Day
  - No seminar
- March 3: ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
  - https://arxiv.org/abs/2102.03334
  - Submit your overview here!
- March 10: Flamingo: a Visual Language Model for Few-Shot Learning
  - https://openreview.net/forum?id=EbMuimAbPbs
  - Submit your overview here!
- March 17: A Generalist Agent
  - https://openreview.net/forum?id=1ikK0kHjvj
  - Submit your overview here!
- March 24: Perceiver IO: A General Architecture for Structured Inputs & Outputs
  - https://arxiv.org/abs/2107.14795
  - Submit your overview here!
- March 31: data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
  - https://arxiv.org/abs/2202.03555
  - Submit your overview here!
- April 7: Good Friday
  - No seminar
- April 14: PaLM-E: An Embodied Multimodal Language Model
  - https://palm-e.github.io/
  - Submit your overview here!
- April 21: Sparks of Artificial General Intelligence: Early experiments with GPT-4
  - https://arxiv.org/abs/2303.12712
  - Submit your overview here!
- April 28: Alpaca and co
  - Multiple papers and blogs, links in the subtopics sheet.
  - Submit your overview here!
- May 5: SpeechT5
  - https://arxiv.org/abs/2110.07205
  - Submit your overview here!
- May 12: VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
  - https://arxiv.org/abs/2104.11178
  - Submit your overview here!
- May 19: ImageBind: One Embedding Space To Bind Them All
  - https://arxiv.org/abs/2305.05665
  - Submit your overview here!
- May 26: ChatGPT and other modalities (Visual ChatGPT, HuggingGPT, etc.)
  - Multiple papers and demos, links in the subtopics sheet.
  - Submit your overview here!