Further reading
- Alex Wang et.al, 2019, GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding: https://arxiv.org/pdf/1804.07461.pdf
- Alex Wang et al., 20192, SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems: https://w4ngatang.github.io/static/papers/superglue.pdf
- Tom B. Brown et al., 2020, Language Models are Few-Shot Learners: https://arxiv.org/abs/2005.14165
- Chi Wang et al., 2023, Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference: https://arxiv.org/abs/2303.04673
- Vaswani et al., 2017, Attention Is All You Need: https://arxiv.org/abs/1706.03762
Join our community on Discord
Join our community’s Discord space for discussions with the authors and other readers: