Publications
- Our paper “Anonymity at Risk? Assessing Re-Identification Capabilities of Large Language Models” was discussed in
“Echo der Zeit” by the Swiss National Radio and Television (SRF) on
June 23rd, 2024.
Publication List
For a complete, up-to-date list, please find me
on Google Scholar.
- Joel Niklaus, Lucia Zheng, Arya D. McCarthy, Christopher Hahn, Brian M. Rosen, Peter Henderson, Daniel E. Ho, Garrett Honke, Percy Liang, Christopher Manning,
“FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning” - Pre-Print
Article
Code
🤗 Dataset
- Dor Bernsohn, Gil Semo, Yaron Vazana, Gila Hayat, Ben Hagag, Joel Niklaus, Rohit Saha, Kyryl Truskovskyi,
“LegalLens: Leveraging LLMs for Legal Violation Identification in Unstructured Text” -
Conference of the European Chapter of the Association for Computational Linguistics (EACL) 2024 oral
Article
Code
🤗 Dataset
- Joel Niklaus, Magda Chodup, Thomas Lüthi, Daniel Kettiger,
“Re-Identifizierung in Gerichtsurteilen mit Simap Daten” - Pre-Print
Article
Code
- Santosh T.Y.S.S, Nina Baumgartner, Matthias Stürmer, Matthias Grabmair, Joel Niklaus,
“Towards Explainability and Fairness in Swiss Judgement Prediction: Benchmarking on a Multilingual Dataset” -
Natural Legal Language Processing Workshop (NLLP) @ International Conference on Empirical Methods in Natural Language Processing (EMNLP)
2023, Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING) 2024 Poster
Article
🤗 Dataset
Slides
- Joel Niklaus, Robin Mamié, Matthias Stürmer, Daniel Brunner, Marcel Gygli,
“Automatic Anonymization of Swiss Federal Supreme Court Rulings” -
Natural Legal Language Processing Workshop (NLLP) @ International Conference on Empirical Methods in Natural Language Processing (EMNLP) 2023
Article
Slides
🤗 Models
- Ramona Christen, Anastassia Shaitarova, Matthias Stürmer, Joel Niklaus, “Resolving Legalese: A Multilingual
Exploration of Negation Scope Resolution in Legal Documents” - Natural Legal Language Processing Workshop (NLLP) @
International Conference on Empirical Methods in Natural Language Processing (EMNLP) 2023, Joint International
Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING) 2024 oral
Article
Code
🤗 Dataset
🤗 Models
Slides
Video
- Neel Guha, Julian Nyarko, Daniel E Ho, Christopher Ré, Adam Chilton, Aditya Narayana, Alex Chohlas-Wood, Austin
Peters, Brandon Waldon, Daniel N Rockmore, Diego Zambrano, Dmitry Talisman, Enam Hoque, Faiz Surani, Frank Fagan,
Galit Sarfaty, Gregory M Dickinson, Haggai Porat, Jason Hegland, Jessica Wu, Joe Nudell, Joel Niklaus, John Nay,
Jonathan H Choi, Kevin Tobia, Margaret Hagan, Megan Ma, Michael Livermore, Nikon Rasumov-Rahe, Nils Holzenberger, Noam
Kolt, Peter Henderson, Sean Rehaag, Sharad Goel, Shang Gao, Spencer Williams, Sunny Gandhi, Tom Zur, Varun Iyer, Zehua
Li, “LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models” - NeurIPS
Datasets and Benchmarks 2023
Article
Code
🤗 Dataset
Poster
- Alex Nyffenegger, Matthias Stürmer, Joel Niklaus, “Anonymity at Risk? Assessing Re-Identification Capabilities of
Large Language Models” - Annual Conference of the North American Chapter of the Association for Computational
Linguistics (NAACL) Findings 2024
Article
Code
🤗 Dataset
- Vishvaksenan Rasiah, Ronja Stern, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho, Joel Niklaus, “
SCALE: Scaling up the Complexity for Advanced Language Model Evaluation” – Natural Legal Language Processing
Workshop (NLLP) @ International Conference on Empirical Methods in Natural Language Processing (EMNLP)
2023, Data-centric Machine Learning Workshop (DMLR) @ International Conference on Learning Representations
(ICLR) 2023
Article
🤗 Dataset
🤗 Models
Poster
Slides
- Joel Niklaus, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho, “MultiLegalPile: A 689GB
Multilingual Legal Corpus” – Data-centric Machine Learning Research Workshop (DMLR) @ International Conference on
Machine Learning (ICML) 2023, Natural Legal Language Processing Workshop (NLLP) @ International Conference on
Empirical Methods in Natural Language Processing (EMNLP) 2023,
Annual Meeting of the Association for Computational Linguistics (ACL) 2024 oral and
Outstanding Paper Award (top 2% of accepted papers)
Article
Dataset Code
Pretraining Code
🤗 Dataset
🤗 Models
Slides
Poster
Video
- Joel Niklaus, Veton Matoshi, Pooja Rani, Andrea Galassi, Matthias Stürmer, Ilias Chalkidis,
“LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain” –
Natural Legal Language Processing Workshop (NLLP) @ International Conference on Empirical Methods in Natural Language Processing (EMNLP) 2023 & EMNLP Findings 2023
Article
Code
🤗 Dataset
Slides
Video
- Tobias Brugger, Matthias Stürmer, Joel Niklaus,
“MultiLegalSBD: A Multilingual Legal Sentence Boundary Detection Dataset” –
International Conference on AI and Law (ICAIL) 2023
Article
Code
🤗 Dataset
🤗 Models
Slides
- Joel Niklaus, Daniele Giofré, “BudgetLongformer: Can we Cheaply Pretrain a SotA Legal Language Model From
Scratch?” – Efficient Natural Language and Speech Processing Workshop (ENLSP) @ Neural Information Processing
Systems (NeurIPS) 2022 & Workshop on Simple and Efficient Natural Language Processing (SustaiNLP) @ Annual
Meeting of the Association for Computational Linguistics (ACL) 2023
Article
Poster
🤗 Models
Video
- Gil Semo, Dor Bernsohn, Ben Hagag, Gila Hayat, Joel Niklaus, “ClassActionPrediction: A Challenging Benchmark for
Legal Judgment Prediction of Class Action Cases in the US” – Natural Legal Language Processing Workshop (NLLP) @
International Conference on Empirical Methods in Natural Language Processing (EMNLP) 2022
Article
Code
🤗 Dataset
- Joel Niklaus, Matthias Stürmer, Ilias Chalkidis,
“An Empirical Study on Cross-X Transfer for Legal Judgment Prediction” –
Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (AACL) and
International Joint Conference on Natural Language Processing (IJCNLP) 2022
Article
Code
Slides
Video
- Joel Niklaus, Ilias Chalkidis, Matthias Stürmer,
“Swiss-Judgment-Prediction: A Multilingual Legal Judgment Prediction Benchmark” –
Natural Legal Language Processing Workshop (NLLP) @ International Conference on Empirical Methods in Natural Language Processing (EMNLP) 2021
Article
Code
🤗 Dataset
Slides
Video
- Christine Krebs, Michael Falkner, Joel Niklaus, Luca Persello, Stefan Klöppel, Tobias Nef, Prabitha Urwyler,
“Application of Eye Tracking in Puzzle Games for Adjunct Cognitive Markers: Pilot Observational Study in Older Adults”
– JMIR serious games 9
Article
- Joel Niklaus, Michele Alberti, Rolf Ingold, Markus Stolze, Thomas Koller,
“Challenging Human Supremacy: Evaluating Monte Carlo Tree Search and Deep Learning for the Trick Taking Card Game Jass” –
Reinforcement Learning in Games (RLG) @ Association for the Advancement of Artificial Intelligence (AAAI) 2020 &
International Conference on Artificial Intelligence and Soft Computing (ICAISC) 2020
Article
Code
Poster
- Joel Niklaus, Michele Alberti, Vinay Pondenkandath, Rolf Ingold, Marcus Liwicki,
“Survey of Artificial Intelligence for Card Games and Its Application to the Swiss Game Jass” –
Swiss Conference on Data Science (SDS) 2019
Article
- Zhongliang Zhao, Jose Carrera, Joel Niklaus, Torsten Braun,
“Machine Learning-Based Real-Time Indoor Landmark Localization” –
International Conference on Wired/Wireless Internet Communication (WWIC) 2018
Article
Memberships
Reviewing
- ARR August 2024
- NeurIPS 2024
- ARR June 2024
- ARR April 2024
- ARR February 2024
- ARR December 2023
- NLLP Workshop 2023
- ARR August 2023
- MLLD Workshop 2023
- ARR April 2022
- ARR February 2022