Publications

Scientific publications of OPI PIB

All: 275
Per Page:

A Review of the Challenges with Massive Web-Mined Corpora Used in Large Language Models Pre-training

Michał Perełkiewicz, Rafał Poświata

2025 W: Artificial Intelligence and Soft Computing : 23rd International Conference, ICAISC 2024, Zakopane, Poland, June 16–20, 2024, Proceedings, Part III / Leszek Rutkowski, Rafał Scherer, Marcin Korytkowski, Witold Pedrycz, Ryszard Tadeusiewicz, Jacek M. Zurada. - Cham : Springer. - s. 153-156

International Conference on Artificial Intelligence and Soft Computing [ICAISC], Zakopane, 16-20.06.2024

https://link.springer.com/chapter/10.1007/978-3-031-81596-6_14

AI-driven glomerular morphology quantification: a novel pipeline for assessing basement membrane thickness and podocyte foot process effacement in kidney diseases

Michifumi Yamashita, Natalia Piaseczna, Akira Takahashi, Daisuke Kiyozawa, Narihito Tatsumoto, Shohei Kaneko, Natalia Zurek, Arkadiusz Gertych

2025 Computer Methods and Programs in Biomedicine

https://www.sciencedirect.com/science/article/pii/S0169260725002597

Assessing generalization capability of text ranking models in Polish

Sławomir Dadas, Małgorzata Grębowiec

2025 W: Artificial Intelligence and Soft Computing : 23rd International Conference, ICAISC 2024, Zakopane, Poland, June 16–20, 2024, Proceedings, Part I / Leszek Rutkowski, Rafał Scherer, Marcin Korytkowski, Witold Pedrycz, Ryszard Tadeusiewicz, Jacek M. Zurada. - Cham : Springer. - s. 37-49

International Conference on Artificial Intelligence and Soft Computing [ICAISC], Zakopane, 16-20.06.2024

https://link.springer.com/book/10.1007/978-3-031-84353-2

Ludzie Nauki: Data aspects of the Polish national CRIS

Emil Podwysocki, Maria Bylina, Jacek Raczko, Michał Ulaniuk, Marek Michajłowicz, Rafał Jendrzejewski

2025 EPiC Series in Computing. - T. 105, s. 94-107

https://easychair.org/publications/paper/bbdP

MMTEB: Massive Multilingual Text Embedding Benchmark

Kenneth Enevoldsen Kenneth Enevoldsen , Isaac Chung, Imene Kerboua, Rafał Poświata et al.

2025

International Conference on Learning Representations [ICLR], Singapore, 24.04.2025

https://openreview.net/forum?id=zl3pfz4VCV