Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
malteos's picture
75 11 54

malteos

malteos
katebor's profile picture tomaarsen's profile picture RASMUS's profile picture
·
https://ostendorff.org
  • malteos

AI & ML interests

None yet

Organizations

Open Legal Data's profile picture Spaces-explorers's profile picture Deutsche Telekom AG's profile picture OSCAR's profile picture Scilons Project's profile picture Speech and Language Technology, DFKI's profile picture Just some testing..'s profile picture German Research Center for Artificial Intelligence (DFKI)'s profile picture Common Crawl Foundation's profile picture Calçots's profile picture Occiglot's profile picture Social Post Explorers's profile picture primeLine Research Community's profile picture Stefmal's profile picture

authored 10 papers 6 months ago

Tokenizer Choice For LLM Training: Negligible or Crucial?

Paper • 2310.08754 • Published Oct 12, 2023 • 2

Towards an Open Platform for Legal Information

Paper • 2005.13342 • Published May 27, 2020

Aspect-based Document Similarity for Research Papers

Paper • 2010.06395 • Published Oct 13, 2020

Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings

Paper • 2202.06671 • Published Feb 14, 2022 • 2

Specialized Document Embeddings for Aspect-based Similarity of Research Papers

Paper • 2203.14541 • Published Mar 28, 2022

Investigating Gender Bias in Turkish Language Models

Paper • 2404.11726 • Published Apr 17, 2024 • 1

Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning

Paper • 2301.09626 • Published Jan 23, 2023 • 2

Progress Report: Towards European LLMs

Paper • 2410.03730 • Published Sep 30, 2024 • 3

Data Processing for the OpenGPT-X Model Family

Paper • 2410.08800 • Published Oct 11, 2024 • 1

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 38
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略