Live Open Source Explorer
Explore live open-source projects and AI models.
Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.
Live Search
Search live open-source data
Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.
Live Results
GitHub Open Source Repositories
Search: language-data
Page 28
Showing 10 results from 1,706
Azure/ADX-in-a-Day
GitHub · MIT License · Hands-on experience with Azure Data Explorer and Kusto Query Language (KQL)
chewxy/lingo
GitHub · Go · MIT License · package lingo provides the data structures and algorithms required for natural language processing
jiangsongtao/Med-MoE
GitHub · Python · Apache License 2.0 · [EMNLP'24] Code and data for paper "Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models"
bajajvinamr/HacktoberFest2020
GitHub · HTML · Make your first PR! ~ A beginner-friendly repository. Add your profile, a blog, or any program under any language (it can be anything from a hello-world program to a complex data structure algorithm) or update the existing one. Just make sure to add the file under the correct directory. Happy hac...
YicongHong/Discrete-Continuous-VLN
GitHub · Python · MIT License · Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
csebuetnlp/banglanmt
GitHub · Python · This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)...
yueyu1030/AttrPrompt
GitHub · Python · Apache License 2.0 · [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
Ensembl/ensembl-rest
GitHub · Perl · Apache License 2.0 · Language agnostic RESTful data access to Ensembl data over HTTP
uiuc-sst/g2ps
GitHub · HTML · MIT License · Data and code for grapheme-to-phoneme transducers in lots of languages
FreedomIntelligence/crosstalk-generation
GitHub · Python · Apache License 2.0 · Code and data for crosstalk text generation tasks, exploring whether large models and pre-trained language models can understand humor.
10 results on this page · 1,706 total found
Showing first 1,000 accessible GitHub results.