Biapy's Bookmarks
Links
Lists
Tags
Login
Tag
Feed
dataset
Open Links in Tabs
Order by
Oldest
Newest
URL A-Z
URL Z-A
Title A-Z
Title Z-A
Random
CredData (Credential Dataset)
https://github.com/Samsung/CredData
apache2-licensed
dataset
foss
open-data
open-source
secret
security
Added
1 month ago
Share this Link
Share link via Email
Share link via Facebook
Share link via Twitter
Share link via Reddit
Share link via Pinterest
Share link via Whatsapp
Share link via Telegram
Share link via WeChat
Share link via SMS
Share link via Skype
Share link via sharing.service.bluesky
Share link via Hacker News
Share link via Mastodon
Share link via Flipboard
Share link via Evernote
Share link via Trello
Share link via Buffer
Share link via Tumblr
Share link via Xing
Share link via LinkedIn
Parquet
https://parquet.apache.org/
apache2-licensed
columnar
data-science
dataset
format
foss
open-source
parquet
Added
2 months ago
Share this Link
Share link via Email
Share link via Facebook
Share link via Twitter
Share link via Reddit
Share link via Pinterest
Share link via Whatsapp
Share link via Telegram
Share link via WeChat
Share link via SMS
Share link via Skype
Share link via sharing.service.bluesky
Share link via Hacker News
Share link via Mastodon
Share link via Flipboard
Share link via Evernote
Share link via Trello
Share link via Buffer
Share link via Tumblr
Share link via Xing
Share link via LinkedIn
ecosyste.ms
https://ecosyste.ms/
commercial
dataset
open-source
web-service
Added
5 months ago
Share this Link
Share link via Email
Share link via Facebook
Share link via Twitter
Share link via Reddit
Share link via Pinterest
Share link via Whatsapp
Share link via Telegram
Share link via WeChat
Share link via SMS
Share link via Skype
Share link via sharing.service.bluesky
Share link via Hacker News
Share link via Mastodon
Share link via Flipboard
Share link via Evernote
Share link via Trello
Share link via Buffer
Share link via Tumblr
Share link via Xing
Share link via LinkedIn
Virtual Cell Atlas
https://arcinstitute.org/tools/virtualcellatlas
data-science
dataset
open-data
Added
1 year ago
Share this Link
Share link via Email
Share link via Facebook
Share link via Twitter
Share link via Reddit
Share link via Pinterest
Share link via Whatsapp
Share link via Telegram
Share link via WeChat
Share link via SMS
Share link via Skype
Share link via sharing.service.bluesky
Share link via Hacker News
Share link via Mastodon
Share link via Flipboard
Share link via Evernote
Share link via Trello
Share link via Buffer
Share link via Tumblr
Share link via Xing
Share link via LinkedIn
FineWeb
https://huggingface.co/datasets/HuggingFaceFW/fineweb
ai
dataset
llm
machine-learning
open-source
Added
1 year ago
Share this Link
Share link via Email
Share link via Facebook
Share link via Twitter
Share link via Reddit
Share link via Pinterest
Share link via Whatsapp
Share link via Telegram
Share link via WeChat
Share link via SMS
Share link via Skype
Share link via sharing.service.bluesky
Share link via Hacker News
Share link via Mastodon
Share link via Flipboard
Share link via Evernote
Share link via Trello
Share link via Buffer
Share link via Tumblr
Share link via Xing
Share link via LinkedIn
Address Database
https://netsyms.com/gis/addresses
commercial
database
dataset
geolocation
sqlite
Added
1 year ago
Share this Link
Share link via Email
Share link via Facebook
Share link via Twitter
Share link via Reddit
Share link via Pinterest
Share link via Whatsapp
Share link via Telegram
Share link via WeChat
Share link via SMS
Share link via Skype
Share link via sharing.service.bluesky
Share link via Hacker News
Share link via Mastodon
Share link via Flipboard
Share link via Evernote
Share link via Trello
Share link via Buffer
Share link via Tumblr
Share link via Xing
Share link via LinkedIn
Common Corpus
https://huggingface.co/datasets/PleIAs/common_corpus
dataset
foss
llm
mozilla
open-source
Added
1 year ago
Share this Link
Share link via Email
Share link via Facebook
Share link via Twitter
Share link via Reddit
Share link via Pinterest
Share link via Whatsapp
Share link via Telegram
Share link via WeChat
Share link via SMS
Share link via Skype
Share link via sharing.service.bluesky
Share link via Hacker News
Share link via Mastodon
Share link via Flipboard
Share link via Evernote
Share link via Trello
Share link via Buffer
Share link via Tumblr
Share link via Xing
Share link via LinkedIn
LAION-5B
https://laion.ai/blog/laion-5b/
data-science
dataset
machine-learning
stable-diffusion
Added
2 years ago
Share this Link
Share link via Email
Share link via Facebook
Share link via Twitter
Share link via Reddit
Share link via Pinterest
Share link via Whatsapp
Share link via Telegram
Share link via WeChat
Share link via SMS
Share link via Skype
Share link via sharing.service.bluesky
Share link via Hacker News
Share link via Mastodon
Share link via Flipboard
Share link via Evernote
Share link via Trello
Share link via Buffer
Share link via Tumblr
Share link via Xing
Share link via LinkedIn
RedPajama-Data
https://github.com/togethercomputer/RedPajama-Data
dataset
llama
llm
machine-learning
open-source
Added
2 years ago
Share this Link
Share link via Email
Share link via Facebook
Share link via Twitter
Share link via Reddit
Share link via Pinterest
Share link via Whatsapp
Share link via Telegram
Share link via WeChat
Share link via SMS
Share link via Skype
Share link via sharing.service.bluesky
Share link via Hacker News
Share link via Mastodon
Share link via Flipboard
Share link via Evernote
Share link via Trello
Share link via Buffer
Share link via Tumblr
Share link via Xing
Share link via LinkedIn
Kaggle
https://www.kaggle.com/
ai
data-science
dataset
jupyter
machine-learning
open-data
Added
4 years ago
Share this Link
Share link via Email
Share link via Facebook
Share link via Twitter
Share link via Reddit
Share link via Pinterest
Share link via Whatsapp
Share link via Telegram
Share link via WeChat
Share link via SMS
Share link via Skype
Share link via sharing.service.bluesky
Share link via Hacker News
Share link via Mastodon
Share link via Flipboard
Share link via Evernote
Share link via Trello
Share link via Buffer
Share link via Tumblr
Share link via Xing
Share link via LinkedIn
Keshif
https://keshif.me/
commercial
data-analytics
dataset
open-source
web-app
Added
10 years ago
Share this Link
Share link via Email
Share link via Facebook
Share link via Twitter
Share link via Reddit
Share link via Pinterest
Share link via Whatsapp
Share link via Telegram
Share link via WeChat
Share link via SMS
Share link via Skype
Share link via sharing.service.bluesky
Share link via Hacker News
Share link via Mastodon
Share link via Flipboard
Share link via Evernote
Share link via Trello
Share link via Buffer
Share link via Tumblr
Share link via Xing
Share link via LinkedIn