Titelbild von RapidataRapidata
Rapidata

Rapidata

Softwareentwicklung

Fast and reliable data labeling that powers your AI.

Info

We use the intelligence of the masses to label your dataset. This enables us to rapidly and cost effectively label large datasets.

Website
https://rapidata.ai/
Branche
Softwareentwicklung
Größe
2–10 Beschäftigte
Hauptsitz
Zürich
Art
Privatunternehmen

Orte

Beschäftigte von Rapidata

Updates

  • Rapidata hat dies direkt geteilt

    Profil von Jason Corkill anzeigen

    Founder & CEO @ Rapidata | Instant human intelligence for AI at scale

    No one actually knows if a model is good. OpenAI just released their new 4o image model (wtf is that naming, just call it Dall-E 4 already) The only way to truly judge a genAI model is with shit tons of humans. Luckily I am in the "shit ton of humans" business. In just one day we collected data from 200'000 humans on how this new models stacks up. We present the first ever independent large scale benchmark of the new model! They crushed! Black Forest Labs Flux model has finally been dethroned!

    • Kein Alt-Text für dieses Bild vorhanden
  • Unternehmensseite für Rapidata anzeigen

    1.228 Follower:innen

    🚀 First Benchmark of OpenAI's 4o Image Generation Model! We've just completed the first-ever (to our knowledge) benchmarking of the new OpenAI 4o image generation model, and the results are impressive! In our tests, OpenAI 4o image generation absolutely crushed  leading competitors, including Black Forest Labs, Google, xAI, Ideogram, Recraft, and DeepSeek AI, in prompt alignment and coherence! They hold a gap of more than 20% to the nearest competitor in terms of Bradley-Terry score, the biggest we have seen since the beginning of the benchmark! The benchmarks are based on 200k human responses collected through our API. However, the most challenging part wasn't the benchmarking itself, but generating and downloading the images: - 5 hours to generate 1000 images (no API available yet) - Just 10 minutes to set up and launch the benchmark - Over 200,000 responses rapidly collected While generating the images, we faced some hurdles that meant that we had to leave out certain parts of our prompt set. Particularly we observed that the OpenAI 4o model proactively refused to generate certain images: 🚫 Styles of living artists: completely blocked 🚫 Copyrighted characters (e.g., Darth Vader, Pokémon): initially generated but subsequently blocked Overall, OpenAI 4o stands out significantly in alignment and coherence, especially excelling in certain unusual prompts that have historically caused issues such as: 'A chair on a cat.' See the images for more examples!

    • Kein Alt-Text für dieses Bild vorhanden
    • Kein Alt-Text für dieses Bild vorhanden
    • Kein Alt-Text für dieses Bild vorhanden
    • Kein Alt-Text für dieses Bild vorhanden
  • Unternehmensseite für Rapidata anzeigen

    1.228 Follower:innen

    Perhaps this shows that there is a space for more specialized AI models next to foundation models.

    Profil von Jason Corkill anzeigen

    Founder & CEO @ Rapidata | Instant human intelligence for AI at scale

    Europe can do more than just unremovable bottlecaps! Our AI industry is often belittled, but one of our earliest players, DeepL from Germany, is still sticking it to the bajillion dollar funded tech giants. Their models significantly outperform the top LLMs (even chain-of-thought LLMs) at translating text, one of the most fundamental task for any AI. It shows that their models have a better understanding of the structures of language and the relationship of words in sentences. We use DeepL to translate tasks for our annotators, its a rather costly service and we could use our cloud credits to use LLMs like Deepseek-R1, Llama or Mistral for free. We used our own annotation service, Rapidata, to gather over 51'000 votes from native speakers and realized that the translation quality from DeepL was significantly better. Translation quality is super important for the perceived quality of your product, so we are sticking to the premium option.

    • Kein Alt-Text für dieses Bild vorhanden
  • Unternehmensseite für Rapidata anzeigen

    1.228 Follower:innen

    OpenAI Is Falling? The Unthinkable Shift in AI Dominance It happened in the span of a single hour: over 30,000 human annotations poured in, revealing the truth ... The Runway’s Gen3 Alpha are now outperforming OpenAI’s own Sora in style and coherence But ... what does this really mean? For an AI model to rise to the top, it needs more than brilliant engineering and computational power Most critically is access to high-quality data which is currently a bottleneck in AI, as even OpenAI Co-Founder Ilya Sutskever said ... Even though Harvard University’s Institutional Data Initiative (IDI) has released a trove of nearly 1 million public-domain books, it might not be enough to fully satisfy the ever-growing hunger for diverse Yes, this helps, but how much? Rapidata helps AI companies to laverage more than 25M human anotators, delivering answers and insights within minutes. As AI companies scramble to refine their models, Rapidata’s on-demand data annotation could be the key to unlocking new levels of performance And with the AI landscape shifting faster than ever, whoever masters the data pipeline may well decide the fate of AI’s next chapter. Hold on tight ...

    • Kein Alt-Text für dieses Bild vorhanden
  • Rapidata hat dies direkt geteilt

    Profil von Daniel van Strien anzeigen

    Machine Learning Librarian@🤗 | Championing Open Science & Machine Learning

    Three important datasets recently shared on the @huggingface Hub: **Bespoke-Stratos-17k: Reasoning Distillation at Scale** from Mahesh (Maheswaran) Sathiamoorthy What: A synthetic reasoning dataset (17k examples) with questions, reasoning traces, and answers. Built by refining the Berkeley Sky-T1 pipeline using DeepSeek-R1. Why It Matters: Powers two high-performing models (32B https://lnkd.in/e4NfEABj and 7B https://lnkd.in/et23TwHB) that outperform predecessors like Sky-T1-32B on benchmarks like MATH500 and LiveCodeBench. Innovation: Generated in 1.5 hours using the Bespoke Curator https://lnkd.in/eKZYzz8K tool, with a 73% retention rate for correct solutions via GPT-4o-mini filtering. Showing again that data is 🔑 🔗 https://lnkd.in/eMC_wCTY **Rapidata Video Generation Preference Dataset** from Rapidata What: AI-generated video dataset with Likert scale ratings (1-5) from ~6,000 human evaluators assessing visual appeal. Evaluators rated from "Strongly Dislike" (1) to "Strongly Like" (5), blind to generation prompts. Why It Matters: Creates benchmark for human preference data in video generation models. Scores incorporate evaluator userScore for weighted ratings. Innovation: Rapid data collection via Rapidata Python API. Videos accessible as full MP4s under Files with GIF previews in dataset viewer. Open API enables scalable annotation projects. 🔗 https://lnkd.in/eq9PqW9y *Polite Guard: Politeness Classification Benchmark* from Intel Corporation What: 100k synthetic samples (50k few-shot, 50k Chain-of-Thought) plus 200 annotated corporate training samples for classifying text into polite, somewhat polite, neutral, and impolite categories. Why It Matters: First benchmark for context-aware politeness classification. Provides defense against adversarial inputs and enhances customer service AI interactions. Innovation: Multi-model generation using Llama 3.1, Gemma 2, and Mixtral. Complete reproducibility with code, training pipeline, and fine-tuning guide at https://lnkd.in/egZgZH5k. 🔗 https://lnkd.in/e5-S2SNb

  • Unternehmensseite für Rapidata anzeigen

    1.228 Follower:innen

    Our vision seems aligned with one of the brightest minds in AI (Ilya Sutskever). That is one hell of a stamp of approval (and especially nice if you aren't competing).

    Profil von Jason Corkill anzeigen

    Founder & CEO @ Rapidata | Instant human intelligence for AI at scale

    No GPT-5 any time soon. AI has hit a (temporary?) plateau. The whole internet has been harvested and fed into the hungry hippo that are LLM weights. But now what? This is exactly the scenario I hypothesized in a pitch at Swissnex in San Francisco a few months ago. Ilya the ex-co-founder of Open-AI, who just raised a cool billion with a B in the SSI pre-seed round, just confirmed this theory. Comparing the first two slides from our pitch with Ilyas shows how aligned the visions are. But now what? We need more sources of information, not just the internet. The largest source of knowledge out there besides the internet is the information stored in the brains of all humans. But, there is no way to systematically access it. This is where I see Rapidata's long term opportunity. We become the interface to humanities greatest store of intelligence and wisdom. On one side a programmatic API, to be used by Agents and Engineers. On the other side we can already theoretically target and reach 1/3 of the worlds population.

    • Kein Alt-Text für dieses Bild vorhanden
  • Unternehmensseite für Rapidata anzeigen

    1.228 Follower:innen

    🚨 Google crushes the competition: In our updated text-to-image benchmark, Imagen 3 beats ALL other models*** ***at generating a person under a car. In the rest of the examples not so much. We just collected 1,250,000 additional human preferences to include Imagen 3 and Flux1.1 in our leaderboard of best text-to-image AI models. The results show that Google have stepped up their game a bit with Imagen 3, however, they are still behind both the current and last generation of Flux. Interestingly enough, the latest version of Flux, 1.1, also performs worse than the previous generation. Perhaps the model regressed slightly with the improvement in efficiency? We have made all the images and preference data freely available on our Huggingface page - check it out through the links in the comments!

    • Imagen 3
    • Stable Diffusion 3
    • Imagen 3
    • Imagen 3
    • Imagen 3

Ähnliche Seiten

Finanzierung

Rapidata Insgesamt 5 Finanzierungsrunden

Letzte Runde

Zuschuss

1.231.768,00 $

Weitere Informationen auf Crunchbase