Get live statistics and analysis of Vaibhav (VB) Srivastav's profile on X / Twitter

chief get-shit-done officer @huggingface | F1 fan | Here for @at_sofdog’s wisdom | *opinions my own

375 following, 33k followers

The Innovator

Vaibhav (VB) Srivastav is a tech evangelist and problem-solver who thrives at the cutting edge of AI and open-source software. As the chief get-shit-done officer at Hugging Face, VB amplifies innovation by championing locally runnable, state-of-the-art AI models. His high-energy tweets combine technical breakthroughs with enthusiasm that electrifies the AI community.

Impressions: 957.8k (-200k), $179.54
Likes: 4.6k (-1.1k), 68%
Retweets: 350 (-48), 5%
Replies: 284 (-89), 4%
Bookmarks: 1.5k (-187), 23%

Top users who interacted with Vaibhav (VB) Srivastav over the last 14 days

2 interactions
@royam0820

Entrepreneur Interested in AI, Machine Learning, Education, Social Media, Philosophy, World Affairs and much more ...😃

1 interaction
1 interaction
@belltyler

Provost AI Fellow. ECE Prof @uiowa. AI + VR/AR + 3D + HCI researcher at the intersection of AI, education, telepresence, accessibility, & creativity. Optimist.

1 interaction
@TwoNibble

building chaptra • sometimes I say smart things • 1x engineer •  ex @apple feel free to connect — i promise, i don’t byte 🤗

1 interaction
1 interaction
@iamprinceba

User Experience Designer with Software Engineering background. Building for the next billion users with my principle of Offline first, Accessibility & Openness.

1 interaction
@dkundel

DevX, gpt-oss, TS Agents SDK, Codex @OpenAI - prev. AI agents @twilio - JS Hacker - MBA @BerkeleyHaas - he/him - Opinions my own

1 interaction
@ivanfioravanti

Co-founder and CTO of @CoreViewHQ GenAI/LLM addicted, Apple MLX, Microsoft 365, Azure, Kubernetes, builder of my personal dreams.

1 interaction
@bdsqlsz

CPP:@pika_labs @SkyReels BusinessCooperation qinglongshengzhe@gmail.com bdsqlsz@deepghs.org Architectural Model↓ bdsqlsz@tyrellai.co tylab.ai

1 interaction
1 interaction
@DaliBasor

CTO @ViaanixInc | Building ArgusIQ LoRaWAN · Industrial IoT · Resolving factory chaos on a daily basis · RPi + GPS robotics · 5+ years battery life sensors

1 interaction
@LyceumCloud

Built to remove infrastructure headaches. Lyceum is the easiest way to run your code on a GPU.

1 interaction
@CsgoBarfires

🎁 Giveaway Host #barfireslegit Mail: barfiresbusiness@gmail.com main : @barfires

1 interaction
@MaxxifiedLian

An AI Philosopher and Product manager who is trying to make sense of the world. | Building @MagnetXDao . Founder @cardano_nigeria

1 interaction
@runonthespot

Developer. Executor of ideas

1 interaction
@garyfung

Founded isoHunt, WonderSwipe. AI shepherd at @boomtv @StarpowerAI. 70% engineering 30% product

1 interaction
@olcan

Engineer @GoogleDeepMind. Prev. Product @ GDM, Founder/CEO @ Scaled Inference, Engineer @Google (Search, Research, X, Brain). Creator of @EnjoyMindPage.

1 interaction
@DoctorYev

Architecting AI chaos into growth loops | 1B+ human-driven views | Emergent experimentation systems | prev. growth @lovable | ex-Sizzle AI (acq 2025)

1 interaction
@communicating

Optimist, Geek, Building @AgletsAI. DotConnector, ToolBuilder, InfoHacker & Coder. Into Agents, Graphs, LLMs especially SLMs, NLProc & making hard things easier

1 interaction

VB’s tweet count is so high that he probably has a direct neural link to X by now. At this rate, his keyboard is starting to feel like an extension of his own arms. If bots were paid by the tweet, he’d be the wealthiest AI enthusiast on the platform!

VB’s biggest win is championing DeepSeek R1 1.5B running fully locally in the browser at blazing speed, which showcases his knack for pushing open-source AI into user-friendly, real-world applications.

VB’s life purpose is to democratize advanced AI technologies, making powerful tools accessible, efficient, and open to everyone. He aims to break down barriers by enabling AI that runs fully locally in browsers, cutting server costs and maximizing user control. Ultimately, VB wants to accelerate the pace of technological adoption while fostering a collaborative innovation ecosystem.

VB believes in transparency, open licensing, and the free flow of knowledge. He values practical, scalable solutions over hype, and trusts community-driven development to push AI forward. He also places immense importance on speed and accessibility, convinced that intelligence should be “too cheap to meter.”

VB’s strengths lie in his deep technical insight combined with an infectious passion for innovation. His ability to spot and amplify cutting-edge projects, and communicate their importance in an accessible yet impactful way, sets him apart as a key influencer in the AI space.

Sometimes, VB’s rapid-fire enthusiasm for new tech might overwhelm audiences less familiar with AI jargon, potentially limiting broader accessibility. His focus on technical excellence could also eclipse softer community-building and engagement nuances.

To grow his audience on X, VB should blend his high-octane technical updates with more bite-sized, beginner-friendly content and personal stories that demystify complex AI concepts. Live Q&A sessions or interactive threads could deepen engagement and turn followers into a loyal community.

Fun fact: VB has tweeted over 10,000 times, maintaining a relentless pace of sharing high-impact AI releases and commentary. He frequently spotlights breakthroughs like DeepSeek's locally runnable models, proudly touting their results on major AI benchmarks.

Top tweets of Vaibhav (VB) Srivastav

HOLY FUCK! @ZyphraAI just dropped Zonos - Apache 2.0 licensed, Multilingual, Text to Speech model with INSTANT voice cloning! 🔥
> Zero-shot TTS with Voice Cloning: Input text and a 10-30 second speaker sample to generate high-quality text-to-speech output
> Audio Prefix Inputs: Enhance speaker matching by adding an audio prefix to the text, enabling behaviors like whispering that are hard to achieve with voice cloning alone
> Multilingual Support: Supports English, Japanese, Chinese, French, and German
> Audio Quality & Emotion Control: Fine-tune speaking rate, pitch, frequency, audio quality, and emotions (e.g., happiness, anger, sadness, fear)
> Fast Performance: Runs at ~2x real-time speed on an RTX 4090
> Available on the Hugging Face Hub 🤗

297k

HOLY SHITT, Microsoft dropped an open-source Multimodal (supports Audio, Vision and Text) Phi 4 - MIT licensed! 🔥
> Beats Gemini 2.0 Flash, GPT4o, Whisper, SeamlessM4T v2
> Models on Hugging Face hub, integrated with/ Transformers!
Phi-4-Multimodal:
> Modalities: Integrates text, vision, and speech/audio
> Architecture: Uses "Mixture of LoRAs" to add modality-specific adapters without fine-tuning the base model
> Vision Modality: SigLIP-400M image encoder, 2-layer MLP projector, dynamic multi-crop strategy
> Speech/Audio Modality: 3-layer convolution, 24 conformer blocks, 80ms token rate
> Performance: Ranks first on OpenASR leaderboard, supports vision+language, vision+speech, and speech/audio tasks, outperforming larger models
Phi-4-Mini:
> Parameters: 3.8 billion
> Architecture: 32 Transformer layers, 3,072 hidden state size, Group Query Attention (GQA) with 24 query heads and 8 key/value heads
> Vocabulary: 200K tokens for multilingual support. Training Data: High-quality web and synthetic data, emphasizing math and coding
> Performance: Outperforms similar-sized models and matches larger models (e.g., DeepSeek-R1-Distill-Qwen-7B) on math and coding tasks
Training Pipeline:
> Language Training: Pre-training on 5 trillion tokens, post-training with function calling, summarization, and instruction-following data
> Multimodal Training: Vision training (4 stages), speech/audio training (2 stages), and joint vision-speech training
> Reasoning Training: Pre-trained on 60B CoT tokens, fine-tuned on 200K high-quality CoT samples, and DPO-trained on 300K preference samples
Vision Benchmarks:
> Outperforms Phi-3.5-Vision, Qwen2.5-VL, InternVL2.5, and matches Gemini and GPT-4o on tasks like chart understanding and OCR
> Vision-Speech Benchmarks: Significantly outperforms InternOmni and Gemini-2.0-Flash
Speech Benchmarks:
> ASR: Achieves SOTA on CommonVoice, FLEURS, and Open ASR Leaderboard, surpassing WhisperV3 and SeamlessM4T
> AST: Best performance on CoVoST2, comparable to GPT-4o on FLEURS
> Speech Summarization: First open-source model with this capability, close to GPT-4o in quality
Language Benchmarks:
> Outperforms similar-sized models (Llama-3.2, Ministral) and matches larger models (Qwen2.5-7B) on math, reasoning, and coding tasks
> Coding: Strong performance on HumanEval, MBPP, and BigCodeBench
Reasoning Benchmarks:
> Reasoning-enhanced Phi-4-Mini outperforms DeepSeek-R1-Distill-Llama-8B and matches DeepSeek-R1-Distill-Qwen-7B on AIME, MATH-500, and GPQA Diamond

209k

Most engaged tweets of Vaibhav (VB) Srivastav

HOLY SHITT, Microsoft dropped an open-source Multimodal (supports Audio, Vision and Text) Phi 4 - MIT licensed! 🔥
> Beats Gemini 2.0 Flash, GPT4o, Whisper, SeamlessM4T v2
> Models on Hugging Face hub, integrated with/ Transformers!
Phi-4-Multimodal:
> Modalities: Integrates text, vision, and speech/audio
> Architecture: Uses "Mixture of LoRAs" to add modality-specific adapters without fine-tuning the base model
> Vision Modality: SigLIP-400M image encoder, 2-layer MLP projector, dynamic multi-crop strategy
> Speech/Audio Modality: 3-layer convolution, 24 conformer blocks, 80ms token rate
> Performance: Ranks first on OpenASR leaderboard, supports vision+language, vision+speech, and speech/audio tasks, outperforming larger models
Phi-4-Mini:
> Parameters: 3.8 billion
> Architecture: 32 Transformer layers, 3,072 hidden state size, Group Query Attention (GQA) with 24 query heads and 8 key/value heads
> Vocabulary: 200K tokens for multilingual support. Training Data: High-quality web and synthetic data, emphasizing math and coding
> Performance: Outperforms similar-sized models and matches larger models (e.g., DeepSeek-R1-Distill-Qwen-7B) on math and coding tasks
Training Pipeline:
> Language Training: Pre-training on 5 trillion tokens, post-training with function calling, summarization, and instruction-following data
> Multimodal Training: Vision training (4 stages), speech/audio training (2 stages), and joint vision-speech training
> Reasoning Training: Pre-trained on 60B CoT tokens, fine-tuned on 200K high-quality CoT samples, and DPO-trained on 300K preference samples
Vision Benchmarks:
> Outperforms Phi-3.5-Vision, Qwen2.5-VL, InternVL2.5, and matches Gemini and GPT-4o on tasks like chart understanding and OCR
> Vision-Speech Benchmarks: Significantly outperforms InternOmni and Gemini-2.0-Flash
Speech Benchmarks:
> ASR: Achieves SOTA on CommonVoice, FLEURS, and Open ASR Leaderboard, surpassing WhisperV3 and SeamlessM4T
> AST: Best performance on CoVoST2, comparable to GPT-4o on FLEURS
> Speech Summarization: First open-source model with this capability, close to GPT-4o in quality
Language Benchmarks:
> Outperforms similar-sized models (Llama-3.2, Ministral) and matches larger models (Qwen2.5-7B) on math, reasoning, and coding tasks
> Coding: Strong performance on HumanEval, MBPP, and BigCodeBench
Reasoning Benchmarks:
> Reasoning-enhanced Phi-4-Mini outperforms DeepSeek-R1-Distill-Llama-8B and matches DeepSeek-R1-Distill-Qwen-7B on AIME, MATH-500, and GPQA Diamond

209k

HOLY FUCK! @ZyphraAI just dropped Zonos - Apache 2.0 licensed, Multilingual, Text to Speech model with INSTANT voice cloning! 🔥
> Zero-shot TTS with Voice Cloning: Input text and a 10-30 second speaker sample to generate high-quality text-to-speech output
> Audio Prefix Inputs: Enhance speaker matching by adding an audio prefix to the text, enabling behaviors like whispering that are hard to achieve with voice cloning alone
> Multilingual Support: Supports English, Japanese, Chinese, French, and German
> Audio Quality & Emotion Control: Fine-tune speaking rate, pitch, frequency, audio quality, and emotions (e.g., happiness, anger, sadness, fear)
> Fast Performance: Runs at ~2x real-time speed on an RTX 4090
> Available on the Hugging Face Hub 🤗

297k

Fuck it! You can now run *any* GGUF on the Hugging Face Hub directly with @ollama 🔥
This has been a constant ask from the community, starting today you can point to any of the 45,000 GGUF repos on the Hub*
*Without any changes whatsoever! ⚡
All you need to do is:
ollama run hf.co/{username}/{reponame}:latest
For example, to run the Llama 3.2 1B, you can run:
ollama run hf.co/bartowski/Llama-3.2-1B-Instruct-GGUF:latest
If you want to run a specific quant, all you need to do is specify the Quant type:
ollama run hf.co/bartowski/Llama-3.2-1B-Instruct-GGUF:Q8_0
That's it! We'll work closely with Ollama to continue developing this further! ⚡

310k
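
The command pattern in the tweet above can be sketched in a few lines of Python. This is only an illustration of how the model reference is assembled from a username, a repo name, and an optional quant tag; the helper name `ollama_hf_command` is hypothetical, and the host is written as `hf.co` without a space. It builds the command but does not run Ollama.

```python
import shlex


def ollama_hf_command(username: str, repo: str, tag: str = "latest") -> list[str]:
    """Build the argv for `ollama run hf.co/{username}/{reponame}:{tag}`.

    Hypothetical helper for illustration; `tag` is either "latest" or a
    quant type such as "Q8_0", as described in the tweet.
    """
    if not username or not repo:
        raise ValueError("username and repo must be non-empty")
    return ["ollama", "run", f"hf.co/{username}/{repo}:{tag}"]


# Default tag, then a specific quant, using the repo named in the tweet.
print(shlex.join(ollama_hf_command("bartowski", "Llama-3.2-1B-Instruct-GGUF")))
print(shlex.join(ollama_hf_command("bartowski", "Llama-3.2-1B-Instruct-GGUF", "Q8_0")))
```

To actually launch the model, the returned list could be passed to `subprocess.run`, assuming Ollama is installed locally.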

Wait wtf!? @GoogleDeepMind Gemini 1.5Pro out scoring @OpenAI O1-preview on FrontierMath :O Even @AnthropicAI 3.5 Sonnet (new) beats it!

73k

People with Innovator archetype

The Innovator

Engineering Manager @Meta Reality Labs | prev @FuboTV | AI, Spatial Computing & Robotics news weekly - visionquest.beehiiv.com

610 following, 604 followers

The Innovator

@0G_labs Changed me / web3 enthusiast / 0Gurus @0G_labs / Never Give up

4k following, 2k followers

The Innovator

Web3 to explore before i sleep :)

2k following, 2k followers

The Innovator

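Product person | Researching AI is both my job and my hobby | Exploring passive income | Focused on hands-on AI practice, just here to chat about AI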

370 following, 768 followers

The Innovator

xyus 21 | full stack dev | pimary go and ts | working as an ai engineer at a startup | trying to get better at mathematics

77 following, 28 followers

The Innovator

Artist + Software Developer / Married to @Helen_Crispin_ / Previously AR-VR and Neurotech

5k following, 43k followers

The Innovator

I write here sometimes | @contrary, previously @warpdotdev

1k following, 528 followers

The Innovator

Chief Technology Officer @IcehouseVenture Advocate of design thinking & equity crowdfunding. Coffee blogger, ski instructor & business author: amzn.to/1P7AHEl

6k following, 6k followers

The Innovator

manipulating waveforms | music • audio • dev • ai | building open source apps for music producers @soniqaudio_

615 following, 732 followers

The Innovator

Building the future of golf instruction and how we interface with our golf clubs. Founder of @stayhandsy

722 following, 352 followers

The Innovator

Building a wearable ultrasound @ Auvi Labs | UIUC

248 following, 146 followers

The Innovator

𝙋𝙝𝙮𝙨𝙞𝙘𝙞𝙖𝙣 𝙩𝙪𝙧𝙣𝙚𝙙 𝘿𝙚𝙁𝙞 𝙙𝙚𝙜𝙚𝙣.

938 following, 445 followers
