AI & ML interests
Enterprise-grade AI models
Recent Activity
Papers
ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Articles
IBM Granite
Granite is a family of open, enterprise-grade AI models that are performant, efficient, and trustworthy.
- π€ Granite SLMs - Our latest language models, faster, leaner, and built for agentic workloads.
- ποΈβπ¨οΈ Granite Vision - VLM with a special emphasis on document-related tasks.
- π₯ Granite Docling - Document models for enterprise document workflows.
- π£οΈ Granite Speech - Models for automatic speech recognition and spoken language understanding.
- π Granite Embedding - High-quality embedding models for RAG and semantic search.
- π¦Ί Granite Guardian - Safety and content moderation models for responsible AI deployments.
- π Granite Time Series - Models purpose-built for enterprise time series data.
- π§© Granite Libraries - Libraries of adapters that supercharge a wide range of capabilities.
Resources
- π Docs: ibm.com/granite/docs/models/granite
- π§ͺ Playground: ibm.com/granite/playground
- π GitHub: github.com/ibm-granite
-
ibm-granite/granite-4.1-30b
Text Generation β’ 29B β’ Updated β’ 21.5k β’ 110 -
ibm-granite/granite-4.1-8b
Text Generation β’ 9B β’ Updated β’ 42.2k β’ 170 -
ibm-granite/granite-4.1-3b
Text Generation β’ 3B β’ Updated β’ 18.8k β’ 60 -
ibm-granite/granite-4.1-30b-base
Text Generation β’ 29B β’ Updated β’ 2.84k β’ 24
-
ibm-granite/granite-docling-258M
Image-Text-to-Text β’ 0.3B β’ Updated β’ 391k β’ 1.18k -
ibm-granite/granite-docling-258M-mlx
Image-Text-to-Text β’ 0.3B β’ Updated β’ 3.7k β’ 93 -
granite-docling-258M demo
π276Extract and convert document content from images
-
Granite Docling 258M WebGPU
π£157Convert document images to HTML with Docling
-
ibm-granite/granite-4.1-30b
Text Generation β’ 29B β’ Updated β’ 21.5k β’ 110 -
ibm-granite/granite-4.1-8b
Text Generation β’ 9B β’ Updated β’ 42.2k β’ 170 -
ibm-granite/granite-4.1-3b
Text Generation β’ 3B β’ Updated β’ 18.8k β’ 60 -
ibm-granite/granite-4.1-30b-base
Text Generation β’ 29B β’ Updated β’ 2.84k β’ 24
-
ibm-granite/granite-docling-258M
Image-Text-to-Text β’ 0.3B β’ Updated β’ 391k β’ 1.18k -
ibm-granite/granite-docling-258M-mlx
Image-Text-to-Text β’ 0.3B β’ Updated β’ 3.7k β’ 93 -
granite-docling-258M demo
π276Extract and convert document content from images
-
Granite Docling 258M WebGPU
π£157Convert document images to HTML with Docling
spaces 11
Granite 4.0 1B Speech
Granite 4.0 1B Speech recognition and translation demo
Granite Speech WebGPU
Transcribe and translate audio to text directly in your browser
Granite Vision Document Intelligence
Document intelligence with Granite-Vision-4.1-4B
Granite 4.0 Nano WebGPU
In-browser tool calling with IBM Granite-4.0
Granite Docling 258M WebGPU
Convert document images to HTML with Docling