Hivemind-32B-Preview

Hivemind-32B-Preview is a 32B-parameter model fine-tuned for multi-turn, emotionally attentive conversation in human-facing enterprise contexts. It is built on Qwen3-32B with a training set focused on conversational depth, emotional subtext, and sustained engagement across complex interpersonal scenarios.

Model Details

  • Parameters: 32B
  • Context length: 40,960 tokens
  • Precision: bfloat16
  • Base model: Qwen3-32B
  • License: Proprietary, subject to upstream Qwen license terms

Training

Hivemind-32B-Preview was fine-tuned for multi-turn, human-facing conversations involving ambiguity and emotional subtext. The training set was purpose-built from enterprise interaction data.

Intended Use

Hivemind-32B-Preview is designed for enterprise human-agent partnership contexts: customer support, coaching-style interactions, and similar conversational deployments where sustained emotional attentiveness matters.

Scope and Ongoing Work

Hivemind-32B-Preview is a preview release. As with any conversational model, it has scope boundaries we are actively refining:

  • It is not intended as a source of medical, legal, financial, or safety-critical advice, and should not be deployed in those contexts or as a replacement for professional human support.
  • Performance is strongest in standard conversational scenarios.

We welcome failure-case reports from researchers and deployment partners at contact@hivelabs.dev.

Usage

vLLM (recommended)

vllm serve HiveLabsAI/hivemind-32b-preview --dtype bfloat16

Transformers

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "HiveLabsAI/hivemind-32b-preview"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="bfloat16",
    device_map="auto",
)

messages = [{"role": "user", "content": "Your message here"}]
inputs = tokenizer.apply_chat_template(
    messages, return_tensors="pt", add_generation_prompt=True
).to(model.device)
outputs = model.generate(
    inputs, max_new_tokens=2048, temperature=0.6, top_p=0.95, top_k=20
)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))

About

Hivemind is developed by Hive Labs. For research collaboration, deployment questions, or to report failure cases, contact contact@hivelabs.dev.

Downloads last month
75
Safetensors
Model size
677k params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for HiveLabsAI/hivemind-32b-preview

Base model

Qwen/Qwen3-32B
Finetuned
(511)
this model
Quantizations
1 model