Fine-Tune Any LLM In Under 2 Minutes

Train Llama, Mistral, Qwen models on your data. Monitor with real-time analytics. Test with GraphRAG. Deploy to production with one click.

Under 2min
From dataset to deployed model
Real-time
WebSocket-streamed metrics
3 Formats
CSV, JSON, PDF exports

Complete AI Training Platform

Fine-tuning, analytics, testing, and predictions - everything you need to train production AI models.

LLM Fine-Tuning Made Simple

Train custom models on your data in under 2 minutes

Supported Models & Methods

  • Llama 3.3, 3.1, 2, Mistral, Qwen, and more base models
  • LoRA (Low-Rank Adaptation) and full fine-tuning
  • SFT (Supervised Fine-Tuning), DPO, ORPO, RLHF methods
  • Mixed precision training: FP16, BF16, FP32
  • 4-bit and 8-bit quantization (up to 75% memory savings) - see the sketch below
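
FineTune Lab configures all of this from the UI, but as a rough picture of what LoRA plus 4-bit quantization involves under the hood, here is a minimal sketch using the Hugging Face transformers, peft, and bitsandbytes libraries. The base model name and LoRA hyperparameters are illustrative assumptions, not platform defaults.

```python
# Minimal LoRA + 4-bit quantization sketch (Hugging Face stack).
# Illustrative only: FineTune Lab sets this up for you; the model name and
# hyperparameters here are assumptions, not platform defaults.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

base = "meta-llama/Llama-3.1-8B"  # any supported base model

bnb = BitsAndBytesConfig(
    load_in_4bit=True,                      # 4-bit weights: up to ~75% memory savings
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,  # BF16 mixed-precision compute
)

model = AutoModelForCausalLM.from_pretrained(base, quantization_config=bnb, device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(base)

lora = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],    # adapt attention projections only
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()          # only a small fraction of weights train
```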

Dataset Management

  • JSONL format validation with quality checks (example below)
  • Automatic dataset splitting (train/validation)
  • Dataset versioning and stats
  • Support for custom max_length and sequence truncation
  • Padding and tokenization handled automatically
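
For concreteness, here is one common chat-style JSONL layout and a minimal validate-and-split sketch. The exact schema and quality checks FineTune Lab applies may differ from what is shown.

```python
# Sketch: validate a chat-style JSONL dataset and split it into train/validation.
# A typical line looks like:
# {"messages": [{"role": "user", "content": "How do I reset my password?"},
#               {"role": "assistant", "content": "Go to Settings > Security..."}]}
# The schema shown is a common convention, not necessarily FineTune Lab's exact format.
import json, random

def load_and_validate(path):
    rows = []
    with open(path) as f:
        for i, line in enumerate(f, 1):
            row = json.loads(line)                      # raises if a line is not valid JSON
            assert "messages" in row, f"line {i}: missing 'messages'"
            assert all("role" in m and "content" in m for m in row["messages"]), \
                f"line {i}: each message needs 'role' and 'content'"
            rows.append(row)
    return rows

rows = load_and_validate("support_conversations.jsonl")
random.seed(0)
random.shuffle(rows)
split = int(len(rows) * 0.9)                            # 90/10 train/validation split
train, val = rows[:split], rows[split:]
```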

Training Configuration

  • Automatic hyperparameter optimization
  • Configurable epochs, batch size, and learning rate (sketch below)
  • Gradient accumulation and checkpointing
  • Early stopping with configurable patience
  • Logging-step frequency scaled to dataset size
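
As a mental model, these knobs map onto a standard trainer configuration. The sketch below uses Hugging Face TrainingArguments with an early-stopping callback; the values are illustrative, not the platform's auto-tuned defaults, and `model`, `train_ds`, and `val_ds` stand for the model and tokenized splits from the earlier sketches.

```python
# Sketch: how the knobs above map onto Hugging Face TrainingArguments plus an
# early-stopping callback. Values are illustrative, not auto-tuned defaults.
# `model` comes from the LoRA sketch; `train_ds` / `val_ds` are tokenized
# datasets built from the JSONL split above.
from transformers import EarlyStoppingCallback, Trainer, TrainingArguments

dataset_size = len(train_ds)
logging_steps = max(10, dataset_size // 100)   # scale logging frequency with dataset size

args = TrainingArguments(
    output_dir="checkpoints/",
    num_train_epochs=3,
    per_device_train_batch_size=4,
    gradient_accumulation_steps=8,             # effective batch size of 32
    learning_rate=2e-4,
    bf16=True,                                 # mixed-precision training
    gradient_checkpointing=True,               # trade compute for memory
    logging_steps=logging_steps,
    eval_strategy="steps",                     # "evaluation_strategy" on older releases
    eval_steps=logging_steps,
    save_steps=logging_steps,
    load_best_model_at_end=True,
    metric_for_best_model="eval_loss",
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=train_ds,
    eval_dataset=val_ds,
    callbacks=[EarlyStoppingCallback(early_stopping_patience=3)],  # configurable patience
)
trainer.train()
```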

Cloud Training

  • RunPod GPU cloud: A4000, A5000, A6000, H100
  • Budget limits with auto-stop when the limit is reached (sketch below)
  • Resume training from saved checkpoints
  • Multi-GPU training support
  • Automatic checkpoint saves and recovery
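
The budget auto-stop can be pictured as a simple watchdog wrapped around the training loop. The sketch below only illustrates the idea: the hourly rate, the loop, and the helpers training_step and save_checkpoint are hypothetical, not FineTune Lab internals.

```python
# Sketch of the budget auto-stop idea: estimate spend from elapsed GPU time and
# stop training (saving a checkpoint) once the limit is reached. The hourly rate
# and the helpers train_loader / training_step / save_checkpoint are hypothetical.
import time

GPU_HOURLY_RATE_USD = 1.99     # e.g. an A6000-class instance; illustrative only
BUDGET_LIMIT_USD = 50.0

start = time.time()

def over_budget() -> bool:
    elapsed_hours = (time.time() - start) / 3600
    return elapsed_hours * GPU_HOURLY_RATE_USD >= BUDGET_LIMIT_USD

for step, batch in enumerate(train_loader):        # train_loader: your training DataLoader
    loss = training_step(batch)                    # hypothetical per-step training function
    if over_budget():
        save_checkpoint(f"checkpoints/step-{step}")  # hypothetical; resume later with a higher budget
        break
```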

💡 Use Case

Upload your customer support conversations in JSONL format, select Llama 3.3 as the base model, enable 4-bit quantization to reduce memory, and click train. In under 2 minutes, you'll have a custom model that understands your product and responds like your best support agent.

Real-Time Training Analytics

Monitor every aspect of your training in real time

Live Monitoring

  • WebSocket-streamed loss curves updating with every batch (client sketch below)
  • GPU utilization, memory usage, and temperature tracking
  • Training and validation loss on the same chart
  • Real-time overfitting detection when losses diverge
  • Throughput metrics: tokens/sec and samples/sec
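
If you want to consume the live metric stream outside the UI, a small WebSocket client is enough. The endpoint URL and message fields in the sketch below are assumptions for illustration; check the FineTune Lab API docs for the actual ones.

```python
# Sketch: consume the live metric stream with a WebSocket client.
# The endpoint URL and message schema are assumptions for illustration;
# consult the FineTune Lab API docs for the real ones.
import asyncio, json
import websockets   # pip install websockets

async def watch(run_id: str):
    url = f"wss://example.finetune-lab.local/runs/{run_id}/metrics"  # hypothetical endpoint
    async with websockets.connect(url) as ws:
        async for raw in ws:
            m = json.loads(raw)
            print(f"step {m['step']:>6}  train_loss={m['train_loss']:.4f}  "
                  f"eval_loss={m.get('eval_loss', float('nan')):.4f}  "
                  f"tokens/s={m.get('tokens_per_sec', 0):.0f}")

asyncio.run(watch("my-run-id"))
```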

Analytics & Exports

  • Export in CSV, JSON, or PDF formats (parsing sketch below)
  • Compare up to 5 training runs side-by-side
  • Date range filtering and custom time periods
  • Cost tracking with budget limits and spending alerts
  • Perplexity tracking alongside loss metrics
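
Because the CSV and JSON exports have stable schemas, they drop straight into your own analysis tools. The column names in the sketch below (step, eval_loss) are hypothetical placeholders, not the guaranteed export schema.

```python
# Sketch: load CSV metrics exports into pandas and compare two runs.
# Column names (step, eval_loss) are hypothetical placeholders; check the
# export itself for the actual schema.
import pandas as pd

run_a = pd.read_csv("run_a_metrics.csv")
run_b = pd.read_csv("run_b_metrics.csv")

# Align the two runs on training step and compare eval loss.
merged = run_a.merge(run_b, on="step", suffixes=("_a", "_b"))
print(merged[["step", "eval_loss_a", "eval_loss_b"]].tail())
print("best eval loss:", run_a["eval_loss"].min(), "vs", run_b["eval_loss"].min())
```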

Model Comparison

  • Side-by-side metric tables with sorting and filtering
  • Color-coded loss curves (solid for train, dashed for eval)
  • Training effectiveness: compare DPO, ORPO, RLHF vs baseline
  • Trend indicators showing improvement or regression
  • Best checkpoint score combining multiple signals

Advanced Features

  • Gradient norm tracking to catch exploding gradients
  • A/B testing with statistical confidence intervals
  • Natural language analytics: ask questions in plain English
  • Anomaly detection flagging unusual metric patterns
  • Quality forecasting: predict metric trends before issues occur

💡 Use Case

Monitor training progress live in the Training Monitor page. See loss curves update with every batch, catch overfitting immediately when validation loss plateaus, and stop training the moment metrics stop improving - all without waiting hours to discover issues.

Intelligent Chat Testing

Test models with GraphRAG and context-aware evaluation

GraphRAG Knowledge

  • Upload PDF, TXT, and MD documents to the knowledge graph
  • Neo4j integration with Cypher queries (query sketch below)
  • Custom node types and relationships
  • Semantic embeddings with multi-hop traversal
  • Context display showing which sources the model used
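
Because the knowledge graph lives in Neo4j, you can also inspect it directly with Cypher. The node labels, relationship types, and connection details in the sketch below are hypothetical examples, not the platform's actual graph schema.

```python
# Sketch: query the Neo4j knowledge graph directly with the official Python driver.
# The node labels, relationship types, and connection details are hypothetical;
# the platform's actual schema may differ.
from neo4j import GraphDatabase

driver = GraphDatabase.driver("bolt://localhost:7687", auth=("neo4j", "password"))

# Two-hop traversal: chunks that mention a topic, plus the documents they came from.
CYPHER = """
MATCH (t:Topic {name: $topic})<-[:MENTIONS]-(c:Chunk)-[:PART_OF]->(d:Document)
RETURN d.title AS source, c.text AS chunk
LIMIT 10
"""

with driver.session() as session:
    for record in session.run(CYPHER, topic="password reset"):
        print(record["source"], "->", record["chunk"][:80])

driver.close()
```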

Evaluation Tools

  • Quick feedback: thumbs up/down and star ratings
  • Detailed evaluation with groundedness scoring
  • Custom evaluation tags (hallucination, off-topic, etc.)
  • Success/Fail marking with notes and expected behavior
  • All evaluations auto-saved to the database

Batch Testing

  • Upload validation sets with expected answers
  • Run automated prompts across multiple models
  • JSON schema validation for structured outputs
  • Custom Python scoring functions (example below)
  • Results with reference answer comparison
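
A custom scoring function is essentially a Python callable that takes a model output and a reference answer and returns scores. The signature and return format below are assumptions for illustration; FineTune Lab's exact interface may differ.

```python
# Sketch of a custom scoring function for batch tests: token-overlap F1 against
# the reference answer plus a JSON-validity check. The (output, reference) -> dict
# signature is an assumption, not necessarily the platform's exact interface.
import json

def score(output: str, reference: str) -> dict:
    out_tokens, ref_tokens = set(output.lower().split()), set(reference.lower().split())
    overlap = len(out_tokens & ref_tokens)
    precision = overlap / len(out_tokens) if out_tokens else 0.0
    recall = overlap / len(ref_tokens) if ref_tokens else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0

    try:                                   # optional structured-output check
        json.loads(output)
        valid_json = True
    except ValueError:
        valid_json = False

    return {"f1": round(f1, 3), "valid_json": valid_json, "passed": f1 >= 0.5}
```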

Model Observability

  • Response time trends: P50, P95, P99 percentiles (sketch below)
  • SLA breach rate tracking (>200ms threshold)
  • Token usage analytics: input vs output breakdown
  • Sentiment analysis: positive, neutral, negative trends
  • Session tagging for A/B test comparison
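
The latency percentiles and SLA breach rate are standard statistics over raw response times; the short sketch below shows how they are computed (example data only).

```python
# Sketch: compute P50/P95/P99 latency and the SLA breach rate (>200 ms) from raw
# response times, the same statistics surfaced in Model Observability.
import numpy as np

latencies_ms = np.array([112, 95, 143, 310, 180, 201, 88, 175, 460, 150])  # example data

p50, p95, p99 = np.percentile(latencies_ms, [50, 95, 99])
sla_breach_rate = float((latencies_ms > 200).mean())

print(f"P50={p50:.0f}ms  P95={p95:.0f}ms  P99={p99:.0f}ms  "
      f"SLA breaches: {sla_breach_rate:.0%}")
```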

💡 Use Case

Upload your product documentation to GraphRAG, then test if your fine-tuned model can answer customer questions with grounded citations. The chat interface shows which document chunks were used, helping validate that your model leverages context instead of hallucinating answers.

Prediction Tracking & Validation

Monitor learning progress and automate evaluation at scale

Training Predictions

  • Generate predictions at evaluation steps, at epoch boundaries, or every X steps
  • Configure 1-100 predictions per checkpoint
  • View prompt, ground truth, and model response
  • Prediction Evolution tracks improvement over epochs
  • Concrete evidence of learning beyond loss numbers

LLM-as-a-Judge

  • GPT-4, Claude, or custom fine-tuned judge models
  • Scores responses on 5 criteria: Helpful, Accurate, Clear, Safe, Complete (sketch below)
  • Human-readable explanations with numerical scores
  • Run on historical predictions retroactively
  • Scale evaluation without manual review
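
Conceptually, the judge is just another LLM prompted with a fixed rubric. The sketch below uses the OpenAI Python client with an illustrative prompt and JSON schema; it is not FineTune Lab's exact judge implementation, and the model choice is an assumption.

```python
# Sketch of the LLM-as-a-Judge idea: prompt a judge model with a fixed rubric and
# parse its scores. The rubric, JSON schema, and model choice are illustrative,
# not FineTune Lab's exact implementation.
import json
from openai import OpenAI   # pip install openai

client = OpenAI()  # expects OPENAI_API_KEY in the environment

RUBRIC = (
    "Score the assistant response from 1-5 on each criterion: "
    "helpful, accurate, clear, safe, complete. "
    'Reply with JSON only, like {"helpful": 4, "accurate": 5, "clear": 4, '
    '"safe": 5, "complete": 3, "explanation": "..."}'
)

def judge(prompt: str, response: str) -> dict:
    completion = client.chat.completions.create(
        model="gpt-4o",   # or any judge model you prefer; illustrative choice
        messages=[
            {"role": "system", "content": RUBRIC},
            {"role": "user", "content": f"Prompt:\n{prompt}\n\nResponse:\n{response}"},
        ],
    )
    return json.loads(completion.choices[0].message.content)

print(judge("How do I reset my password?", "Go to Settings > Security > Reset password."))
```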

Multi-Axis Rating

  • Score clarity, accuracy, conciseness, and quality separately
  • Aggregate scores into averages and distributions
  • Identify specific weaknesses: accurate but not concise
  • Groundedness scoring for RAG context usage
  • Confidence scores and token probabilities

Checkpoint Selection

  • Multi-metric scoring: eval loss + overfitting penalty + perplexity (sketch below)
  • Best checkpoint highlighted with improvement indicators
  • Compare predictions across checkpoints side-by-side
  • Prevents selecting overfitted checkpoints
  • Mark preferred checkpoints for easy reference
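
The multi-metric checkpoint score can be thought of as a weighted combination like the one below (lower is better). The weights and normalization are illustrative; FineTune Lab's actual formula is not shown here.

```python
# Sketch of multi-metric checkpoint scoring: combine eval loss, an overfitting
# penalty (train/eval gap), and perplexity. Weights are illustrative and lower
# is better; the exact formula FineTune Lab uses may differ.
import math

def checkpoint_score(train_loss: float, eval_loss: float) -> float:
    overfit_penalty = max(0.0, eval_loss - train_loss)   # penalize a widening train/eval gap
    perplexity = math.exp(eval_loss)
    return 1.0 * eval_loss + 0.5 * overfit_penalty + 0.1 * perplexity

checkpoints = {
    "step-300": (1.10, 1.15),
    "step-600": (0.82, 0.95),
    "step-900": (0.55, 1.05),   # lower train loss but worse eval loss: overfitting
}
best = min(checkpoints, key=lambda k: checkpoint_score(*checkpoints[k]))
print("best checkpoint:", best)   # -> step-600
```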

💡 Use Case

Configure predictions to generate every 100 steps during training. Watch the Prediction Evolution view to see actual responses improving from vague to accurate. If predictions aren't improving even though loss is decreasing, you've caught overfitting in real time.

How It Works

From training to deployment in four simple steps

1. Upload & Train

Upload your JSONL dataset, select a base model (Llama, Mistral, Qwen), configure training parameters, and click train. Enable quantization to reduce costs. Training starts on RunPod cloud GPUs in seconds.

Learn about fine-tuning →

2. Monitor Training

Watch real-time loss curves, GPU metrics, and throughput in the Training Monitor. Catch overfitting immediately when validation loss diverges. View sample predictions as the model learns.

Learn about training analytics →

3. Test & Evaluate

Upload documentation to GraphRAG and test your model in the Chat Portal. Rate responses, run batch tests, and enable LLM-as-a-Judge for automated evaluation. Compare checkpoints side-by-side.

Learn about chat testing →

4. Deploy to Production

Select the best checkpoint and deploy to RunPod Serverless with one click. Auto-scaling from 0 to 100+ GPUs. Track response times, costs, and quality metrics in Model Observability.

Learn about deployment →

Frequently Asked Questions

Common questions about FineTune Lab features

What models can I fine-tune and which training methods are supported?

FineTune Lab supports Llama 3.3, 3.1, 2, Mistral, Qwen, and other popular open-source models. Training methods include LoRA (efficient adaptation), full fine-tuning, SFT (Supervised Fine-Tuning), DPO, ORPO, and RLHF. You can also enable 4-bit or 8-bit quantization to reduce memory by up to 75%.

How much does training cost and can I set budget limits?

Training costs depend on GPU type (A4000 to H100) and duration. FineTune Lab shows a live cost counter and projected total as you train. You can configure hard limits like "stop at $50" or "stop after 10 hours". When the limit is reached, training stops automatically and saves the latest checkpoint. Resume later with a higher budget.

How is FineTune Lab different from training locally?

FineTune Lab provides real-time analytics, intelligent testing with GraphRAG, and automated evaluation that local training doesn't offer. Instead of staring at terminal logs, you get live loss curves, GPU monitoring, and instant overfitting detection. Plus, one-click deployment to production with automatic scaling on RunPod Serverless.

Can I export training data for my own analysis?

Yes. Export analytics in three formats: CSV (opens in Excel/Sheets), JSON (for data pipelines), and PDF (report-ready charts). All exports include training metrics, evaluation results, costs, and model comparisons with stable schemas for automated processing.

What's the difference between Monitor Training and Training Analytics?

Monitor Training shows live metrics for one training run at a time - use it while training is running. Training Analytics compares multiple completed runs side-by-side with overlaid loss curves and metric tables. Use Monitor for real-time tracking and Analytics for post-training comparison.

Does LLM-as-a-Judge cost extra tokens?

Yes, the judge model consumes tokens for each evaluation. GPT-4 and Claude Sonnet provide strong evaluation at reasonable cost. GPT-5 Pro offers exceptional reasoning but costs 10-15x more - reserve it for critical evaluations. You can also use your own fine-tuned models as judges.

How do I know which checkpoint to deploy?

Checkpoint management uses multi-metric scoring combining eval loss, overfitting penalty (train/eval gap), perplexity, and improvement rate. The best checkpoint is automatically highlighted. You can also compare predictions from different checkpoints side-by-side to see actual response quality before deploying.

Can I use GraphRAG for production inference?

GraphRAG is primarily designed for testing and evaluation during model development. It helps you validate context usage and response accuracy with citation-backed answers. For production RAG, you'd typically integrate your own vector database or knowledge graph with deployed model endpoints.

Ready to Train Your First Model?

Start with our free tier. No credit card required.