Introducing the SEA-LION v4.5 Suite – Agentic Power and Speed

We are glad to introduce the SEA-LION v4.5 family, which adds state-of-the-art models that work well as agents but also understand Southeast Asian (SEA) languages, cultures, and local knowledge to the SEA-LION family of models.

SEA-LION v4.5 models benefit from new post-training techniques: knowledge distillation, targeted supervised fine-tuning, and smarter model merging. With these methods, we have taken SOTA models from Qwen 3.6 and Gemma 4 and adapted them for the diverse regional and local contexts of SEA. We also trained our own speculative decoder to turbo charge our large Qwen model, for even greater efficiency (i.e. up to 5X).

SEA-LION v4.5

  • Custom distillation from Qwen3.5-397B-A17B and Gemma-4-31B-IT on an updated aisingapore/SEA-Instruct-2602
  • Focus on multilingual and multicultural fluency across English and key SEA languages including Burmese, Indonesian, Filipino, Malay, Tamil, Thai, and Vietnamese.
  • This set of models is designed to give developers and enterprises immediate access to high-capacity reasoning and agentic capabilities in English and SEA regional contexts even in resource-constrained environments.

1. Gemma-SEA-LION-v4.5-E2B-IT (Small and Efficient Agentic Base)

Built upon Gemma-4-E2B-IT, this highly streamlined model effectively operates at the size of a 2B model to maximise parameter efficiency and is optimised for resource-constrained environments. It maintains exceptional performance-to-size efficiency, making it highly effective for automated agentic tool use in localised SEA contexts with minimal memory overhead.

2. Qwen-SEA-LION-v4.5-27B-IT (Powerful Agent with Low Latency)

Built upon Qwen3.6-27B, this model preserves the advanced instruction following and multimodal agentic capabilities of its base while operating with deep linguistic and cultural understanding across SEA languages including Burmese, Indonesian, Filipino, Malay, Tamil, Thai, and Vietnamese. It features a native 262K context window, making it highly suitable for complex and agentic reasoning, document processing and enterprise RAG workflows.

3. Qwen-SEA-LION-v4.5-27B-IT-SpecDecoder (High-Throughput Booster)

To address the demands of latency-critical production environments, we are releasing a speed-optimising drafter model to complement Qwen-SEA-LION-v4.5-27B-IT, built upon z-lab’s Qwen3.6-27B-DFlash. Unlike conventional Multi Token Prediction (MTP), which drafts tokens sequentially, Qwen-SEA-LION-v4.5-27B-SpecDecoder is based on a block diffusion model that drafts blocks of tokens in one go, resulting in a much more significant speedup without sacrificing quality of output. Qwen-SEA-LION-v4.5-27B-IT-SpecDecoder works in perfect tandem with Qwen-SEA-LION-v4.5-27B-IT to dramatically increase tokens-per-second and reduce overall operational costs.

DatasetBaseline
aisingapore/Qwen-SEA-LION-v4.5-27B-IT
w/ MTPw/ DFlash
z-lab/Qwen3.6-27B-Dflash
w/ SpecDecoder
aisingapore/Qwen-SEA-LION-v4.5-27B-IT-SpecDecoder
gsm8k64.55 tok/s149.19 tok/s
2.31x
283.67 tok/s
4.39x
324.32 tok/s
5.02x
math50065.91 tok/s153.19 tok/s
2.32x
306.96 tok/s
4.66x
335.22 tok/s
5.09x
humaneval66.03 tok/s155.52 tok/s
2.36x
374.13 tok/s
5.67x
397.11 tok/s
6.01x
mbpp66.44 tok/s148.69 tok/s
2.24x
235.37 tok/s
3.54x
260.46 tok/s
3.92x
mt-bench66.40 tok/s136.89 tok/s
2.06x
153.81 tok/s
2.32x
163.37 tok/s
2.46x
Tagalog66.45 tok/s117.06 tok/s
1.76x
79.91 tok/s
1.20x
164.53 tok/s
2.48x
Burmese66.46 tok/s134.61 tok/s
2.03x
85.47 tok/s
1.29x
266.73 tok/s
4.01x
Tamil66.48 tok/s111.99 tok/s
1.68x
76.47 tok/s
1.15x
175.82 tok/s
2.64x
Indonesian66.47 tok/s119.68 tok/s
1.80x
90.30 tok/s
1.36x
134.24 tok/s
2.02x
Vietnamese66.48 tok/s112.04 tok/s
1.69x
87.51 tok/s
1.32x
133.95 tok/s
2.01x
Thai66.28 tok/s104.81 tok/s
1.58x
75.37 tok/s
1.14x
118.49 tok/s
1.79x
Chinese66.46 tok/s130.32 tok/s
1.96x
113.88 tok/s
1.71x
133.43 tok/s
2.01x
Malay66.44 tok/s127.31 tok/s
1.92x
106.24 tok/s
1.60x
169.91 tok/s
2.56x

*All the parameters setting follows default. num_speculative_tokens is 16

Speedup for Malay with SEA-LION-v4.5-27B-IT-SpecDecoder (right) compared with baseline (left)

Speedup for Burmese with SEA-LION-v4.5-27B-IT-SpecDecoder (bottom) compared with baseline (top)

Performance and Evaluation

On the SEA-HELM leaderboard, SEA-LION-v4.5 demonstrates improvement in multilingual and multicultural capabilities over their counterparts:

A table displaying the SEA Performance metrics for various models, including Gemma 4, SEA-LION v4.5, Qwen 3.5, and others, along with their respective scores and error margins.

Screenshot from SEA-HELM leaderboard
*We are constantly updating the leaderboard – more to come very soon!

Claw-Eval is an open benchmark that evaluates autonomous agents on 300 tasks across 3 splits and 9 categories, each task with human-verified rubrics. Tasks include (i) multi-turn conversational tasks with simulated user personas asking for clarification and advice, (ii) multimodal tasks testing perception and generation, as well as (iii) general tasks covering core agent tasks across communication, finance, ops, productivity and so on.

SEA-LION-v4.5 delivers strong multilingual and multicultural performance while retaining its agentic capabilities:

ModelPass^3
aisingapore/Gemma-SEA-LION-v4.5-E2B-IT9/107 (8%)
google/gemma-4-E2B-it11/107 (10%)
google/gemma-4-E4B-it7/107 (7%)
google/gemma-4-31B-it22/107 (21%)
aisingapore/Qwen-SEA-LION-v4.5-27B-IT48/107 (45%)
Qwen/Qwen3.6-27B47/107 (44%)
Qwen/Qwen3.5-27B42/107 (39%)

Scores for Claw-Eval with thinking enabled

ModelPass^3
aisingapore/Gemma-SEA-LION-v4.5-E2B-IT9/107 (8%)
google/gemma-4-E2B-it7/107 (7%)
google/gemma-4-E4B-it7/107 (7%)
google/gemma-4-31B-it23/107 (21%)
aisingapore/Qwen-SEA-LION-v4.5-27B-IT43/107 (40%)
Qwen/Qwen3.6-27B32/107 (30%)
Qwen/Qwen3.5-27B34/107 (32%)

Scores for Claw-Eval with thinking disabled

Get Started Today

The model weights, detailed model cards, and starter code are now available on our Hugging Face hub:

If you are building with the SEA-LION v4.5 suite, we would love to hear from you. For deployment inquiries or technical feedback, contact our team at sealion@aisingapore.org.