tsuzumi-7b
tsuzumi-7b
Version: 2
NTT DataLast updated May 2026

Models from Microsoft, Partners, and Community

Models from Microsoft, Partners, and Community models are a select portfolio of curated models both general-purpose and niche models across diverse scenarios by developed by Microsoft teams, partners, and community contributors
  • Managed by Microsoft: Purchase and manage models directly through Azure with a single license, world class support and enterprise grade Azure infrastructure
  • Validated by providers: Each model is validated and maintained by its respective provider, with Azure offering integration and deployment guidance.
  • Innovation and agility: Combines Microsoft research models with rapid, community-driven advancements.
  • Seamless Azure integration: Standard Microsoft Foundry experience, with support managed by the model provider.
  • Flexible deployment: Deployable as Managed Compute or Serverless API, based on provider preference.
Learn more about models from Microsoft, Partners, and Community

Key capabilities

About this model

tsuzumi is a lightweight large language model developed by NTT. tsuzumi is designed to handle both Japanese and English with high efficiency. tsuzumi was trained to follow instructions in both Japanese and English.

Key model capabilities

  • Instruction following in both Japanese and English
  • Text generation
  • Auto-regressive language modeling with transformer architecture
  • Supervised fine-tuning (SFT) for response learning
ModelSizeJapanese MT-bench; turn1? Japanese / English
writingstemhumanitiesroleplayextractioncodingmathreasoning
tsuzumi-7B7B8.6 / 8.27.6 / 7.18.45 / 8.26.3 / 6.055.6 / 2.92.3 / 2.21.1 / 1.12.1 / 4.3
https://github.com/Stability-AI/FastChat/tree/jp-stable/fastchat/llm_judge ?Evaluated in 1 turn only

Use cases

See Responsible AI for additional considerations for responsible use.

Key use cases

The provider has not supplied this information.

Out of scope use cases

The provider has not supplied this information.

Pricing

Pricing is based on a number of factors, including deployment type and tokens used. See pricing details here.

Technical specs

tsuzumi is an auto-regressive language optimized transformer. The tuned versions use supervised fine-tuning (SFT).
NameTraining DataParamsContent LengthGQATokens
tsuzumi-7bA mix of publicly available online and private data7B8k?1.4T

Training cut-off date

tsuzumi was trained until 2024/08; Knowledge cutoff is 2024/05. The pretraining data has a cutoff of May 2024.

Training time

tsuzumi was trained until August 2024.

Input formats

Models input text only.

Output formats

Models generate text only.

Supported languages

tsuzumi is designed to handle both Japanese and English.

Sample JSON response

{
  "output": "Linux?????????????????????????????????????????????????????????????????????\n\n1. ???????: Linux??????????????????????????????????????????????????????????????????? ?????????????????????????????OS???????????????\n\n2. ??????: Linux??????????????????????? ?????????????????????????????????????????????????????????????????????\n\n3. ???: Linux?? ?????????????????????????????????????????????????????????????????????????????PC??? ????????????????????????????·????????????\n\n4. ??????: Linux OS???????????????????????????????????????????????????????????????????????????????????????????"
}

Model architecture

tsuzumi is an auto-regressive language optimized transformer. The tuned versions use supervised fine-tuning (SFT).

Long context

The provider has not supplied this information.

Optimizing model performance

Basic parameters:
nameDefaults toExplanation
temperature0.15Controls randomness in the model. Lower values will make the model more deterministic and higher values will make the model more random.
max_tokens4096The maximum number of tokens to generate.
top_p1.0The cumulative probability of parameter highest probability vocabulary tokens to keep for nucleus sampling.
frequency_penalty0.0Positive values penalize new tokens based on their existing frequency in the text so far, decreasing the model's likelihood to repeat the same line verbatim.
logit_biasnullModify the likelihood of specified tokens appearing in the completion.
logprobsfalseWhether to return log probabilities of the output tokens or not.
top_logprobs0An integer between 0 and 20 specifying the number of most likely tokens to return at each token position, each with an associated log probability.
n1The number of generated response variations.
presence_penalty0.0Positive values penalize new tokens based on whether they appear in the text so far, increasing the model's likelihood to talk about new topics.
stopnullSpecification for stop words.
streamfalseWhether response is returned in partial message deltas.
Advanced Parameters – To specify, set "extra-params: allow" in your HTTP request header:
nameDefaults toexplanation
min_tokens0The minimum number of tokens to generate.
top_k-1 (no filter)The number of highest probability vocabulary tokens to keep for top-k-filtering.
repetition_penalty1.0The weight of penalty for repeated phrases. Higher values will suppress repeating similar phrases.
length_penalty1.0Penalizes sequences based on their length.

Additional assets

The provider has not supplied this information.

Training disclosure

Training, testing and validation

A mix of publicly available online and private data was used for training. The pretraining data has a cutoff of May 2024.

Distribution

Distribution channels

Available Regions: US EAST 2

More information

Download (?????????) - Terms of Use Link to Download page (?tsuzumi on Azure MaaS????????) - User Guide

Responsible AI considerations

Safety techniques

The provider has not supplied this information.

Safety evaluations

The provider has not supplied this information.

Known limitations

The provider has not supplied this information.

Acceptable use

Acceptable use policy

The provider has not supplied this information.

Quality and performance evaluations

Source: NTT Data In this section, we report the results for the tsuzumi models on Japanese standard benchmarks.
ModelSizeJapanese MT-bench; turn1※ 日本語 / 英語
writingstemhumanitiesroleplayextractioncodingmathreasoning
tsuzumi-7B7B8.6 / 8.27.6 / 7.18.45 / 8.26.3 / 6.055.6 / 2.92.3 / 2.21.1 / 1.12.1 / 4.3
https://github.com/Stability-AI/FastChat/tree/jp-stable/fastchat/llm_judge  ※1ターンでの評価を採用

Benchmarking methodology

Source: NTT Data The provider has not supplied this information.

Public data summary

Source: NTT Data The provider has not supplied this information.
Model Specifications
Last UpdatedMay 2026
Input TypeText
Output TypeText
ProviderNTT Data
Languages1 Language