supply-chain-trade-regulations
Version: 2
Description
The adapted AI model for supply chain trade regulations analysis (preview) is a 3.8B parameter, lightweight, state-of-the-art open model, trained using synthetic supply chain domain-specific datasets, focused on trade regulations.
The model is fine-tuned on the base model, Phi-3-Mini-128K-instruct. The training dataset includes both synthetic data and filtered publicly available data, with an emphasis on high-quality and reasoning-dense properties.
The model underwent a post-training process that incorporates both supervised fine-tuning and direct preference optimization for following instructions and safety measures. This fine-tuned model did not degrade compared to its baseline performance for common sense, language understanding, math, code, long context, or logical reasoning when assessed against benchmarks.
NOTE: This model is in preview
Disclaimer
One should not make any decisions based on the output of the model. Please review the output content to make sure it's accurate before relying on such output.
Resources
Here are some links to resources related to the Phi-3 class of models:
🏡 Phi-3 Portal
📰 Phi-3 Microsoft Blog
📖 Phi-3 Technical Report
🛠️ Phi-3 on Azure AI Studio
👩🍳 Phi-3 Cookbook
Model Architecture
The adapted AI model for supply chain trade regulations analysis has 3.8B parameters and is a dense decoder-only transformer model. The model is fine-tuned with supervised fine-tuning (SFT) and Direct Preference Optimization (DPO) to ensure alignment with human preferences and safety guidelines.
Training Datasets
The training dataset includes a wide range of books and materials collected from the web:
General benchmarking
Hardware
Note that by default, the Phi-3 Mini-4K-Instruct model uses flash attention, which requires certain types of GPU hardware to run. We have tested on the following GPU types:
- Publicly available books in general supply chain domains that are processed to create Q&A pairs using GPT-4o
- Publicly available books in the trade compliance domain, including codes and regulations (federal and international)
- Verified question-answer sets commonly asked by trade compliance managers on trade regulation, scaled synthetically
- Publicly available FineWeb-Edu dataset filtered to supply chain domain documents
- Tariff & Duty section contains questions related to the product code and import duty rate specific to a country/region, (e.g., European Union, USA, etc.), e.g., What is the EU TARIC for a smartphone?
- Import-Export: examples based on the rules and regulations related to import/export, e.g., What is the purpose of the US CTPAT Trade Compliance Program?
- Policy & Training: examples related to Trade Compliance, e.g., Which countries/regions do we consider as EU customs territory?
- Trade Sanctions: examples related to current market condition, e.g., How can I use the Commerce Country Chart when I have the ECCN?
Category | GPT-4o | GPT-4o-mini | Phi-3-mini-128K-Ins | Adapted-AI-model-for-supply-chain-trade-regulations-analysis |
---|---|---|---|---|
Tariffs & Duty | 1.77 | 1.67 | 1.24 | 1.65 |
Import-Export | 1.49 | 1.45 | 1.34 | 1.48 |
Policy & Training | 1.82 | 1.75 | 1.63 | 1.76 |
Trade Sanction | 1.49 | 1.35 | 1.02 | 1.35 |
Overall (normalized to 1) | 0.82 | 0.78 | 0.65 | 0.78 |
Category | Benchmark | Phi-3-mini-128K-Ins | Adapted-AI-model-for-supply-chain-trade-regulations-analysis |
---|---|---|---|
Popular aggregated benchmark | AGI Eval | 39.5 | 37.8 |
MMLU | 69.7 | 69.0 | |
Language Understanding | ANLI | 52.3 | 52.8 |
Reasoning | PIQA | 80.1 | 78.6 |
WinoGrande | 71.0 | 72.0 | |
Math | GSM-8K | 85.3 | 76.3 |
- NVIDIA A100
- NVIDIA A6000
- NVIDIA H100
Model Specifications
LicenseMit
Last UpdatedJanuary 2025
Input TypeText
Output TypeText
PublisherMicrosoft
Languages1 Language