
shenhao deepseek
Overview
This product provides Amazon Web Services Marketplace users with a one-stop solution for deploying DeepSeek R1 series distillation models. Based on the Amazon Linux 2023 operating system, the Ollama framework and web management interface are pre-configured through Docker containerization technology. It supports four model specifications: 7B, 14B, 32B, and 70B, covering everything from lightweight to high-precision inference requirements, and is suitable for enterprise AI development, academic research and production environment deployment. Core features Multi-model support: Pre-load the full range of DeepSeek R1 distillation models (7B-70B), and users can choose flexibly according to task complexity, taking into account inference speed and accuracy. Out of the box: Integrated with the Ollama framework, there is no need to manually configure the environment, and you can access the interactive interface through a browser after starting the EC2 instance. Elastic scaling: Supports multiple Amazon Web Services EC2 instance types (such as GPU-accelerated G4dn/G5 or CPU-optimized C6i) to suit the computing needs of different models. Efficient reasoning: Model loading and memory management are optimized through the Ollama framework, combined with Docker resource isolation technology to ensure stability under multiple tasks. Applicable scenarios Intelligent customer service: Based on the 70B model, complex semantic understanding and multi-round dialogue generation are realized. Content Creation: Quickly generate marketing copy, abstracts, or translated content through the 14B/32B model. Data analysis: Use the 7B model to handle tasks such as log analysis and entity recognition in a lightweight manner. R&D testing: Provide AI teams with a standardized model testing environment to reduce deployment costs. Technical advantages Fast deployment: A pre-built Amazon Machine Image (AMI) is provided, and the environment is initialized within 5 minutes. Resource isolation: Docker containerized deployment ensures resource independence between models and avoids competitive interference. Security compliance: Based on Amazon Linux 2023 security patch and encrypted communication (HTTPS) to ensure data privacy.
Highlights
- Flexible choice of multiple models The full range of DeepSeek R1 distillation models (7B/14B/32B/70B) is preset, and users can switch freely according to task requirements. The 7B model is suitable for lightweight real-time inference, and the 70B model satisfies high-precision complex scenarios, taking into account performance and cost optimization.
- Easy to use out of the box, simple to deploy Based on the Docker containerized package, it provides an Amazon Linux 2023 system image, completes EC2 instance startup and configuration in 5 minutes, integrates Ollama, and enables model management and interactive testing with zero code foundation.
- Enterprise-grade stability and scalability It supports horizontal scaling of multiple Amazon Web Services GPU/CPU instance types, and combines automatic load balancing and connection pool management with the Ollama framework to ensure service stability in high concurrency scenarios, and is suitable for development and testing to production-level AI application deployment.
Details
Pricing
shenhao deepseek
Vendor refund policy
Returns are currently not supported, but can be cancelled at any time; please contact: Tel: 010-86391575 Technical support: NOC@cloudchinas.com Business support: Sales@cloudchinas.com
Legal
Vendor terms and conditions
Content disclaimer
Usage information
Delivery details
64-bit (x86) Amazon Machine Image (AMI)
Amazon Machine Image (AMI)
An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.
Version release notes
Product name: DeepSeek R1 Model Reasoning Kit (Ollama Edition) This product is based on the Amazon Linux 2023 operating system, pre-integrates the Ollama framework and DeepSeek R1 series distillation model (7B/14B/32B/70B) to provide out-of-the-box AI inference services. Users can quickly deploy multi-specification models through Docker and enable interactive testing and management through the Web UI. It supports flexible expansion of Amazon Web Services GPU/CPU instances, and is suitable for scenarios such as intelligent customer service, content generation, and data analysis.
Core highlights One-click deployment: After subscribing through the Amazon Web Services Marketplace, launch an EC2 instance and run the service within 5 minutes. Security compliance: The initial web login password is dynamically bound to the instance ID to eliminate the risk of the default password. Multi-model switching: Supports 7B to 70B models to be called as needed, taking into account performance and cost.
Additional details
Usage instructions
- Quick Start Launching an instance
Select the AMI of this product in the Amazon Web Services console and launch an EC2 instance (g5.xlarge or above is recommended). The security group must allow ports: 22 (SSH), 3000 (web UI).
Login system SSH access (default user ec2-user): ssh -i “keyfile.pem” ec2-user @公有IP
Web UI access: Type https://公有IP:3000,用邮箱注册后,输入邮箱地址、密码登录即可使用 in your browser.
- Model management Select models
Once logged into the web UI, select the target model (7B/14B/32B/70B) from the drop-down menu.
You need to wait for the model to load for the first time (the 70B model takes about 2 minutes).
Initiate inference
Enter the prompt text (such as “Generate a short essay on AI”) and click “Run” to get the results.
- Advanced operation CLI call (SSH terminal)
curl http://localhost:11434/api/generate -d '{“model”: “deepseek-r 1:14 b”, “prompt”: “What are the advantages of Amazon Web Services?” , “stream”: false} 'monitoring resources
The web UI dashboard shows GPU/CPU utilization, memory usage, and request delays in real time.
View logs: sudo journalctl -u deepseek-r1 -f.
- Security recommendations Change the default password
As soon as you log in to the web UI, reset your password in Account Settings.
Enable HTTPS certificates
access restrictions
Restrict the web UI port (3000) through security groups to allow only trusted IP access.
- Troubleshooting the web UI is inaccessible: Check the security group rules and instance status, and confirm that port 3000 is open.
Support
Amazon Web Services infrastructure support
Amazon Web Services Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.