Sign in
Ningxia Region | Beijing Region
Categories
Your Saved List Become a Channel Partner Sell in Amazon Web Services Marketplace Global Expansion Hub Amazon Web Services Home Help
Amazon Web Services Marketplace China: shenhao deepseek
    Listing Thumbnail

    shenhao deepseek

     
    Sold by: 神灏云 
    This product is based on the Amazon Linux 2023 operating system, pre-integrates the Ollama framework and DeepSeek R1 series models, and provides out-of-the-box AI inference services. Users can quickly launch an inference environment that supports multi-scale models.

    Overview

    This product provides Amazon Web Services Marketplace users with a one-stop solution for deploying DeepSeek R1 series distillation models. Based on the Amazon Linux 2023 operating system, the Ollama framework and web management interface are pre-configured through Docker containerization technology. It supports four model specifications: 7B, 14B, 32B, and 70B, covering everything from lightweight to high-precision inference requirements, and is suitable for enterprise AI development, academic research and production environment deployment. Core features Multi-model support: Pre-load the full range of DeepSeek R1 distillation models (7B-70B), and users can choose flexibly according to task complexity, taking into account inference speed and accuracy. Out of the box: Integrated with the Ollama framework, there is no need to manually configure the environment, and you can access the interactive interface through a browser after starting the EC2 instance. Elastic scaling: Supports multiple Amazon Web Services EC2 instance types (such as GPU-accelerated G4dn/G5 or CPU-optimized C6i) to suit the computing needs of different models. Efficient reasoning: Model loading and memory management are optimized through the Ollama framework, combined with Docker resource isolation technology to ensure stability under multiple tasks. Applicable scenarios Intelligent customer service: Based on the 70B model, complex semantic understanding and multi-round dialogue generation are realized. Content Creation: Quickly generate marketing copy, abstracts, or translated content through the 14B/32B model. Data analysis: Use the 7B model to handle tasks such as log analysis and entity recognition in a lightweight manner. R&D testing: Provide AI teams with a standardized model testing environment to reduce deployment costs. Technical advantages Fast deployment: A pre-built Amazon Machine Image (AMI) is provided, and the environment is initialized within 5 minutes. Resource isolation: Docker containerized deployment ensures resource independence between models and avoids competitive interference. Security compliance: Based on Amazon Linux 2023 security patch and encrypted communication (HTTPS) to ensure data privacy.

    Highlights

    • Flexible choice of multiple models The full range of DeepSeek R1 distillation models (7B/14B/32B/70B) is preset, and users can switch freely according to task requirements. The 7B model is suitable for lightweight real-time inference, and the 70B model satisfies high-precision complex scenarios, taking into account performance and cost optimization.
    • Easy to use out of the box, simple to deploy Based on the Docker containerized package, it provides an Amazon Linux 2023 system image, completes EC2 instance startup and configuration in 5 minutes, integrates Ollama, and enables model management and interactive testing with zero code foundation.
    • Enterprise-grade stability and scalability It supports horizontal scaling of multiple Amazon Web Services GPU/CPU instance types, and combines automatic load balancing and connection pool management with the Ollama framework to ensure service stability in high concurrency scenarios, and is suitable for development and testing to production-level AI application deployment.

    Details

    Delivery method

    Delivery option
    64-bit (x86) Amazon Machine Image (AMI)

    Latest version

    Operating system
    AmazonLinux 2023.6.20250218.2

    Pricing

    shenhao deepseek

     
    This product is available free of charge. Free subscriptions have no end date and may be canceled any time.
    Additional Amazon Web Services infrastructure costs may apply. Use the Amazon Web Services Pricing Calculator  to estimate your infrastructure costs.

    Vendor refund policy

    Returns are currently not supported, but can be cancelled at any time; please contact: Tel: 010-86391575 Technical support: NOC@cloudchinas.com  Business support: Sales@cloudchinas.com 

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. Amazon Web Services Marketplace China does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     

    Delivery details

    64-bit (x86) Amazon Machine Image (AMI)

    Amazon Machine Image (AMI)

    An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.

    Version release notes

    Product name: DeepSeek R1 Model Reasoning Kit (Ollama Edition) This product is based on the Amazon Linux 2023 operating system, pre-integrates the Ollama framework and DeepSeek R1 series distillation model (7B/14B/32B/70B) to provide out-of-the-box AI inference services. Users can quickly deploy multi-specification models through Docker and enable interactive testing and management through the Web UI. It supports flexible expansion of Amazon Web Services GPU/CPU instances, and is suitable for scenarios such as intelligent customer service, content generation, and data analysis.

    Core highlights One-click deployment: After subscribing through the Amazon Web Services Marketplace, launch an EC2 instance and run the service within 5 minutes. Security compliance: The initial web login password is dynamically bound to the instance ID to eliminate the risk of the default password. Multi-model switching: Supports 7B to 70B models to be called as needed, taking into account performance and cost.

    Additional details

    Usage instructions

    1. Quick Start Launching an instance

    Select the AMI of this product in the Amazon Web Services console and launch an EC2 instance (g5.xlarge or above is recommended). The security group must allow ports: 22 (SSH), 3000 (web UI).

    Login system SSH access (default user ec2-user): ssh -i “keyfile.pem” ec2-user @公有IP

    Web UI access: Type https://公有IP:3000,用邮箱注册后,输入邮箱地址、密码登录即可使用  in your browser.

    1. Model management Select models

    Once logged into the web UI, select the target model (7B/14B/32B/70B) from the drop-down menu.

    You need to wait for the model to load for the first time (the 70B model takes about 2 minutes).

    Initiate inference

    Enter the prompt text (such as “Generate a short essay on AI”) and click “Run” to get the results.

    1. Advanced operation CLI call (SSH terminal)

    curl http://localhost:11434/api/generate  -d '{“model”: “deepseek-r 1:14 b”, “prompt”: “What are the advantages of Amazon Web Services?” , “stream”: false} 'monitoring resources

    The web UI dashboard shows GPU/CPU utilization, memory usage, and request delays in real time.

    View logs: sudo journalctl -u deepseek-r1 -f.

    1. Security recommendations Change the default password

    As soon as you log in to the web UI, reset your password in Account Settings.

    Enable HTTPS certificates

    access restrictions

    Restrict the web UI port (3000) through security groups to allow only trusted IP access.

    1. Troubleshooting the web UI is inaccessible: Check the security group rules and instance status, and confirm that port 3000 is open.

    Support

    Amazon Web Services infrastructure support

    Amazon Web Services Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Customer reviews

    Ratings and reviews

     
    0 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    0%
    0%
    0%
    0%
    0%
    0 reviews
    No customer reviews yet
    Be the first to review this product .