Sign in
Ningxia Region | Beijing Region
Categories
Your Saved List Become a Channel Partner Sell in Amazon Web Services Marketplace Global Expansion Hub Amazon Web Services Home Help
Amazon Web Services Marketplace China: JunYun DeepSeek Solution
    Listing Thumbnail

    JunYun DeepSeek Solution

     
    Based on the Amazon Web Services cloud service, it provides enterprises with DeepSeek-based online inference service deployment, model distillation, model quantification, upper level development services, and other peripheral services.

    Overview

    Juyun's core services include:

    1. Inference service deployment Based on Amazon Web Services cloud inference service deployment, including quantized versions and non-quantized full-blood versions, hardware configuration support:
      • Large video memory deployment, high video memory requirements
      • Deploy small video memory and large general-purpose memory, and build inference services with fewer GPU resources through a balance between GPU memory and general-purpose memory.
    2. Model distillation We provide a closed-loop model distillation service, that is, by constructing an evaluation system, digitally comparing model effects before and after distillation through the evaluation system, we provide high-quality distillation model delivery that meets application scenarios.
    3. Model quantification Provide model quantification services and customize quantification standards according to hardware.
    4. Upper-level development services According to the user scenario, help customers build upper level development services such as Q&A, number of questions, and intelligent business response, including engineering services for various prompts required in the process, and model effect evaluation services.
    5. Other peripheral services Customers encounter various problems in building their own AI applications, including but not limited to: cross-language business, labeling, prompt engineering, structured information extraction, and vertical application services.

    Highlights

    • Flexible deployment capabilities 1. Customize the model that best suits the user scenario, not limited to whether the model is open source or closed source. 2. Multiple inference frameworks are supported, and quantization and full blood versions are supported at the same time. 3. The inference framework does not rely on large video memory or multiple graphics card configurations to reduce the dependency on GPUs.
    • Assured Delivery 1. Measurable distillation optimization technology: Build evaluation data sets for customer scenarios, digitally compare model capabilities before and after delivery, so that users can face optimization results more intuitively. 2. More secure delivery for customers: Dive into customer scenarios, provide solutions and methods to verify methods, complete test reports and usable online environments.
    • Service that keeps pace with the times 1. Good architecture makes it easier to replace the underlying model 2. It can be flexibly configured according to user needs and follows the latest open source model technology, so that users' applications can always keep up with the latest technical limits.

    Details

    Delivery method

    Pricing

    Custom pricing options

    Pricing is based on your specific requirements and eligibility. To get a custom quote for your needs, request a private offer.

    Legal

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. Amazon Web Services Marketplace China does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Support

    Vendor support

    Please contact us offline to obtain product testing qualifications. Tel: 010-62927779-5501 Email: support@marshotspot.com