feat: support yi ai model #980

Chi-Kai · 2024-05-20T09:08:05Z

Ⅰ. Describe what this PR did

Support Yi ai model, api documentation（https://platform.lingyiwanwu.com）

Ⅱ. Does this pull request fix one issue?

fixes #957

Ⅲ. Why don't you add test cases (unit test/integration test)?

Ⅳ. Describe how to verify it

envoy.yaml:

admin:
  address:
    socket_address:
      protocol: TCP
      address: 0.0.0.0
      port_value: 9901
static_resources:
  listeners:
    - name: listener_0
      address:
        socket_address:
          protocol: TCP
          address: 0.0.0.0
          port_value: 10000
      filter_chains:
        - filters:
            - name: envoy.filters.network.http_connection_manager
              typed_config:
                "@type": type.googleapis.com/envoy.extensions.filters.network.http_connection_manager.v3.HttpConnectionManager
                scheme_header_transformation:
                  scheme_to_overwrite: https
                stat_prefix: ingress_http
                # Output envoy logs to stdout
                access_log:
                  - name: envoy.access_loggers.stdout
                    typed_config:
                      "@type": type.googleapis.com/envoy.extensions.access_loggers.stream.v3.StdoutAccessLog
                # Modify as required
                route_config:
                  name: local_route
                  virtual_hosts:
                    - name: local_service
                      domains: [ "*" ]
                      routes:
                        - match:
                            prefix: "/"
                          route:
                            cluster: yi
                            timeout: 300s
                http_filters:
                  - name: wasmtest
                    typed_config:
                      "@type": type.googleapis.com/udpa.type.v1.TypedStruct
                      type_url: type.googleapis.com/envoy.extensions.filters.http.wasm.v3.Wasm
                      value:
                        config:
                          name: wasmtest
                          vm_config:
                            runtime: envoy.wasm.runtime.v8
                            code:
                              local:
                                filename: /etc/envoy/plugin.wasm
                          configuration:
                            "@type": "type.googleapis.com/google.protobuf.StringValue"
                            value: |
                              {
                                "provider": {
                                  "type": "yi",
                                  "apiTokens": [
                                    "your-apiTokens"
                                  ]
                                }
                              }
                  - name: envoy.filters.http.router
  clusters:
    - name: httpbin
      connect_timeout: 30s
      type: LOGICAL_DNS
      # Comment out the following line to test on v6 networks
      dns_lookup_family: V4_ONLY
      lb_policy: ROUND_ROBIN
      load_assignment:
        cluster_name: httpbin
        endpoints:
          - lb_endpoints:
              - endpoint:
                  address:
                    socket_address:
                      address: httpbin
                      port_value: 80
    - name: yi
      connect_timeout: 30s
      type: LOGICAL_DNS
      dns_lookup_family: V4_ONLY
      lb_policy: ROUND_ROBIN
      load_assignment:
        cluster_name: yi
        endpoints:
          - lb_endpoints:
              - endpoint:
                  address:
                    socket_address:
                      address: api.lingyiwanwu.com
                      port_value: 443
      transport_socket:
        name: envoy.transport_sockets.tls
        typed_config:
          "@type": type.googleapis.com/envoy.extensions.transport_sockets.tls.v3.UpstreamTlsContext
          "sni": "api.lingyiwanwu.com"

Request:

curl "http://localhost:10000/v1/chat/completions"  -H "Content-Type: application/json" -d '{
  "model": "yi-large",            
  "messages": [                                                     
    {                        
      "role": "user",                
      "content": "你好，你是谁？"
    }                                                                                 
  ]                                                 
}'

Response:

{
    "id": "cmpl-1cef946a",
    "object": "chat.completion",
    "created": 12063916,
    "model": "yi-large",
    "usage": {
        "completion_tokens": 42,
        "prompt_tokens": 15,
        "total_tokens": 57
    },
    "choices": [
        {
            "index": 0,
            "message": {
                "role": "assistant",
                "content": "你好！我是一个人工智能助手，专门设计来回答问题、提供信息和帮助解决问题。人们通常叫我“AI助手”或者类似的名字。如果你有任何问题或需要帮助，请随时告诉我！"
            },
            "finish_reason": "stop"
        }
    ]
}

Ⅴ. Special notes for reviews

CLAassistant · 2024-05-20T09:08:11Z

All committers have signed the CLA.

johnlanni · 2024-05-20T09:22:17Z

cc @CH3CHO

CH3CHO

LGTM. Thanks.

feat: support yi ai model

e5fa61f

Chi-Kai requested review from johnlanni and WeixinX as code owners May 20, 2024 09:08

johnlanni requested a review from CH3CHO May 20, 2024 09:22

Merge branch 'main' into main

49d0ca6

CH3CHO approved these changes May 21, 2024

View reviewed changes

CH3CHO merged commit 33013d0 into alibaba:main May 21, 2024
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support yi ai model #980

feat: support yi ai model #980

Chi-Kai commented May 20, 2024

CLAassistant commented May 20, 2024 •

edited

johnlanni commented May 20, 2024

CH3CHO left a comment

feat: support yi ai model #980

feat: support yi ai model #980

Conversation

Chi-Kai commented May 20, 2024

Ⅰ. Describe what this PR did

Ⅱ. Does this pull request fix one issue?

Ⅲ. Why don't you add test cases (unit test/integration test)?

Ⅳ. Describe how to verify it

Ⅴ. Special notes for reviews

CLAassistant commented May 20, 2024 • edited

johnlanni commented May 20, 2024

CH3CHO left a comment

Choose a reason for hiding this comment

CLAassistant commented May 20, 2024 •

edited