
feat: add baichuan llm support #979

Merged: 7 commits into alibaba:main on May 22, 2024

Conversation

@lizzy-0323 (Contributor) commented May 20, 2024

Ⅰ. Describe what this PR did

Support the Baichuan AI LLM. API documentation: https://platform.baichuan-ai.com/docs/api

Ⅱ. Does this pull request fix one issue?

fix #952

Ⅲ. Why don't you add test cases (unit test/integration test)?

Ⅳ. Describe how to verify it

envoy.yaml:

admin:
  address:
    socket_address:
      protocol: TCP
      address: 0.0.0.0
      port_value: 9901
static_resources:
  listeners:
    - name: listener_0
      address:
        socket_address:
          protocol: TCP
          address: 0.0.0.0
          port_value: 10000
      filter_chains:
        - filters:
            - name: envoy.filters.network.http_connection_manager
              typed_config:
                "@type": type.googleapis.com/envoy.extensions.filters.network.http_connection_manager.v3.HttpConnectionManager
                scheme_header_transformation:
                  scheme_to_overwrite: https
                stat_prefix: ingress_http
                # Output envoy logs to stdout
                access_log:
                  - name: envoy.access_loggers.stdout
                    typed_config:
                      "@type": type.googleapis.com/envoy.extensions.access_loggers.stream.v3.StdoutAccessLog
                # Modify as required
                route_config:
                  name: local_route
                  virtual_hosts:
                    - name: local_service
                      domains: ["*"]
                      routes:
                        - match:
                            prefix: "/"
                          route:
                            cluster: baichuan
                            timeout: 300s
                http_filters:
                  - name: baichuan
                    typed_config:
                      "@type": type.googleapis.com/udpa.type.v1.TypedStruct
                      type_url: type.googleapis.com/envoy.extensions.filters.http.wasm.v3.Wasm
                      value:
                        config:
                          name: baichuan
                          vm_config:
                            runtime: envoy.wasm.runtime.v8
                            code:
                              local:
                                filename: /etc/envoy/main.wasm
                          configuration:
                            "@type": "type.googleapis.com/google.protobuf.StringValue"
                            value: |
                              {
                                "provider": {
                                  "type": "baichuan",
                                  "apiTokens": [
                                    "sk-xxxxxx"
                                  ],
                                  "withSearchEnhance":false
                                }
                              }

                  - name: envoy.filters.http.router
  clusters:
    - name: baichuan
      connect_timeout: 30s
      type: LOGICAL_DNS
      dns_lookup_family: V4_ONLY
      lb_policy: ROUND_ROBIN
      load_assignment:
        cluster_name: baichuan
        endpoints:
          - lb_endpoints:
              - endpoint:
                  address:
                    socket_address:
                      address: api.baichuan-ai.com
                      port_value: 443
      transport_socket:
        name: envoy.transport_sockets.tls
        typed_config:
          "@type": type.googleapis.com/envoy.extensions.transport_sockets.tls.v3.UpstreamTlsContext
          "sni": "api.baichuan-ai.com"

Request:

curl "http://localhost:10000/v1/chat/completions"  -H "Content-Type: application/json" -d '{
  "model": "Baichuan2-Turbo",            
  "messages": [                                                     
    {                        
      "role": "user",                
      "content": "你好,你是谁?"
    }                                                                                 
  ]                                                 
}'  

Response:

{
   "id":"chatcmpl-Mcdc9015KFRHoSV",
   "object":"chat.completion",
   "created":1716190037,
   "model":"Baichuan2-Turbo",
   "choices":[
      {
         "index":0,
         "message":{
            "role":"assistant",
            "content":"你好!我是百川大模型,是由百川智能的工程师们创造的大语言模型,我可以和人类进行自然交流、解答问题、协助创作,帮助大众轻松、普惠的获得世界知识和专业服务。如果你有任何问题,可以随时向我提问。"
         },
         "finish_reason":"stop"
      }
   ],
   "usage":{
      "prompt_tokens":6,
      "completion_tokens":52,
      "total_tokens":58
   }
}

Ⅴ. Special notes for reviews

In the Baichuan AI API documentation I found an API parameter called with_search_enhance. It is a boolean parameter, and enabling it consumes extra tokens on every request. I think this parameter only needs to be read from the context JSON, so I did not add it to the documentation or the code.
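
A minimal sketch, in Go, of how such a flag could be read from the provider configuration shown in the envoy.yaml above, assuming a gjson-style lookup; the struct and function names here are illustrative and are not the plugin's actual API:

package main

import (
    "fmt"

    "github.com/tidwall/gjson"
)

// baichuanProviderConfig is a hypothetical container for the fields this
// sketch reads from the "provider" block; it is not the plugin's real type.
type baichuanProviderConfig struct {
    apiToken          string
    withSearchEnhance bool
}

// parseProviderConfig reads fields out of the provider JSON shown in the
// envoy.yaml above. gjson returns a zero-value Result for a missing key, so
// withSearchEnhance defaults to false (no extra token cost) when omitted.
func parseProviderConfig(raw string) baichuanProviderConfig {
    provider := gjson.Get(raw, "provider")
    return baichuanProviderConfig{
        apiToken:          provider.Get("apiTokens.0").String(),
        withSearchEnhance: provider.Get("withSearchEnhance").Bool(),
    }
}

func main() {
    raw := `{"provider":{"type":"baichuan","apiTokens":["sk-xxxxxx"],"withSearchEnhance":false}}`
    cfg := parseProviderConfig(raw)
    fmt.Printf("apiToken=%s withSearchEnhance=%v\n", cfg.apiToken, cfg.withSearchEnhance)
}

Under this reading, a missing withSearchEnhance key falls back to false, so the search enhancement (and its extra token cost) stays off unless it is explicitly enabled.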

@CLAassistant commented May 20, 2024

CLA assistant check
All committers have signed the CLA.

@johnlanni johnlanni requested a review from CH3CHO May 20, 2024 09:22
@johnlanni (Collaborator) commented:
cc @CH3CHO

@lizzy-0323 lizzy-0323 changed the title feature: add baichuan llm support feat: add baichuan llm support May 20, 2024
@CH3CHO (Collaborator) left a comment


Another model's provider was merged first, so this code now has conflicts. Please handle them together with the comments above. Thanks!

plugins/wasm-go/extensions/ai-proxy/README.md (review comment, outdated, resolved)
@lizzy-0323 lizzy-0323 requested a review from CH3CHO May 21, 2024 12:00
Updated comments in provider.go, add "yi"
@lizzy-0323 lizzy-0323 requested a review from CH3CHO May 21, 2024 13:48
@CH3CHO CH3CHO merged commit fc6a6aa into alibaba:main May 22, 2024
11 checks passed
@Suchun-sv (Contributor) commented May 27, 2024

Suspected API token leak.

Successfully merging this pull request may close these issues.

Integrate the Baichuan LLM with the AI proxy Wasm plugin