-
Notifications
You must be signed in to change notification settings - Fork 82
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support Gemini Experimental #234
base: main
Are you sure you want to change the base?
Conversation
@happy-qop, in case you are curious about the result : ) |
|
|
|
llm_toolkit/models.py
Outdated
@@ -350,6 +350,16 @@ class GeminiPro(GeminiModel): | |||
_vertex_ai_model = 'gemini-1.0-pro' | |||
|
|||
|
|||
class GeminiExperimental(VertexAIModel): | |||
"""Gemini Experimental, results maybe genius or delightfully unpredictable.""" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's remove the "maybe genius or delightfully unpredictable." part :) I know this is what is shown on Vertex AI, but no need to include it here in our codebase.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done
llm_toolkit/models.py
Outdated
"""Gemini Experimental, results maybe genius or delightfully unpredictable.""" | ||
|
||
_max_output_tokens = 8192 | ||
context_window = 32760 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not a big deal for our current workloads, but is this actually the same 1M context windows as Gemini 1.5?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added a TODO as the current doc is not very clear.
llm_toolkit/models.py
Outdated
@@ -350,6 +350,16 @@ class GeminiPro(GeminiModel): | |||
_vertex_ai_model = 'gemini-1.0-pro' | |||
|
|||
|
|||
class GeminiExperimental(VertexAIModel): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this needs to inherit from GeminiModel
instead.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixed, thanks!
The current quota for |
c403e5d
to
7f32999
Compare
/gcbrun request_pr_exp.py -n test-gemini-exp -m vertex_ai_gemini-experimental |
This PR supports
gemini-experimental
, which appears to work well in some cases.Comparison experiment plan:
gemini-experimental
against our defaultcode-bison-32k
with the same temperature0.4
.gemini-experimental
with its default temperature1
.