Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

install on databricks #475

Open
Jackie0601zhou opened this issue May 16, 2024 · 9 comments
Open

install on databricks #475

Jackie0601zhou opened this issue May 16, 2024 · 9 comments

Comments

@Jackie0601zhou
Copy link

How can I install unsloth on databricks notebook?
I tried "pip install "unsloth[cu121-ampere-torch220] @ git+https://github.com/unslothai/unsloth.git"
and I met:
Cloning https://github.com/unslothai/unsloth.git to /tmp/pip-install-rst_tpkk/unsloth_e8849fa753954ad5b20ad0a81efbd0be
Running command git clone --filter=blob:none --quiet https://github.com/unslothai/unsloth.git /tmp/pip-install-rst_tpkk/unsloth_e8849fa753954ad5b20ad0a81efbd0be
fatal: unable to access 'https://github.com/unslothai/unsloth.git/': gnutls_handshake() failed: The TLS connection was non-properly terminated.
error: subprocess-exited-with-error

@danielhanchen
Copy link
Contributor

That seems like internet access does not work maybe?

@Jackie0601zhou
Copy link
Author

I used the HTTPS url and choose github as gitprovider when I added a repo to databricks. I also installed specific version of various packages.
All the steps before trainer_stats = trainer.train() were good. But when I run trainer_stats = trainer.train(), it show:
屏幕截图 2024-05-19 230637
屏幕截图 2024-05-19 230654

@jvhuang1786
Copy link

Currently getting the same issue as Jackie. I'm in a more regulated environment for databricks so I have to first download the repo and install through volumes. I have a suspicion that it's a dependency conflict but not too sure where to start looking.

@KwesiD
Copy link

KwesiD commented Jun 3, 2024

I'm also getting the same issue. I've tried installing different versions of the packages, but I end up with the same error.

image
image

@danielhanchen
Copy link
Contributor

Hmmm wait is databricks using MLFlow?

@KwesiD
Copy link

KwesiD commented Jun 5, 2024

Yes. By default Databricks logs the runs with MLFlow.

@danielhanchen
Copy link
Contributor

Hmmm ok - oh also is Databricks multi GPU?

@KwesiD
Copy link

KwesiD commented Jun 9, 2024

In my instance, I'm only using a single GPU. It's possible to set up a multi GPU cluster, though.

@danielhanchen
Copy link
Contributor

Hmmm tbh I haven't tried Databricks so I can't exactly debug it - I'll see what I can do, but can't promise anything sorrty

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants