Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

trajis-tech
/
llama-cpp-python-trajis-tech-nonavx512-cuda

llama-cpp-python
llama-cpp
wheel
windows
cuda-12
blackwell
sm_100
sm_90
sm_89
sm_86
sm_80
sm_75
sm_72
sm_70
sm_62
sm_61
cp312
Model card Files Files and versions
xet
Community

Instructions to use trajis-tech/llama-cpp-python-trajis-tech-nonavx512-cuda with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • llama-cpp-python

    How to use trajis-tech/llama-cpp-python-trajis-tech-nonavx512-cuda with llama-cpp-python:

    # !pip install llama-cpp-python
    
    from llama_cpp import Llama
    
    llm = Llama.from_pretrained(
    	repo_id="trajis-tech/llama-cpp-python-trajis-tech-nonavx512-cuda",
    	filename="{{GGUF_FILE}}",
    )
    
    output = llm(
    	"Once upon a time,",
    	max_tokens=512,
    	echo=True
    )
    print(output)
  • Notebooks
  • Google Colab
  • Kaggle
llama-cpp-python-trajis-tech-nonavx512-cuda
479 MB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 17 commits
trajis-tech's picture
trajis-tech
Update README.md
d6dca19 verified 2 months ago
  • .gitattributes
    1.61 kB
    Upload llama_cpp_python-0.3.16-cp312-cp312-win_amd64.whl 3 months ago
  • README.md
    920 Bytes
    Update README.md 2 months ago
  • llama_cpp_python-0.3.16-cp312-cp312-win_amd64.whl
    479 MB
    xet
    Upload llama_cpp_python-0.3.16-cp312-cp312-win_amd64.whl 3 months ago