GGUF · English · llama.cpp · conversational
gustavokuklinski committed · Commit cb0fca0 · verified · Parent(s): a130207

Update README.md

Files changed (1)
  1. README.md +42 -1

README.md CHANGED
@@ -64,4 +64,45 @@ docker run -it --rm -p 7860:7860 -v "$(pwd):/app" aeon
  | OS | CPU | GPU | RAM |
  |:---|:---|:---|:---|
  | Ubuntu 24.04.2 LTS | Intel i7-10510U | Intel CometLake-U GT2 | 16GB |
- | Windows 11 Home Edition | Intel i7-10510U | Intel CometLake-U GT2 | 8GB |
+ | Windows 11 Home Edition | Intel i7-10510U | Intel CometLake-U GT2 | 8GB |
+
+
+ ## Use with llama.cpp
+ Install llama.cpp through brew (works on Mac and Linux):
+
+ ```bash
+ brew install llama.cpp
+ ```
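+
+ To confirm the install, you can print the build info (a quick check, assuming brew placed the binaries on your PATH):
+ ```bash
+ # sanity check: prints the llama.cpp build number and commit
+ llama-cli --version
+ ```
+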
+ Invoke the llama.cpp server or the CLI.
+
+ ### CLI:
+ ```bash
+ llama-cli --hf-repo gustavokuklinski/aeon-GGUF --hf-file aeon-360M.Q8_0.gguf -p "What is a virtual species?"
+ ```
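+
+ Since the model is tagged conversational, you may prefer an interactive chat session instead of a one-shot prompt. A minimal sketch using llama-cli's `-cnv` flag:
+ ```bash
+ # -cnv starts an interactive chat, using the model's chat template if one is embedded in the GGUF
+ llama-cli --hf-repo gustavokuklinski/aeon-GGUF --hf-file aeon-360M.Q8_0.gguf -cnv
+ ```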
+
+ ### Server:
+ ```bash
+ llama-server --hf-repo gustavokuklinski/aeon-GGUF --hf-file aeon-360M.Q8_0.gguf -c 2048
+ ```
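+
+ Once the server is running, you can send requests to its OpenAI-compatible chat endpoint, for example with curl (a minimal sketch assuming the default `localhost:8080` address; adjust if you passed `--host`/`--port`):
+ ```bash
+ # ask the running llama-server a question via its OpenAI-compatible API
+ curl http://localhost:8080/v1/chat/completions \
+   -H "Content-Type: application/json" \
+   -d '{"messages": [{"role": "user", "content": "What is a virtual species?"}]}'
+ ```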
+
+ Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the llama.cpp repo.
+
+ Step 1: Clone llama.cpp from GitHub.
+ ```bash
+ git clone https://github.com/ggerganov/llama.cpp
+ ```
+
+ Step 2: Move into the llama.cpp folder and build it with the `LLAMA_CURL=1` flag, along with any other hardware-specific flags (for example, `LLAMA_CUDA=1` for Nvidia GPUs on Linux).
+ ```bash
+ cd llama.cpp && LLAMA_CURL=1 make
+ ```
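+
+ For instance, a parallel CUDA-enabled build might look like this (a sketch assuming an Nvidia GPU and the CUDA toolkit are available; note that newer llama.cpp releases have since moved from Make to CMake builds):
+ ```bash
+ # example: CUDA build using all CPU cores (assumes the CUDA toolkit is installed)
+ LLAMA_CURL=1 LLAMA_CUDA=1 make -j$(nproc)
+ ```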
+
+ Step 3: Run inference through the main binary.
+ ```bash
+ ./llama-cli --hf-repo gustavokuklinski/aeon-GGUF --hf-file aeon-360M.Q8_0.gguf -p "What is a virtual species?"
+ ```
+ or
+ ```bash
+ ./llama-server --hf-repo gustavokuklinski/aeon-GGUF --hf-file aeon-360M.Q8_0.gguf -c 2048
+ ```
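+
+ If you have already downloaded the GGUF, you can also point the binary at the local file instead of the Hugging Face repo (a sketch assuming the file sits in the current directory):
+ ```bash
+ # -m points at a local GGUF file instead of fetching from the Hugging Face Hub
+ ./llama-cli -m aeon-360M.Q8_0.gguf -p "What is a virtual species?"
+ ```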