# Inference of the llama2 model on AMD GPUs
- Install the dependencies.
- Clone the repository.
- Build the project:

```shell
cd hip_llama.cpp
make
```
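Before building, it can help to confirm that the ROCm/HIP toolchain is available. A minimal sketch, assuming a standard ROCm installation that places `hipcc` on the `PATH`:

```shell
# Check for the HIP compiler before running make (assumes a standard ROCm install)
if command -v hipcc >/dev/null 2>&1; then
  HIP_OK=yes
  hipcc --version   # prints the installed HIP/ROCm compiler version
else
  HIP_OK=no
  echo "hipcc not found: install ROCm before building"
fi
```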
- Usage:

```shell
./build/apps/llama model.bin -m test -f <input_filename> -o <output_filename>
```
- Example of running llama2 inference:

```shell
./build/apps/llama /shared/erc/getpTA/main/modelbin/stories110M.bin -m test -f assets/in/gen_in_128.txt -o assets/out/gen_out_128.txt
```
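Several inputs can be processed in one pass with a small loop. A sketch, assuming prompt files named `gen_in_128.txt`, `gen_in_256.txt`, and `gen_in_512.txt` (only the 128 file appears in the example above; the others are hypothetical). The commands are echoed rather than executed so they can be reviewed first:

```shell
# Batch several inference runs; file names other than gen_in_128.txt are assumptions
MODEL=/shared/erc/getpTA/main/modelbin/stories110M.bin
for n in 128 256 512; do
  CMD="./build/apps/llama $MODEL -m test -f assets/in/gen_in_${n}.txt -o assets/out/gen_out_${n}.txt"
  echo "$CMD"   # echoed for review; run with: eval "$CMD"
done
```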
- Not available yet.
- If you have issues or feature requests, please open an issue or email one of the contributors below.
Full Name | Email |
---|---|
Pham Manh Tien | [email protected] |
Nguyen Huy Hoang | [email protected] |
Nguyen Xuan Anh | [email protected] |