Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
73 commits
Select commit Hold shift + click to select a range
46140c5
latest attempt, still not functional
Neresco Apr 7, 2026
de03bee
next fix untested
Neresco Apr 8, 2026
a9514f2
next step only cpu processing and no rpc connection.
Neresco Apr 8, 2026
9024346
first working state
Neresco Apr 9, 2026
3fc9120
Update manuals for the first working state
Neresco Apr 9, 2026
a089602
Update manual and guide to port RPC again for Humans or LLM
Neresco Apr 9, 2026
5860d3a
Update with hybrid mode, client can recognize local machine models to…
Neresco Apr 10, 2026
5dfe255
Update manuals
Neresco Apr 11, 2026
9e9f31d
Implement --device rearranging like in llama.cpp and update manuals
Neresco Apr 11, 2026
95fff02
Coding style changed to Concedo style
Neresco Apr 11, 2026
c95f900
Merge pull request #1 from Neresco/concedo-style
Neresco Apr 11, 2026
894b3e4
Delete koboldcpp.py.backup
Neresco Apr 11, 2026
6f5ee5e
manual concedo style rewrite
Neresco Apr 11, 2026
c13b96a
more manual concedo style
Neresco Apr 11, 2026
81b3c43
more concedo codaing style changes
Neresco Apr 12, 2026
dbfa22b
Merge branch 'rpc-testing-function-11-04-2026' into coding-style-chan…
Neresco Apr 12, 2026
a25d67f
Merge pull request #3 from Neresco/coding-style-changes-12-04-26
Neresco Apr 12, 2026
1d75716
more coding changes functional again
Neresco Apr 12, 2026
5450a6a
Merge branch 'coding-style-changes-12-04-26' of https://github.com/Ne…
Neresco Apr 12, 2026
1598771
Merge pull request #4 from Neresco/rpc-testing-function-11-04-2026
Neresco Apr 12, 2026
7019d01
Merge pull request #5 from Neresco/coding-style-changes-12-04-26
Neresco Apr 12, 2026
0ef029c
Update koboldcpp.py
Neresco Apr 12, 2026
8f10722
Update koboldcpp.py
Neresco Apr 12, 2026
fa5d98e
restructuring
Neresco Apr 13, 2026
d09bb22
Delete test_rpc_layers.sh
Neresco Apr 13, 2026
7fabbb0
Delete build_hybrid_final.log
Neresco Apr 13, 2026
84075cc
More manuals to easier port and implement this. ROCM, CUDA Debugging …
Neresco Apr 15, 2026
61cb3a2
ROCM RPC works CUDA untested VULKAN Client + ROCM RPC Server works
Neresco Apr 16, 2026
6685f06
Rpc Server with ROCM works, VULKAN/ROCM Server with VULKAN or ROCM cl…
Neresco Apr 16, 2026
8ff2275
RPC ROCM Server implemented, Updated manuals. VULKAN/ROCM Client and …
Neresco Apr 16, 2026
a012a99
Debug RPC endpoints so all is working
Neresco Apr 17, 2026
7689b7c
More RPC debug
Neresco Apr 17, 2026
5bb0f16
More RPC debug
Neresco Apr 17, 2026
cacfb2d
Manuals Updated with AI
Neresco Apr 17, 2026
289156b
Manuals Updated with AI
Neresco Apr 17, 2026
930d9d8
GUI Support integrated, Manuals Updated
Neresco Apr 18, 2026
aaa2fe8
GUI Support integrated, Manuals Updated
Neresco Apr 18, 2026
9a27448
Merge pull request #12 from Neresco/More-Manuals-ongoing-HIP-and-CUDA…
Neresco Apr 18, 2026
431f096
More Bug fix while loading VULKAN and ROCM Solo it crashed
Neresco Apr 19, 2026
7d0ba08
More Bug fix, while loading VULKAN and ROCM Solo it crashed Should wo…
Neresco Apr 19, 2026
bb835d8
More Bug fix, while loading VULKAN and ROCM Solo it crashed Should wo…
Neresco Apr 19, 2026
65c60ce
Delete koboldcpp
Neresco Apr 19, 2026
f2b757b
Delete koboldcpp
Neresco Apr 19, 2026
168a8aa
Delete koboldcpp
Neresco Apr 19, 2026
26fde17
works again
Neresco Apr 20, 2026
f392c76
Merge branch '19_04_2026_Should_Work' of https://github.com/Neresco/k…
Neresco Apr 20, 2026
863158f
Merge pull request #15 from Neresco/19_04_2026_Should_Work
Neresco Apr 20, 2026
409b57f
Merge pull request #16 from Neresco/19_04_2026_Should_Work
Neresco Apr 20, 2026
67d1c4c
Further fix in GPU selection
Neresco Apr 20, 2026
291328c
Further fix in gpu selection
Neresco Apr 20, 2026
9b2691d
Further fix in gpu selection
Neresco Apr 20, 2026
2efa32e
More RPC GUI Fixing to set all the values someone needs
Neresco Apr 22, 2026
2008a34
Manuals Update to the last working stage with gui integration.
Neresco Apr 23, 2026
77102ec
Updated llamacpp-rpc version
Neresco Apr 24, 2026
71c2de3
Updated llamacpp-rpc version 4.0 tested with gui and cli
Neresco Apr 24, 2026
48618c1
Updated llamacpp-rpc version 4.0 tested with gui and cli
Neresco Apr 24, 2026
e9454a2
Update RPC_MANUAL.md
Neresco Apr 24, 2026
155d694
Update README.md
Neresco Apr 24, 2026
88c2481
Update README.md
Neresco Apr 24, 2026
428d5da
Update README.md
Neresco Apr 24, 2026
33526d7
Update README.md
Neresco Apr 24, 2026
e6ae94a
Update RPC_MANUAL.md
Neresco Apr 24, 2026
8d2b5ee
Update RPC_MANUAL.md
Neresco Apr 24, 2026
d82f9ae
reformatting-RPC still-functional
Neresco Apr 27, 2026
90db8a3
koboldcpp-112-2-with-RPC-29-04-2026
Neresco Apr 29, 2026
211e153
revert290426
Neresco Apr 29, 2026
fe86d4b
Merge pull request #22 from Neresco/revert290426
Neresco Apr 29, 2026
c35ba62
Merge branch 'rpc-testing-function-11-04-2026' into concedo
Neresco Apr 29, 2026
4dbfedd
more revert to a functional stage
Neresco Apr 29, 2026
b0ce22c
Merge pull request #24 from Neresco/revertfurhter
Neresco Apr 29, 2026
5e44690
Merge pull request #25 from Neresco/rpc_testing
Neresco Apr 29, 2026
af354fc
functional old
Neresco Apr 29, 2026
480bf8f
Merge branch 'rpc-testing-function-11-04-2026' of https://github.com/…
Neresco Apr 29, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
9 changes: 9 additions & 0 deletions .kilo/agent-manager.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
{
"worktrees": {},
"sessions": {},
"tabOrder": {
"local": [
"pending:1"
]
}
}
1 change: 1 addition & 0 deletions .python-version
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
3.13.9
29 changes: 16 additions & 13 deletions CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -107,6 +107,7 @@ if (LLAMA_CUBLAS)
list(APPEND GGML_SOURCES_CUDA
ggml/src/ggml-cuda/template-instances/fattn-vec-instance-f16-f16.cu
ggml/src/ggml-cuda/template-instances/fattn-vec-instance-q4_0-q4_0.cu
ggml/src/ggml-cuda/template-instances/fattn-vec-instance-q5_1-q5_1.cu
ggml/src/ggml-cuda/template-instances/fattn-vec-instance-q8_0-q8_0.cu
ggml/src/ggml-cuda/template-instances/fattn-vec-instance-bf16-bf16.cu
)
Expand Down Expand Up @@ -195,6 +196,7 @@ if (LLAMA_HIPBLAS)
list(APPEND GGML_SOURCES_ROCM
ggml/src/ggml-cuda/template-instances/fattn-vec-instance-f16-f16.cu
ggml/src/ggml-cuda/template-instances/fattn-vec-instance-q4_0-q4_0.cu
ggml/src/ggml-cuda/template-instances/fattn-vec-instance-q5_1-q5_1.cu
ggml/src/ggml-cuda/template-instances/fattn-vec-instance-q8_0-q8_0.cu
ggml/src/ggml-cuda/template-instances/fattn-vec-instance-bf16-bf16.cu
)
Expand Down Expand Up @@ -378,6 +380,7 @@ add_library(ggml
ggml/src/ggml-alloc.c
ggml/include/ggml-alloc.h
ggml/src/ggml-backend.cpp
ggml/src/ggml-backend-meta.cpp
ggml/src/ggml-backend-impl.h
ggml/include/ggml-backend.h
ggml/include/ggml-cpp.h
Expand Down Expand Up @@ -409,7 +412,7 @@ add_library(ggml
ggml/include/gguf.h
ggml/src/gguf.cpp
${GGML_SOURCES_CUDA})
target_include_directories(ggml PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor)
target_include_directories(ggml PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor/nlohmann ./vendor)
target_compile_features(ggml PUBLIC c_std_11) # don't bump
target_link_libraries(ggml PUBLIC Threads::Threads ${LLAMA_EXTRA_LIBS})
set_target_properties(ggml PROPERTIES POSITION_INDEPENDENT_CODE ON)
Expand All @@ -418,7 +421,7 @@ target_compile_options(ggml PRIVATE $<$<COMPILE_LANGUAGE:CUDA>:-use_fast_math -e
add_library(ggml_v1
otherarch/ggml_v1.c
otherarch/ggml_v1.h)
target_include_directories(ggml_v1 PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor)
target_include_directories(ggml_v1 PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor/nlohmann ./vendor)
target_compile_features(ggml_v1 PUBLIC c_std_11) # don't bump
target_link_libraries(ggml_v1 PUBLIC Threads::Threads ${LLAMA_EXTRA_LIBS})
set_target_properties(ggml_v1 PROPERTIES POSITION_INDEPENDENT_CODE ON)
Expand All @@ -428,7 +431,7 @@ add_library(ggml_v2
otherarch/ggml_v2.h
${GGML_V2_CUDA_SOURCES}
${GGML_V2_LEGACY_CUDA_SOURCES})
target_include_directories(ggml_v2 PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor)
target_include_directories(ggml_v2 PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor/nlohmann ./vendor)
target_compile_features(ggml_v2 PUBLIC c_std_11) # don't bump
target_link_libraries(ggml_v2 PUBLIC Threads::Threads ${LLAMA_EXTRA_LIBS})
set_target_properties(ggml_v2 PROPERTIES POSITION_INDEPENDENT_CODE ON)
Expand All @@ -437,7 +440,7 @@ add_library(ggml_v3
otherarch/ggml_v3.c
otherarch/ggml_v3.h
${GGML_V3_CUDA_SOURCES})
target_include_directories(ggml_v3 PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor)
target_include_directories(ggml_v3 PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor/nlohmann ./vendor)
target_compile_features(ggml_v3 PUBLIC c_std_11) # don't bump
target_link_libraries(ggml_v3 PUBLIC Threads::Threads ${LLAMA_EXTRA_LIBS})
set_target_properties(ggml_v3 PROPERTIES POSITION_INDEPENDENT_CODE ON)
Expand All @@ -464,57 +467,57 @@ add_library(common2
common/reasoning-budget.h
tools/mtmd/mtmd-audio.cpp
tools/mtmd/mtmd-audio.h)
target_include_directories(common2 PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor ./otherarch/sdcpp ./otherarch/sdcpp/thirdparty ./tools ./common)
target_include_directories(common2 PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor/nlohmann ./vendor ./otherarch/sdcpp ./otherarch/sdcpp/thirdparty ./tools ./common)
target_compile_features(common2 PUBLIC cxx_std_17) # don't bump
target_link_libraries(common2 PRIVATE ggml ${LLAMA_EXTRA_LIBS})
set_target_properties(common2 PROPERTIES POSITION_INDEPENDENT_CODE ON)

add_library(sdtype_adapter
otherarch/sdcpp/sdtype_adapter.cpp)
target_include_directories(sdtype_adapter PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor ./otherarch/sdcpp ./otherarch/sdcpp/thirdparty ./tools ./common)
target_include_directories(sdtype_adapter PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor/nlohmann ./vendor ./otherarch/sdcpp ./otherarch/sdcpp/thirdparty ./tools ./common)
target_compile_features(sdtype_adapter PUBLIC cxx_std_17) # don't bump
target_link_libraries(sdtype_adapter PRIVATE common2 ggml ${LLAMA_EXTRA_LIBS})
set_target_properties(sdtype_adapter PROPERTIES POSITION_INDEPENDENT_CODE ON)

add_library(whisper_adapter
otherarch/whispercpp/whisper_adapter.cpp)
target_include_directories(whisper_adapter PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor ./otherarch/whispercpp ./tools ./common)
target_include_directories(whisper_adapter PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor/nlohmann ./vendor ./otherarch/whispercpp ./tools ./common)
target_compile_features(whisper_adapter PUBLIC cxx_std_17) # don't bump
target_link_libraries(whisper_adapter PRIVATE common2 ggml ${LLAMA_EXTRA_LIBS})
set_target_properties(whisper_adapter PROPERTIES POSITION_INDEPENDENT_CODE ON)

add_library(music_adapter
otherarch/acestep/music_adapter.cpp)
target_include_directories(music_adapter PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor ./otherarch/acestep ./tools ./common)
target_include_directories(music_adapter PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor/nlohmann ./vendor ./otherarch/acestep ./tools ./common)
target_compile_features(music_adapter PUBLIC cxx_std_17) # don't bump
target_link_libraries(music_adapter PRIVATE common2 ggml ${LLAMA_EXTRA_LIBS})
set_target_properties(music_adapter PROPERTIES POSITION_INDEPENDENT_CODE ON)

add_library(tts_adapter
otherarch/tts_adapter.cpp)
target_include_directories(tts_adapter PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor ./otherarch/ttscpp/include ./otherarch/ttscpp/src ./otherarch/qwen3tts ./tools ./common)
target_include_directories(tts_adapter PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor/nlohmann ./vendor ./otherarch/ttscpp/include ./otherarch/ttscpp/src ./otherarch/qwen3tts ./tools ./common)
target_compile_features(tts_adapter PUBLIC cxx_std_17) # don't bump
target_link_libraries(tts_adapter PRIVATE common2 ggml ${LLAMA_EXTRA_LIBS})
set_target_properties(tts_adapter PROPERTIES POSITION_INDEPENDENT_CODE ON)

add_library(embeddings_adapter
otherarch/embeddings_adapter.cpp)
target_include_directories(embeddings_adapter PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor ./tools ./common)
target_include_directories(embeddings_adapter PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor/nlohmann ./vendor ./tools ./common)
target_compile_features(embeddings_adapter PUBLIC cxx_std_17) # don't bump
target_link_libraries(embeddings_adapter PRIVATE common2 ggml ${LLAMA_EXTRA_LIBS})
set_target_properties(embeddings_adapter PROPERTIES POSITION_INDEPENDENT_CODE ON)

add_library(gpttype_adapter
gpttype_adapter.cpp)
target_include_directories(gpttype_adapter PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor ./otherarch/sdcpp ./otherarch/sdcpp/thirdparty ./tools ./common)
target_include_directories(gpttype_adapter PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor/nlohmann ./vendor ./otherarch/sdcpp ./otherarch/sdcpp/thirdparty ./tools ./common)
target_compile_features(gpttype_adapter PUBLIC cxx_std_17) # don't bump
target_link_libraries(gpttype_adapter PRIVATE common2 ggml ggml_v1 ggml_v2 ggml_v3 ${LLAMA_EXTRA_LIBS})
set_target_properties(gpttype_adapter PROPERTIES POSITION_INDEPENDENT_CODE ON)

if (LLAMA_CUBLAS)
set(TARGET koboldcpp_cublas)
add_library(${TARGET} SHARED expose.cpp expose.h)
target_include_directories(${TARGET} PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor ./otherarch/sdcpp ./otherarch/sdcpp/thirdparty ./tools ./common)
target_include_directories(${TARGET} PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor/nlohmann ./vendor ./otherarch/sdcpp ./otherarch/sdcpp/thirdparty ./tools ./common)
target_compile_features(${TARGET} PUBLIC cxx_std_17) # don't bump
set_target_properties(${TARGET} PROPERTIES PREFIX "")
set_target_properties(${TARGET} PROPERTIES OUTPUT_NAME "koboldcpp_cublas")
Expand All @@ -534,7 +537,7 @@ endif()
if (LLAMA_HIPBLAS)
set(TARGET koboldcpp_hipblas)
add_library(${TARGET} SHARED expose.cpp expose.h)
target_include_directories(${TARGET} PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor ./otherarch/sdcpp ./otherarch/sdcpp/thirdparty ./tools ./common)
target_include_directories(${TARGET} PUBLIC . ./ggml/include ./ggml/src ./ggml/src/ggml-cpu ./include ./otherarch ./otherarch/tools ./vendor/stb ./vendor/nlohmann ./vendor ./otherarch/sdcpp ./otherarch/sdcpp/thirdparty ./tools ./common)
target_compile_features(${TARGET} PUBLIC cxx_std_17) # don't bump
set_target_properties(${TARGET} PROPERTIES PREFIX "")
set_target_properties(${TARGET} PROPERTIES OUTPUT_NAME "koboldcpp_hipblas")
Expand Down
Loading