
Local LLM Testing & Benchmarking for Apple Silicon

Swift 177 10 Updated Jun 6, 2026

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

171,501 17,502 Updated Apr 20, 2026

Open-source framework for superagents.

Python 83 4 Updated May 26, 2026

Visualizer for neural network, deep learning and machine learning models

JavaScript 33,053 3,126 Updated Jun 8, 2026

258 KB WASM runtime for Needle, a 26M-parameter tool-calling transformer. Runs in the browser, Cloudflare Workers, and Node.js. No backend required.

Rust 37 5 Updated May 20, 2026

26M-parameter function-calling model that runs on incredibly small devices

Python 2,586 174 Updated May 16, 2026

The bastard son between Cursor and Obsidian

TypeScript 686 69 Updated Jun 7, 2026
TypeScript 330 44 Updated May 15, 2026

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

Python 16,268 1,388 Updated Jun 9, 2026

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

C 8,397 449 Updated Jun 2, 2026

Open source repository of plugins primarily intended for knowledge workers to use in Claude Cowork

Python 19,787 2,365 Updated Jun 8, 2026

EdgeQuake 🌋 High-performance GraphRAG inspired by LightRAG, written in Rust. Transforms documents into intelligent knowledge graphs for superior retrieval and generation

Rust 2,000 229 Updated Jun 9, 2026

A complete GPT language model (training and inference) in ~600 lines of pure C#, zero dependencies

C# 398 48 Updated Feb 14, 2026

✨ Light and fast AI assistant. Supports: Web | iOS | macOS | Android | Linux | Windows

TypeScript 88,197 59,615 Updated May 15, 2026

Strix Halo Installation notes and container Dockerfiles

Shell 1 Updated Feb 4, 2026

Interactive launcher and benchmarking harness for llama.cpp server throughput, with tests, sweeps, and round‑robin load tools.

Python 410 62 Updated Feb 8, 2026

A user-friendly GUI for llama.cpp — convert, quantize, and run GGUF models without touching the terminal.

Python 21 2 Updated Jun 7, 2026

Fine-tune and run LLMs locally on your M-series Mac. A powerful desktop interface built on Apple's MLX framework for zero-setup AI.

TypeScript 268 17 Updated Jan 10, 2026

Extract and convert data from any document, image, PDF, Word document, PowerPoint file, or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.

Python 1,483 132 Updated Oct 31, 2025

A modern desktop application for exploring, managing, and analyzing vector databases

TypeScript 242 16 Updated Jun 8, 2026

Comprehensive resource on Generative AI security — prompt injection, jailbreaking, RAG security, red teaming, guardrails, and more. One folder per topic.

Python 8 2 Updated Jan 7, 2026

Ontology- and knowledge-graph-based RAG that uses a local LLM.

Python 27 4 Updated Jan 3, 2026

Public roadmap for Inferencer. Ideas, feature requests, and bug reports are all welcome.

49 2 Updated May 12, 2026
TypeScript 20 1 Updated Jan 3, 2026

Local modular AI assistant with speech, vision, and robotics support. Uses Qwen3-VL-4B in LM Studio.

Python 54 6 Updated Jan 9, 2026

Docker configuration for running vLLM on dual DGX Sparks

Shell 1,556 280 Updated Jun 8, 2026

A C#/.NET library to run LLMs (🦙LLaMA/LLaVA) on your local device efficiently.

C# 3,711 497 Updated Jun 1, 2026