Skip to content
View srikaanthh's full-sized avatar

Highlights

  • Pro

Block or report srikaanthh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Deep-Q-Network-DQN---Reinforcement-Learning Deep-Q-Network-DQN---Reinforcement-Learning Public

    DQN agent implementation for solving the bananas environment.

    Jupyter Notebook

  2. Multi-Agent-Reinforcement-Learning Multi-Agent-Reinforcement-Learning Public

    To evaluate and improve multiple language-model agents on GSM8K-style math tasks while ensuring that the reward pipeline remains transparent and auditable.

    Jupyter Notebook

  3. codegraph codegraph Public

    Forked from colbymchenry/codegraph

    Pre-indexed code knowledge graph for Claude Code, Codex, Cursor, and OpenCode — fewer tokens, fewer tool calls, 100% local

    TypeScript

  4. contextgraph contextgraph Public

    The proliferation of Retrieval-Augmented Generation (RAG) systems has revolutionized how Large Language Models (LLMs) access and synthesize information. By grounding LLM responses in external knowl…

    Python