Web hosted client (API key based access) #480
atomic wants to merge 8 commits into NVIDIA:release/26.02 from
Conversation
📝 Walkthrough

This PR introduces a web-hosted cuOpt client interface with API key authentication for cloud deployment, refactors C++ linear programming code to explicitly pass problem handles, adds memory exception handling and pointer mode configuration, updates documentation, and includes comprehensive tests for the new client interface.

Changes

Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~50 minutes

🚥 Pre-merge checks: ✅ 3 | ❌ 2

❌ Failed checks (2 warnings)
✅ Passed checks (3 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing touches
🧪 Generate unit tests (beta)
Tip: Issue Planner is now in beta. Read the docs and try it out! Share your feedback on Discord.
/ok to test e595eab

@CodeRabbit review and summarize status
Tip: For best results, initiate chat on the files or code changes.

🧩 Analysis chain

🏁 Script executed: (length of output: 5220)
🏁 Script executed: (length of output: 61283)

@rgsl888prabhu I'll review the PR and provide a status summary.

📊 PR Status Summary

Current State: OPEN (Merge Blocked - Waiting for CI checks)

CI Checks Status:
Changes: 4 files, +527/-10 lines
🔍 Code Review

✅ Strengths
🔧 Suggestions for Improvement
📝 Minor Observations
🎯 Overall Assessment

Code Quality: ⭐⭐⭐⭐½ (4.5/5)
Readiness:
Recommendation: Once CI checks pass (check-style and wheel builds), this PR should be ready to merge pending maintainer review of the architectural approach. Would you like me to elaborate on any of the suggestions or run additional analysis on specific aspects of the code?
Gentle ping @atomic |
<!-- Thank you for contributing to cuOpt :) Here are some guidelines to help the review process go smoothly. Many thanks in advance for your cooperation! Note: The pull request title will be included in the CHANGELOG. -->

## Description

Update introduction to use engine instead of library

## Checklist

- [x] I am familiar with the [Contributing Guidelines](https://github.com/NVIDIA/cuopt/blob/HEAD/CONTRIBUTING.md).
- Testing
  - [ ] New or existing tests cover these changes
  - [ ] Added tests
  - [ ] Created an issue to follow-up
  - [x] NA
- Documentation
  - [ ] The documentation is up to date with these changes
  - [x] Added new documentation
  - [ ] NA
Hot fix with the following commits:

- NVIDIA#541
- NVIDIA#544
- NVIDIA#550
🔔 Hi @anandhkb @atomic, this pull request has had no activity for 7 days. Please update or let us know if it can be closed. Thank you! If this is an "epic" issue, then please add the "epic" label to this issue.
2 similar comments
Moving this out of the 25.12 release. If this is incorrect, please bring it back in.
🔔 Hi @anandhkb @atomic, this pull request has had no activity for 7 days. Please update or let us know if it can be closed. Thank you! If this is an "epic" issue, then please add the "epic" label to this issue.
5 similar comments
@tmckayus for viz!
🔔 Hi @anandhkb @atomic, this pull request has had no activity for 7 days. Please update or let us know if it can be closed. Thank you! If this is an "epic" issue, then please add the "epic" label to this issue.
1 similar comment
acknowledge, will rebase and re-work on the MR |
e595eab to fc8dea3
remove the parameter bearer token as it can be inferred
provide the example as gist instead
Remove suspicious hostname pattern validation that was overly heuristic-based. Simplify invalid URL detection to rely on urlparse hostname check. Fix Black/isort/flake8 formatting issues. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Tony Salim <tsalim@nvidia.com>
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Tony Salim <tsalim@nvidia.com>
fc8dea3 to 6212169
Actionable comments posted: 5
🧹 Nitpick comments (4)
benchmarks/linear_programming/utils/get_datasets.py (1)
695-707: `and False` is a poor mechanism to disable code — prefer removing or gating it properly.

This creates permanently dead code that will confuse future readers and may silently break the netlib dataset pipeline. Netlib-type archives are decompressed but never converted to `.mps` via `emps`, yet `download_dataset` (line 715) checks for `{name}.mps` existence — so netlib datasets may fail to be recognized as already downloaded on re-runs, or consumers expecting `.mps` files won't find them.

Consider one of:

- Remove the dead block entirely if `emps` is no longer needed.
- Gate it behind an explicit CLI flag or environment variable if it's meant to be re-enabled later.
- At minimum, add a `return` or `raise` for netlib type so the broken state is explicit.

Option 1: Remove dead code
```diff
-    # download emps and compile
-    # Disable emps for now
-    if type == "netlib" and False:
-        url = MittelmannInstances["emps"]
-        file = os.path.join(dir, "emps.c")
-        download(url, file)
-        subprocess.run(f"cd {dir} && gcc -Wno-implicit-int emps.c -o emps",
-                       shell=True)
-        # determine output file and run emps
-        subprocess.run(f"cd {dir} && ./emps {unzippedfile} > {outfile}",
-                       shell=True)
-        # cleanup emps and emps.c
-        subprocess.run(f"rm -rf {dir}/emps*",
-                       shell=True)
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@benchmarks/linear_programming/utils/get_datasets.py` around lines 695 - 707, Remove the dead "if type == 'netlib' and False:" block and any references to the disabled emps conversion (emps.c, MittelmannInstances['emps'], running ./emps) from get_datasets.py; if netlib entries must produce .mps files, implement a clear path in the download_dataset logic (e.g., in download_dataset or the calling code) to either generate .mps externally or mark netlib datasets as already-unconvertible so the "{name}.mps" existence check is consistent—alternatively, replace the removed block with an explicit, documented gate (ENV flag or CLI option) if you intend to optionally re-enable emps in future.

python/cuopt_self_hosted/cuopt_sh_client/cuopt_web_hosted_client.py (2)
119-126: Add `stacklevel=2` to `warnings.warn`.

Without `stacklevel=2`, the warning points to this internal method rather than the caller's code, making it harder for users to locate the source.

🔧 Fix

```diff
 warnings.warn(
     "No protocol specified in endpoint"
     f" '{endpoint}', assuming https://",
     UserWarning,
+    stacklevel=2,
 )
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@python/cuopt_self_hosted/cuopt_sh_client/cuopt_web_hosted_client.py` around lines 119 - 126, The warnings.warn call that normalizes the endpoint protocol should include stacklevel=2 so the warning points to the caller instead of this helper; in cuopt_web_hosted_client.py update the warnings.warn invocation that warns about a missing protocol for variable endpoint to pass stacklevel=2 (e.g., warnings.warn(..., UserWarning, stacklevel=2)) so users see the correct call site.
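For reference, the effect of `stacklevel` is easy to demonstrate in isolation. The sketch below is not code from this PR; the `normalize` helper is a hypothetical stand-in for the client's endpoint normalization, used only to show that `stacklevel=2` attributes the warning to the caller's line rather than to the helper itself:

```python
import warnings


def normalize(endpoint: str) -> str:
    # With stacklevel=2 the reported warning location is the line that
    # called normalize(), not this warnings.warn() line.
    if "://" not in endpoint:
        warnings.warn(
            f"No protocol specified in endpoint '{endpoint}', assuming https://",
            UserWarning,
            stacklevel=2,
        )
        return "https://" + endpoint
    return endpoint


with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    result = normalize("api.example.com")  # warning is attributed to this call site

print(result)       # https://api.example.com
print(len(caught))  # 1
```

Running this under `python -W always` shows the warning pointing at the `normalize("api.example.com")` call, which is exactly the behavior the review asks for.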
59-66: `endpoint` typed as `Optional[str]` but always required.

The constructor immediately raises if `endpoint` is falsy. Making the type non-optional (just `str`) would better communicate intent and let type checkers catch misuse.
Verify each finding against the current code and only fix it if needed. In `@python/cuopt_self_hosted/cuopt_sh_client/cuopt_web_hosted_client.py` around lines 59 - 66, The __init__ signature incorrectly types endpoint as Optional[str] while the constructor requires it; change the parameter annotation and default to a required str (e.g., endpoint: str) and remove the default None so callers must pass it, updating the signature in cuopt_web_hosted_client.__init__ (and any related docstrings or type hints in that file) to reflect a non-optional endpoint.

python/cuopt_self_hosted/cuopt_sh_client/cuopt_self_host_client.py (1)
404-422: Missing default timeout in `_make_http_request`.

Ruff S113 flags `requests.request` without a timeout. While all current callers in this file pass `timeout=` via kwargs, there's no safety net if a future caller omits it. Consider adding a default timeout as a fallback.

🛡️ Suggested defensive fix

```diff
 def _make_http_request(self, method: str, url: str, **kwargs):
     ...
+    kwargs.setdefault("timeout", self.http_general_timeout)
     return requests.request(method, url, **kwargs)
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@python/cuopt_self_hosted/cuopt_sh_client/cuopt_self_host_client.py` around lines 404 - 422, The _make_http_request method calls requests.request without ensuring a timeout, so add a defensive default timeout fallback (e.g., DEFAULT_TIMEOUT = 10) and ensure kwargs includes it before calling requests.request; update _make_http_request to set kwargs.setdefault('timeout', DEFAULT_TIMEOUT) (or similar) so callers that forget to pass timeout still get a safe default while preserving any caller-supplied timeout.
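The `setdefault` pattern suggested above can be verified without a network. This standalone sketch fakes the transport; `DEFAULT_TIMEOUT` and `fake_send` are illustrative and are not the client's real names or values:

```python
DEFAULT_TIMEOUT = 30  # illustrative fallback, not the client's actual default


def make_http_request(send, method: str, url: str, **kwargs):
    # Callers that forget timeout= still get a safe default,
    # while an explicitly supplied timeout is preserved untouched.
    kwargs.setdefault("timeout", DEFAULT_TIMEOUT)
    return send(method, url, **kwargs)


def fake_send(method, url, **kwargs):
    # Stand-in for requests.request that just reports the timeout it received.
    return kwargs["timeout"]


print(make_http_request(fake_send, "GET", "https://example.com"))             # 30
print(make_http_request(fake_send, "GET", "https://example.com", timeout=5))  # 5
```

Because `setdefault` only writes the key when it is absent, this is strictly a safety net: no existing caller's behavior changes.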
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@cpp/src/linear_programming/solve.cu`:
- Around line 671-681: dual_simplex_problem is being constructed with the same
barrier_handle used by the barrier thread, causing shared cuBLAS/cuSPARSE
handles; create a dedicated stream and raft::handle_t for the dual‑simplex
thread and pass that handle into cuopt_problem_to_simplex_problem when building
dual_simplex_problem (e.g., create a new rmm::cuda_stream_view/raft::handle_t
named dual_stream/dual_handle and call
cuopt_problem_to_simplex_problem(&dual_handle, problem)); ensure the original
barrier_handle continues to be used for barrier_problem/any allocations meant
for the original stream so each thread has its own handle and no cuBLAS/cuSPARSE
handles are shared.
In `@docs/cuopt/source/introduction.rst`:
- Line 5: Add a new short "Web-hosted client & API key authentication" section
to the user-facing docs (e.g., cuopt-python/quick-start.rst and the server
quick-start) that describes the hosted endpoint URL pattern, how to provide the
API key via the CUOPT_API_KEY environment variable or an Authorization: Bearer
<key> header, and the expected 401/403 behaviors on invalid/missing keys; ensure
the text references the CUOPT_API_KEY env var and the client/server usage
patterns so users know where to configure the key and what errors to expect.
In `@docs/cuopt/source/system-requirements.rst`:
- Line 59: The line "NVIDIA H100 SXM (compute capability >= 9.0) and above" is
ambiguous and redundant; remove the trailing "and above" or rephrase to make
clear whether you mean newer GPU models or a compute capability threshold—for
example change it to either "NVIDIA H100 SXM (compute capability >= 9.0)" or
"NVIDIA H100 SXM and newer (compute capability >= 9.0)"; update the sentence
containing the exact phrase to use one of these clearer options.
In `@python/cuopt_self_hosted/cuopt_sh_client/cuopt_web_hosted_client.py`:
- Around line 196-214: The auth error handling in _make_http_request currently
raises ValueError for 401/403 and bypasses parent handlers like
_send_request/_poll_request that attach reqId; modify _make_http_request to
extract a request id (e.g. req_id = kwargs.get("json", {}).get("reqId") or
kwargs.get("params", {}).get("reqId") or headers.get("X-Request-Id")) and
include it in the raised error message (or raise a small custom exception that
stores req_id) so that callers (_send_request, _poll_request) and logs will
contain the reqId context when authentication fails.
In `@python/cuopt_self_hosted/tests/test_web_hosted_client.py`:
- Around line 89-94: The test relies on CuOptServiceWebHostedClient picking up
CUOPT_API_KEY from the environment, so make the test explicitly unset that env
var before constructing the client; use pytest's monkeypatch (e.g.,
monkeypatch.delenv("CUOPT_API_KEY", raising=False)) or temporarily pop
os.environ in the test before calling CuOptServiceWebHostedClient(...) and then
call client._get_auth_headers() to assert headers are empty, ensuring the env
var is restored/isolated by using the fixture so no global state leaks between
tests.
---
Nitpick comments:
In `@benchmarks/linear_programming/utils/get_datasets.py`:
- Around line 695-707: Remove the dead "if type == 'netlib' and False:" block
and any references to the disabled emps conversion (emps.c,
MittelmannInstances['emps'], running ./emps) from get_datasets.py; if netlib
entries must produce .mps files, implement a clear path in the download_dataset
logic (e.g., in download_dataset or the calling code) to either generate .mps
externally or mark netlib datasets as already-unconvertible so the "{name}.mps"
existence check is consistent—alternatively, replace the removed block with an
explicit, documented gate (ENV flag or CLI option) if you intend to optionally
re-enable emps in future.
In `@python/cuopt_self_hosted/cuopt_sh_client/cuopt_self_host_client.py`:
- Around line 404-422: The _make_http_request method calls requests.request
without ensuring a timeout, so add a defensive default timeout fallback (e.g.,
DEFAULT_TIMEOUT = 10) and ensure kwargs includes it before calling
requests.request; update _make_http_request to set kwargs.setdefault('timeout',
DEFAULT_TIMEOUT) (or similar) so callers that forget to pass timeout still get a
safe default while preserving any caller-supplied timeout.
In `@python/cuopt_self_hosted/cuopt_sh_client/cuopt_web_hosted_client.py`:
- Around line 119-126: The warnings.warn call that normalizes the endpoint
protocol should include stacklevel=2 so the warning points to the caller instead
of this helper; in cuopt_web_hosted_client.py update the warnings.warn
invocation that warns about a missing protocol for variable endpoint to pass
stacklevel=2 (e.g., warnings.warn(..., UserWarning, stacklevel=2)) so users see
the correct call site.
- Around line 59-66: The __init__ signature incorrectly types endpoint as
Optional[str] while the constructor requires it; change the parameter annotation
and default to a required str (e.g., endpoint: str) and remove the default None
so callers must pass it, updating the signature in
cuopt_web_hosted_client.__init__ (and any related docstrings or type hints in
that file) to reflect a non-optional endpoint.
```diff
   // Initialize the dual simplex structures before we run PDLP.
   // Otherwise, CUDA API calls to the problem stream may occur in both threads and throw graph
   // capture off
-  auto barrier_handle = raft::handle_t(*op_problem.get_handle_ptr());
-  detail::problem_t<i_t, f_t> d_barrier_problem(problem);
   rmm::cuda_stream_view barrier_stream = rmm::cuda_stream_per_thread;
-  d_barrier_problem.handle_ptr = &barrier_handle;
-  raft::resource::set_cuda_stream(barrier_handle, barrier_stream);
+  auto barrier_handle = raft::handle_t(barrier_stream);
   // Make sure allocations are done on the original stream
   problem.handle_ptr->sync_stream();

   dual_simplex::user_problem_t<i_t, f_t> dual_simplex_problem =
-    cuopt_problem_to_simplex_problem<i_t, f_t>(d_barrier_problem);
+    cuopt_problem_to_simplex_problem<i_t, f_t>(&barrier_handle, problem);
   // Create a thread for dual simplex
```
🧩 Analysis chain
🏁 Script executed:
# First, let's look at the actual code context at lines 671-681
cat -n cpp/src/linear_programming/solve.cu | sed -n '665,690p'

Repository: NVIDIA/cuopt
Length of output: 1512
🏁 Script executed:
# Locate user_problem_t definition and check handle_ptr usage
rg -n "struct\s+user_problem_t|class\s+user_problem_t" cpp/src/dual_simplex -A5 -B2

Repository: NVIDIA/cuopt
Length of output: 619
🏁 Script executed:
# Check where handle_ptr is used in dual_simplex components
rg -n "handle_ptr" cpp/src/dual_simplex --type cpp --type hpp --type cu -B2 -A2

Repository: NVIDIA/cuopt
Length of output: 83
🏁 Script executed:
# Check for cuBLAS/cuSPARSE handle usage
rg -n "get_cublas_handle|get_cusparse_handle|cublasHandle|cusparseHandle" cpp/src/dual_simplex --type cpp --type hpp --type cu -B2 -A2

Repository: NVIDIA/cuopt
Length of output: 83
🏁 Script executed:
# Search for barrier_problem creation and initialization
rg -n "barrier_problem" cpp/src/linear_programming/solve.cu -B3 -A3

Repository: NVIDIA/cuopt
Length of output: 749
🏁 Script executed:
# Check how handle_ptr is used in solver execution
rg -n "handle_ptr" cpp/src/dual_simplex -B2 -A2

Repository: NVIDIA/cuopt
Length of output: 39273
🏁 Script executed:
# Look at run_dual_simplex_thread implementation
rg -n "run_dual_simplex_thread" cpp/src/linear_programming -B2 -A10

Repository: NVIDIA/cuopt
Length of output: 2306
🏁 Script executed:
# Check the run_concurrent function to understand thread creation
rg -n "run_concurrent" cpp/src/linear_programming/solve.cu -B5 -A20

Repository: NVIDIA/cuopt
Length of output: 2532
Create dedicated raft::handle_t per solver thread to avoid shared cuBLAS/cuSPARSE handles.
Both dual_simplex_problem and barrier_problem reference the same barrier_handle (line 691 copies dual_simplex_problem), so the barrier and dual‑simplex threads share the same cuBLAS/cuSPARSE handles obtained via handle_ptr->get_cublas_handle() and handle_ptr->get_cusparse_handle(). These handles are not thread‑safe, and concurrent use will race and corrupt solver state.
Create a separate stream and handle for the dual‑simplex thread:
rmm::cuda_stream_view barrier_stream = rmm::cuda_stream_per_thread;
auto barrier_handle = raft::handle_t(barrier_stream);
+ rmm::cuda_stream_view dual_simplex_stream = rmm::cuda_stream_per_thread;
+ auto dual_simplex_handle = raft::handle_t(dual_simplex_stream);
// Make sure allocations are done on the original stream
problem.handle_ptr->sync_stream();
dual_simplex::user_problem_t<i_t, f_t> dual_simplex_problem =
- cuopt_problem_to_simplex_problem<i_t, f_t>(&barrier_handle, problem);
+ cuopt_problem_to_simplex_problem<i_t, f_t>(&dual_simplex_handle, problem);
// Create a thread for dual simplex
dual_simplex::user_problem_t<i_t, f_t> barrier_problem = dual_simplex_problem;
+ barrier_problem.handle_ptr = &barrier_handle;

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@cpp/src/linear_programming/solve.cu` around lines 671 - 681,
dual_simplex_problem is being constructed with the same barrier_handle used by
the barrier thread, causing shared cuBLAS/cuSPARSE handles; create a dedicated
stream and raft::handle_t for the dual‑simplex thread and pass that handle into
cuopt_problem_to_simplex_problem when building dual_simplex_problem (e.g.,
create a new rmm::cuda_stream_view/raft::handle_t named dual_stream/dual_handle
and call cuopt_problem_to_simplex_problem(&dual_handle, problem)); ensure the
original barrier_handle continues to be used for barrier_problem/any allocations
meant for the original stream so each thread has its own handle and no
cuBLAS/cuSPARSE handles are shared.
```diff
 ==========================

-**NVIDIA® cuOpt™** is a GPU-accelerated optimization library that solves `Mixed Integer Linear Programming (MILP) <https://en.wikipedia.org/wiki/Linear_programming#Integer_unknowns>`_, `Linear Programming (LP) <https://en.wikipedia.org/wiki/Linear_programming>`_, and `Vehicle Routing Problems (VRP) <https://en.wikipedia.org/wiki/Vehicle_routing_problem>`_. It enables solutions for large-scale problems with millions of variables and constraints, offering seamless deployment across hybrid and multi-cloud environments.
+**NVIDIA® cuOpt™** is a GPU-accelerated optimization engine that solves `Mixed Integer Linear Programming (MILP) <https://en.wikipedia.org/wiki/Linear_programming#Integer_unknowns>`_, `Linear Programming (LP) <https://en.wikipedia.org/wiki/Linear_programming>`_, and `Vehicle Routing Problems (VRP) <https://en.wikipedia.org/wiki/Vehicle_routing_problem>`_. It enables solutions for large-scale problems with millions of variables and constraints, offering seamless deployment across hybrid and multi-cloud environments.
```
Document the new web-hosted client and API key auth.
This PR adds a web‑hosted cuOpt client and API‑key authentication, but this doc change only tweaks terminology. Please add a short section describing web‑hosted usage (endpoint URL, API key/CUOPT_API_KEY, and 401/403 behavior) in the user‑facing docs (e.g., docs/cuopt/source/cuopt-python/quick-start.rst and the server quick‑start), or the appropriate pages.
As per coding guidelines: "Update documentation in docs/ for API changes (new parameters, return values, error codes), new public functions/classes, and changed algorithm behavior".
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@docs/cuopt/source/introduction.rst` at line 5, Add a new short "Web-hosted
client & API key authentication" section to the user-facing docs (e.g.,
cuopt-python/quick-start.rst and the server quick-start) that describes the
hosted endpoint URL pattern, how to provide the API key via the CUOPT_API_KEY
environment variable or an Authorization: Bearer <key> header, and the expected
401/403 behaviors on invalid/missing keys; ensure the text references the
CUOPT_API_KEY env var and the client/server usage patterns so users know where
to configure the key and what errors to expect.
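The key-resolution behavior the docs should describe can be sketched in a few lines. This is an illustrative pattern only, not the client's actual implementation: `get_auth_headers` is a hypothetical stand-in, while `CUOPT_API_KEY` and the `Authorization: Bearer <key>` header are the mechanisms this PR documents:

```python
import os


def get_auth_headers(api_key=None):
    # An explicitly passed key wins; otherwise fall back to the
    # CUOPT_API_KEY environment variable. No key means no auth header.
    key = api_key or os.getenv("CUOPT_API_KEY")
    return {"Authorization": f"Bearer {key}"} if key else {}


os.environ.pop("CUOPT_API_KEY", None)
print(get_auth_headers())               # {}

os.environ["CUOPT_API_KEY"] = "sk-test"
print(get_auth_headers())               # {'Authorization': 'Bearer sk-test'}
print(get_auth_headers(api_key="abc"))  # {'Authorization': 'Bearer abc'}
```

A docs section built around this pattern would then only need to add the hosted endpoint URL format and the 401/403 responses returned for missing or invalid keys.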
```diff
 * GPU:
-  - NVIDIA H100 SXM (compute capability >= 9.0)
+  - NVIDIA H100 SXM (compute capability >= 9.0) and above
```
Clarify ambiguous "and above" phrasing.
The phrase "compute capability >= 9.0" already implies "9.0 and above," so adding "and above" after the parenthetical is redundant and creates ambiguity. It's unclear whether "and above" refers to GPU models (H100 and newer) or compute capability (already stated as >= 9.0).
📝 Suggested clearer alternatives
Choose one of these clearer phrasings:
Option 1: Emphasize GPU models
- - NVIDIA H100 SXM (compute capability >= 9.0) and above
+ - NVIDIA H100 SXM or newer (compute capability >= 9.0)

Option 2: Lead with compute capability

- - NVIDIA H100 SXM (compute capability >= 9.0) and above
+ - GPUs with compute capability >= 9.0 (e.g., NVIDIA H100 SXM or newer)

Option 3: Keep it simple (original was clear)

- - NVIDIA H100 SXM (compute capability >= 9.0) and above
+ - NVIDIA H100 SXM (compute capability >= 9.0)

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```diff
- - NVIDIA H100 SXM (compute capability >= 9.0) and above
+ - NVIDIA H100 SXM or newer (compute capability >= 9.0)
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@docs/cuopt/source/system-requirements.rst` at line 59, The line "NVIDIA H100
SXM (compute capability >= 9.0) and above" is ambiguous and redundant; remove
the trailing "and above" or rephrase to make clear whether you mean newer GPU
models or a compute capability threshold—for example change it to either "NVIDIA
H100 SXM (compute capability >= 9.0)" or "NVIDIA H100 SXM and newer (compute
capability >= 9.0)"; update the sentence containing the exact phrase to use one
of these clearer options.
```python
# Add authentication headers
headers = kwargs.get("headers", {}).copy()
headers.update(self._get_auth_headers())
kwargs["headers"] = headers

# Make request
response = requests.request(method, url, **kwargs)

# Handle authentication errors
if response.status_code == 401:
    raise ValueError(
        "Authentication failed. Please check your API key."
    )
elif response.status_code == 403:
    raise ValueError(
        "Access forbidden. Please check your permissions."
    )

return response
```
Auth errors bypass parent's _handle_request_exception context.
When _make_http_request raises ValueError on 401/403, it short-circuits the parent's error handling in methods like _send_request and _poll_request which would normally include the reqId in the error message. Users debugging failed requests won't see which request ID failed.
Consider including request context in the auth error messages, or letting callers handle 401/403 more gracefully.
🧰 Tools
🪛 Ruff (0.15.1)
[error] 202-202: Probable use of requests call without timeout
(S113)
[warning] 206-208: Avoid specifying long messages outside the exception class
(TRY003)
[warning] 210-212: Avoid specifying long messages outside the exception class
(TRY003)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@python/cuopt_self_hosted/cuopt_sh_client/cuopt_web_hosted_client.py` around
lines 196 - 214, The auth error handling in _make_http_request currently raises
ValueError for 401/403 and bypasses parent handlers like
_send_request/_poll_request that attach reqId; modify _make_http_request to
extract a request id (e.g. req_id = kwargs.get("json", {}).get("reqId") or
kwargs.get("params", {}).get("reqId") or headers.get("X-Request-Id")) and
include it in the raised error message (or raise a small custom exception that
stores req_id) so that callers (_send_request, _poll_request) and logs will
contain the reqId context when authentication fails.
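One way to carry the request id through the failure, as the comment suggests, is a small custom exception. The sketch below is a standalone illustration of the idea; `CuOptAuthError` and `check_auth` are hypothetical names, not part of the PR:

```python
class CuOptAuthError(ValueError):
    """Auth failure that carries the request id for log correlation."""

    def __init__(self, message, req_id=None):
        self.req_id = req_id
        suffix = f" (reqId: {req_id})" if req_id else ""
        super().__init__(message + suffix)


def check_auth(status_code, **kwargs):
    # Pull a request id from wherever the caller may have supplied it.
    req_id = (
        (kwargs.get("json") or {}).get("reqId")
        or (kwargs.get("params") or {}).get("reqId")
        or (kwargs.get("headers") or {}).get("X-Request-Id")
    )
    if status_code == 401:
        raise CuOptAuthError(
            "Authentication failed. Please check your API key.", req_id
        )
    if status_code == 403:
        raise CuOptAuthError(
            "Access forbidden. Please check your permissions.", req_id
        )


try:
    check_auth(401, params={"reqId": "abc-123"})
except CuOptAuthError as e:
    print(e)         # Authentication failed. Please check your API key. (reqId: abc-123)
    print(e.req_id)  # abc-123
```

Because the subclass still derives from `ValueError`, existing callers that catch `ValueError` keep working, while callers that want the id can read `e.req_id` directly.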
```python
# Test no authentication
client = CuOptServiceWebHostedClient(
    endpoint="https://api.nvidia.com/cuopt/v1"
)
headers = client._get_auth_headers()
assert len(headers) == 0
```
Test is not isolated from the CUOPT_API_KEY environment variable.
If CUOPT_API_KEY is set in the environment (e.g., CI), this "no authentication" test will fail because the constructor falls back to os.getenv("CUOPT_API_KEY"). Patch the env to ensure the variable is unset.
🔧 Fix
```diff
+@patch.dict(os.environ, {}, clear=True)
 def test_authentication_from_parameters(self):
     """Test authentication setup from parameters."""
     # Test API key (sent as Bearer token)
```

Or more targeted:
# Test no authentication
- client = CuOptServiceWebHostedClient(
- endpoint="https://api.nvidia.com/cuopt/v1"
- )
- headers = client._get_auth_headers()
- assert len(headers) == 0
+ with patch.dict(os.environ, {"CUOPT_API_KEY": ""}, clear=False):
+ os.environ.pop("CUOPT_API_KEY", None)
+ client = CuOptServiceWebHostedClient(
+ endpoint="https://api.nvidia.com/cuopt/v1"
+ )
+ headers = client._get_auth_headers()
+ assert len(headers) == 0

As per coding guidelines, "Ensure test isolation: prevent GPU state, cached memory, and global variables from leaking between test cases; verify each test independently initializes its environment."
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@python/cuopt_self_hosted/tests/test_web_hosted_client.py` around lines 89 -
94, The test relies on CuOptServiceWebHostedClient picking up CUOPT_API_KEY from
the environment, so make the test explicitly unset that env var before
constructing the client; use pytest's monkeypatch (e.g.,
monkeypatch.delenv("CUOPT_API_KEY", raising=False)) or temporarily pop
os.environ in the test before calling CuOptServiceWebHostedClient(...) and then
call client._get_auth_headers() to assert headers are empty, ensuring the env
var is restored/isolated by using the fixture so no global state leaks between
tests.
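The isolation fix above can be demonstrated with stdlib tooling alone. In this sketch, `FakeClient` is an illustrative stand-in for `CuOptServiceWebHostedClient`'s env-var fallback (not the real class), and `unittest.mock.patch.dict` plays the role the review also suggests for pytest's `monkeypatch`:

```python
import os
from unittest.mock import patch


class FakeClient:
    # Illustrative stand-in: mirrors the described fallback to CUOPT_API_KEY.
    def __init__(self, endpoint, api_key=None):
        self.endpoint = endpoint
        self.api_key = api_key or os.getenv("CUOPT_API_KEY")

    def _get_auth_headers(self):
        return {"Authorization": f"Bearer {self.api_key}"} if self.api_key else {}


# Isolate from any CUOPT_API_KEY set in the surrounding environment (e.g. CI);
# patch.dict restores the original environment when the block exits.
with patch.dict(os.environ, {}, clear=True):
    no_auth = FakeClient("https://api.nvidia.com/cuopt/v1")._get_auth_headers()

with patch.dict(os.environ, {"CUOPT_API_KEY": "sk-test"}, clear=True):
    with_auth = FakeClient("https://api.nvidia.com/cuopt/v1")._get_auth_headers()

print(no_auth)    # {}
print(with_auth)  # {'Authorization': 'Bearer sk-test'}
```

Either `patch.dict` or `monkeypatch.delenv("CUOPT_API_KEY", raising=False)` gives the same guarantee: the "no authentication" test passes regardless of what the host environment exports.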
Description

- `CuOptServiceWebHostedClient` for connecting to web-hosted cuOpt services (e.g. NVIDIA API endpoints) with endpoint URL and API key authentication
- `create_client()` factory function that returns the appropriate client based on parameters
- `CuOptServiceSelfHostClient` extended with endpoint URL parsing, auth headers, and URL construction for web-hosted APIs

Issue

Closes #479
Tested with NVIDIA's Build.com CuOpt model
Test script (excluded from the PR as it is an adhoc script): https://gist.github.com/atomic/9c761bd4d64d62bc039ecf88fe28fca0#file-test_cuopt_nvidia_build_com-py
export CUOPT_API_KEY=...
python3 test_cuopt_nvidia_build_com.py

Checklist
Summary by CodeRabbit
New Features
Bug Fixes
Documentation
Tests