@Jeffwan
Collaborator

@Jeffwan Jeffwan commented Nov 9, 2025

Pull Request Description

[Please provide a clear and concise description of your changes here]

Related Issues

Resolves: #[Insert issue number(s)]


@gemini-code-assist
Contributor

Summary of Changes

Hello @Jeffwan, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request focuses on refining the Aibrix project's documentation and examples to ensure clarity, stability, and correct functionality. It primarily involves updating component image references to the latest stable release, integrating a critical configuration step for Envoy Gateway, and making minor adjustments to quickstart guides and sample configurations for improved user experience and operational correctness.

Highlights

  • Version Updates: Updated all Aibrix component image tags within the stable.yaml Helm chart and the vke/model.yaml sample to the stable v0.5.0 release, moving away from release candidate versions.
  • Envoy Gateway Configuration: Enhanced the installation documentation to include a crucial step for patching the Envoy Gateway configuration to enable EnvoyPatchPolicy, which is vital for proper gateway functionality.
  • Sample Model Labels: Added a model.aibrix.ai/port: "8000" label to the model.yaml quickstart sample, improving service discovery and routing for deployed models.
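
One way to confirm that a version bump like this leaves no release-candidate tags behind is to scan the manifests for `image:` and `tag:` lines. The sketch below assumes release-candidate tags contain `-rc` (for example `v0.5.0-rc.1`); the function name `find_rc_images` and that tag convention are illustrative assumptions, not something taken from the PR.

```shell
#!/usr/bin/env bash
# Hypothetical helper: list image/tag lines whose value looks like a release candidate.
# Assumes RC tags contain "-rc" (e.g. v0.5.0-rc.1); adjust the pattern if tags differ.
find_rc_images() {
  # $1: path to a YAML manifest or Helm values file
  # Matches "image: ..." or "tag: ..." keys (optionally list items) with "-rc" in the value.
  grep -nE '^[[:space:]]*(-[[:space:]]*)?(image|tag):[[:space:]]*[^#]*-rc' "$1"
}

# Example (the file path is an assumption):
#   find_rc_images stable.yaml || echo "no RC tags found"
```

grep exits non-zero when nothing matches, so the function can gate a CI step directly.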

Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request primarily focuses on improving documentation and examples, alongside updating image tags to the v0.5.0 release. The changes are beneficial and enhance clarity for users. I've identified a couple of areas for improvement: one regarding the method for configuring envoy-gateway to use a more Helm-native approach for better maintainability, and another to ensure consistency across API endpoint examples in the quickstart guide.

Comment on lines +46 to +63
# patch the configuration to enable EnvoyPatchPolicy, this is super important!
kubectl apply -f - <<EOF
apiVersion: v1
kind: ConfigMap
metadata:
  name: envoy-gateway-config
  namespace: envoy-gateway-system
data:
  envoy-gateway.yaml: |
    apiVersion: gateway.envoyproxy.io/v1alpha1
    kind: EnvoyGateway
    provider:
      type: Kubernetes
    gateway:
      controllerName: gateway.envoyproxy.io/gatewayclass-controller
    extensionApis:
      enableEnvoyPatchPolicy: true
EOF
Contributor

high

Using kubectl apply to modify a resource managed by Helm can lead to configuration drift and potential issues during chart upgrades, as Helm may not be aware of these manual changes. A more idiomatic and safer approach is to configure envoy-gateway using Helm values during installation.

The envoy-gateway Helm chart likely supports enabling EnvoyPatchPolicy via a value. The installation command could be updated to pass this setting directly. For example:

helm install eg oci://docker.io/envoyproxy/gateway-helm --version v1.2.8 -n envoy-gateway-system --create-namespace \
  --set envoyGateway.extensionApis.enableEnvoyPatchPolicy=true

Please verify the exact value path from the envoy-gateway Helm chart's values.yaml and update the documentation to use the --set flag or a values file. This will make the installation process more robust and align with Helm best practices.

Collaborator Author

This is not supported yet; it is tracked in envoyproxy/gateway#7458.
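
Since the flag is applied by overwriting the ConfigMap rather than through Helm values, it may be worth verifying that the setting is actually present afterwards. A minimal sketch, assuming the ConfigMap name and namespace from the snippet in this thread; the helper names `patch_policy_enabled` and `check_cluster` are hypothetical:

```shell
#!/usr/bin/env bash
# Returns 0 when the EnvoyGateway config read from stdin enables EnvoyPatchPolicy.
patch_policy_enabled() {
  grep -qE '^[[:space:]]*enableEnvoyPatchPolicy:[[:space:]]*true[[:space:]]*$'
}

# Hypothetical wrapper around a live cluster; defined here but not called.
# ConfigMap name and namespace are taken from the snippet in this thread.
check_cluster() {
  kubectl get configmap envoy-gateway-config -n envoy-gateway-system \
    -o jsonpath='{.data.envoy-gateway\.yaml}' | patch_policy_enabled
}
```

Envoy Gateway reads this file at startup, so the controller typically needs a restart before the change takes effect, for example `kubectl rollout restart deployment/envoy-gateway -n envoy-gateway-system` (assuming the default deployment name).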

  # list models
- curl -v http://${ENDPOINT}/v1/models
+ curl -v http://${ENDPOINT}/v1/models/
Contributor

medium

You've added a trailing slash to the /v1/models endpoint URL. For consistency, and in case it's needed to avoid redirects, consider adding a trailing slash to the other API endpoint examples in this document as well.

For example:

  • http://${ENDPOINT}/v1/completions should become http://${ENDPOINT}/v1/completions/
  • http://${ENDPOINT}/v1/chat/completions should become http://${ENDPOINT}/v1/chat/completions/

Applying this change consistently across all example curl commands would improve the documentation's clarity and correctness.

Collaborator Author

/v1/models/ is a little bit different.
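
Whether a trailing slash matters depends on how the gateway routes each path, which this thread does not spell out. A minimal sketch for checking it empirically, comparing the HTTP status of both URL forms without following redirects; the helper names `with_trailing_slash` and `http_status` are hypothetical:

```shell
#!/usr/bin/env bash
# Hypothetical helper: append a trailing slash to a URL unless it already ends with one.
with_trailing_slash() {
  case "$1" in
    */) printf '%s\n' "$1" ;;
    *)  printf '%s/\n' "$1" ;;
  esac
}

# Print the HTTP status code for a URL without following redirects.
# -s silences progress, -o /dev/null discards the body, -w prints the status code.
http_status() {
  curl -s -o /dev/null -w '%{http_code}\n' "$1"
}

# Example, assuming ENDPOINT is set as in the quickstart:
#   http_status "http://${ENDPOINT}/v1/models"
#   http_status "$(with_trailing_slash "http://${ENDPOINT}/v1/models")"
```

A 3xx status on one form and a 200 on the other would indicate that the endpoint redirects, which is the case the review comment was concerned about.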

@Jeffwan Jeffwan merged commit 45ada88 into vllm-project:main Nov 9, 2025
4 checks passed
@Jeffwan Jeffwan deleted the jiaxin/release-0.5 branch November 9, 2025 00:14