Launch A Server
Launch the server in your terminal and wait for it to initialize. Remember to add--is-embedding to the command.
Using cURL
Using Python Requests
Using OpenAI Python Client
Using Input IDs
SGLang also supportsinput_ids as input to get the embedding.
