InvokeLLM with True Streaming (displaying parts of the answer as they are generated)

The InvokeLLM integration, as currently implemented, returns the complete answer all at once, so the response time feels long to users. Please review and design a response method for InvokeLLM that supports true streaming (displaying parts of the answer as they are generated), so that users can see the answer to their question as it is being generated. This would make their experience much better.
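
To illustrate the request, here is a minimal TypeScript sketch of how a streaming call might be consumed on the client side. Both `invokeLLMStream` and `streamAnswer` are hypothetical names made up for this example; they are not part of the current InvokeLLM integration, and the canned "Echo" answer only stands in for real model output.

```typescript
// Hypothetical sketch: none of these names exist in the real InvokeLLM integration.

// Stand-in for a streaming variant of InvokeLLM that yields the answer in pieces.
async function* invokeLLMStream(prompt: string): AsyncGenerator<string> {
  // A real implementation would forward chunks from the model provider.
  // Here a canned answer is split into words to imitate that behavior.
  const answer = `Echo: ${prompt}`;
  for (const word of answer.split(" ")) {
    await Promise.resolve(); // placeholder for waiting on the next chunk
    yield word + " ";
  }
}

// Consumer: reports every partial answer through onUpdate, then returns the full text.
async function streamAnswer(
  prompt: string,
  onUpdate: (partial: string) => void,
): Promise<string> {
  let text = "";
  for await (const chunk of invokeLLMStream(prompt)) {
    text += chunk;
    onUpdate(text); // a UI would re-render the partial answer here
  }
  return text.trimEnd();
}
```

With something like this, the app could call `streamAnswer(question, setAnswerText)` and show the text growing as chunks arrive, instead of waiting for the whole response.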


Status

In Review

Board
💡

Feature Request

Date

5 months ago

Author

ηŽ‹ζ€ι 
