Integrate GPT 4o without TTS/STT #210
Labels
enhancement
New feature or request
Comments
I'm also interested in this question.
What about response time?
I know, I know :) The OpenAI APIs are not available yet:
Plus, the Communication Services APIs cannot yet be used with a raw audio stream. If you have ideas, don't hesitate!
Audio streaming is now available with Communication Services!
The OpenAI GPT 4o model supports text, image, and audio as both input and output. Its understanding is finer than with the usual STT > model > TTS approach, because the model has direct access to the user's behavior, emotions, etc.
Is there a way to use Communication Services to receive the raw audio stream, bypassing the STT step?
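To make the request concrete, here is a minimal sketch of the two pipeline shapes being compared. Every name in it (`cascaded_pipeline`, `direct_pipeline`, and the `stt`, `model`, `tts`, `audio_model` callables) is a hypothetical placeholder, not a Communication Services or OpenAI API call; it only illustrates where the STT and TTS steps drop out.

```python
from typing import Callable, Iterable, Iterator

# Hypothetical type alias: one chunk of raw audio from the call.
AudioChunk = bytes


def cascaded_pipeline(
    audio_in: Iterable[AudioChunk],
    stt: Callable[[AudioChunk], str],
    model: Callable[[str], str],
    tts: Callable[[str], AudioChunk],
) -> Iterator[AudioChunk]:
    """STT > model > TTS: the model only ever sees transcribed text."""
    for chunk in audio_in:
        text = stt(chunk)      # tone and emotion are lost at this step
        reply = model(text)    # text-only reasoning
        yield tts(reply)       # synthesize the reply back to audio


def direct_pipeline(
    audio_in: Iterable[AudioChunk],
    audio_model: Callable[[AudioChunk], AudioChunk],
) -> Iterator[AudioChunk]:
    """Raw audio goes straight to an audio-capable model, with no STT or TTS step."""
    for chunk in audio_in:
        yield audio_model(chunk)
```

In the direct shape, the only requirement on the telephony side is access to the raw audio stream in both directions, which is what this issue asks for.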