apache_beam.ml.transforms.embeddings.open_ai module
- class apache_beam.ml.transforms.embeddings.open_ai.OpenAITextEmbeddings(model_name: str, columns: list[str], api_key: str | None = None, organization: str | None = None, dimensions: int | None = None, user: str | None = None, max_batch_size: int | None = None, **kwargs)
Bases: EmbeddingsManager
Embedding configuration for OpenAI text embedding models. Embeddings are generated for batches of text by calling the OpenAI API.
- Parameters:
model_name – Name of the OpenAI embedding model
columns – The columns where the embeddings will be stored in the output
api_key – OpenAI API key
organization – OpenAI organization ID
dimensions – Number of dimensions for the output embeddings (only if the model supports it)
user – End-user identifier for tracking and rate limit calculations
max_batch_size – Maximum batch size for requests to OpenAI API
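To make the effect of max_batch_size concrete, here is a minimal sketch of capping how many texts go into a single request. The helper `chunk_texts` is hypothetical and written only for illustration; Beam's own batching logic is not shown on this page and may behave differently (for example, by choosing batch sizes dynamically below the cap).

```python
from typing import Iterator, Sequence


def chunk_texts(texts: Sequence[str], max_batch_size: int) -> Iterator[list[str]]:
    """Yield consecutive batches holding at most max_batch_size texts each.

    Hypothetical helper: it only illustrates the idea of an upper bound on
    batch size and is not part of apache_beam.
    """
    if max_batch_size < 1:
        raise ValueError("max_batch_size must be at least 1")
    for start in range(0, len(texts), max_batch_size):
        # Each slice becomes the payload of one (imagined) API request.
        yield list(texts[start:start + max_batch_size])


batches = list(chunk_texts(["a", "b", "c", "d", "e"], max_batch_size=2))
print(batches)
```

With five texts and a cap of two, the last batch is smaller than the cap, so the cap is an upper bound and not a fixed size.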
- get_model_handler() → RemoteModelHandler
- get_ptransform_for_processing(**kwargs) → PTransform
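A usage sketch may help show how this configuration is typically wired into a pipeline. It assumes the usual `MLTransform(...).with_transform(...)` pattern from `apache_beam.ml.transforms.base`, an input of dictionaries with a `text` column, and the `text-embedding-3-small` model; the helper names `openai_embedding_kwargs` and `run_pipeline`, the column name, and the chosen values are illustrative assumptions, not part of this module. Running the pipeline makes real OpenAI API calls and needs a valid key.

```python
import os
import tempfile


def openai_embedding_kwargs(api_key=None):
    """Build constructor arguments mirroring the parameters documented above.

    All values here are illustrative assumptions.
    """
    return {
        "model_name": "text-embedding-3-small",  # assumed model choice
        "columns": ["text"],                     # assumed column name
        # Fall back to the conventional environment variable if no key is given.
        "api_key": api_key or os.environ.get("OPENAI_API_KEY"),
        "dimensions": 256,
        "max_batch_size": 16,
    }


def run_pipeline(rows, api_key=None):
    """Embed the 'text' column of each row and print the resulting rows."""
    # Imported lazily so the helper above can be used without Beam installed.
    import apache_beam as beam
    from apache_beam.ml.transforms.base import MLTransform
    from apache_beam.ml.transforms.embeddings.open_ai import OpenAITextEmbeddings

    embedding_config = OpenAITextEmbeddings(**openai_embedding_kwargs(api_key))
    with beam.Pipeline() as pipeline:
        _ = (
            pipeline
            | "CreateRows" >> beam.Create(rows)
            | "Embed" >> MLTransform(
                write_artifact_location=tempfile.mkdtemp()
            ).with_transform(embedding_config)
            | "Print" >> beam.Map(print)
        )
```

Calling `run_pipeline([{"text": "Apache Beam is a unified programming model."}])` would then send the text to OpenAI and print the row with its embedding, provided the key is valid and the `openai` package is installed.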