Elastic has announced the availability of its Elastic Inference Service (EIS) via Cloud Connect for self-managed Elasticsearch deployments. This enables organizations to access cloud-hosted, GPU-powered inference for semantic search and embeddings—including models from Jina.ai—without the complexity of managing GPU infrastructure or moving their core data off-premises.
Elastic delivers GPU inference to self-managed customers via Cloud Connect.
It provides access to cloud-hosted embedding and reranking models from Jina.ai.
Customers maintain data and core infrastructure on-premises.
The service eliminates the operational overhead of managing GPU hardware.
It enables rapid deployment of semantic search capabilities.
Available immediately for Enterprise customers on Elastic Stack 9.3.
This launch addresses a key challenge for organizations with self-managed Elasticsearch clusters: accessing the computational power required for modern AI search without undergoing a disruptive migration to the cloud. EIS via Cloud Connect allows these customers to keep their sensitive data and existing architecture in place while securely offloading the resource-intensive tasks of embedding generation and search inference to Elastic's managed GPU fleet in the cloud.
Semantic search, which relies on high-quality vector embeddings for accurate results, typically requires GPU infrastructure for optimal performance. By providing on-demand access to leading models, including those from Jina.ai, Elastic enables self-managed teams to implement these advanced capabilities quickly. This removes the significant barriers of procuring, configuring, and maintaining specialized hardware, allowing teams to focus on building search experiences rather than managing infrastructure.
The service is part of Elastic's broader strategy to make AI and advanced search more accessible. "We’re making it easier for self-managed customers to adopt semantic search without taking on the complexity of GPU infrastructure," said Steve Kearns, GM of Search at Elastic. This approach allows organizations to incrementally adopt AI-powered search, leveraging the cloud for compute-intensive services while maintaining control over their data residency and core systems, a critical requirement for many regulated industries.
Available immediately for Enterprise customers on version 9.3, EIS via Cloud Connect represents a hybrid model that combines the control of self-management with the scalability and advanced capabilities of the cloud.
About Elastic
Elastic, the Search AI Company, integrates its deep expertise in search technology with artificial intelligence to help everyone transform all of their data into answers, actions, and outcomes. Elastic's Search AI Platform — the foundation for its search, observability, and security solutions — is used by thousands of companies, including more than 50% of the Fortune 500.