Vector Databases You Can Choose From

Open Source For You | July 2023

Vector databases offer efficiency and scalability, and are transforming the way we harness the potential of embedding data in the digital age. There are quite a few open source vector databases that come with their own benefits. We take a quick look at them.

- Mohit Raj

In natural language processing (NLP), an embedding is a representation of text in the form of vectors. The goal of an embedding is to capture the semantic meaning of words or documents in a way that can be understood by a machine learning model.

A vector database (or an embedding database) in NLP is a specialised database designed to efficiently store, retrieve, and perform operations on high-dimensional vector data (such as the embeddings mentioned above). Vector databases are optimised to perform nearest neighbour search operations efficiently, which is a common requirement in NLP applications. They provide a way of organising and searching through large amounts of embedding data, which can be beneficial in various tasks like information retrieval, document similarity, clustering, and others.

As an example, let’s say you’ve embedded a large number of documents using a Doc2Vec model. Now, given a new document, you want to find the most similar documents in your database. To do this, you would:

1. First, embed the new document into the same high-dimensional space.

2. Next, search the vector database for the vectors closest to the new document’s vector. This is the nearest neighbour search.
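The two steps above can be sketched in a few lines of plain Python. This is only a minimal illustration: `toy_embed` is a hypothetical stand-in for a trained Doc2Vec model (it hashes words into a fixed-size vector), and the document texts, `DIM`, and the function names are assumptions made for the example. The search itself is brute force, scoring every stored vector against the query.

```python
import math
import zlib

DIM = 64  # assumed embedding size for this toy example


def toy_embed(text, dim=DIM):
    """Hypothetical stand-in for Doc2Vec: hashed bag-of-words, L2-normalised."""
    vec = [0.0] * dim
    for token in text.lower().split():
        # crc32 is deterministic across runs, unlike Python's built-in hash().
        vec[zlib.crc32(token.encode("utf-8")) % dim] += 1.0
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


def cosine(a, b):
    """Cosine similarity of two unit-length vectors (a plain dot product)."""
    return sum(x * y for x, y in zip(a, b))


def nearest(query_vec, stored, k=2):
    """Brute-force nearest neighbour search: score everything, keep the top k."""
    scored = [(cosine(query_vec, vec), doc_id) for doc_id, vec in stored.items()]
    scored.sort(reverse=True)
    return [(doc_id, score) for score, doc_id in scored[:k]]


documents = {
    "doc_a": "vector databases store embeddings for similarity search",
    "doc_b": "the football match ended in a draw last night",
    "doc_c": "nearest neighbour search finds similar embeddings quickly",
}

# Embed the corpus once and keep the vectors (the "database").
stored_vectors = {doc_id: toy_embed(text) for doc_id, text in documents.items()}

# Step 1: embed the new document into the same space.
query = toy_embed("similarity search over embeddings")

# Step 2: look up the closest stored vectors.
results = nearest(query, stored_vectors, k=2)
for doc_id, score in results:
    print(doc_id, round(score, 3))
```

Every query here touches every stored vector, so the cost grows linearly with the size of the collection.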

Because the data is high-dimensional, this search can be computationally intensive. Vector databases therefore use specialised indexing and querying algorithms (such as k-d trees, ball trees, or hashing techniques) to speed up these operations. Libraries that implement such indexes include FAISS, developed by Facebook AI, and Annoy, developed by Spotify.
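To give a feel for how a hashing technique can cut down the work, here is a minimal sketch of one such approach, random-hyperplane locality-sensitive hashing. It is not taken from any particular database or library; the class name `HyperplaneLSH`, the parameters (`n_planes`, `DIM`, the seeds), and the random test vectors are all assumptions made for illustration. Each vector is reduced to a short signature recording which side of each random hyperplane it falls on, and a query is compared only against vectors that share its signature.

```python
import math
import random
from collections import defaultdict


def normalise(vec):
    """Scale a non-zero vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


class HyperplaneLSH:
    """Toy index: vectors whose signs agree on every random hyperplane share a bucket."""

    def __init__(self, dim, n_planes=8, seed=0):
        rng = random.Random(seed)
        self.planes = [[rng.gauss(0.0, 1.0) for _ in range(dim)]
                       for _ in range(n_planes)]
        self.buckets = defaultdict(list)

    def _signature(self, vec):
        # One boolean per hyperplane: which side of the plane the vector lies on.
        return tuple(dot(plane, vec) >= 0.0 for plane in self.planes)

    def add(self, item_id, vec):
        self.buckets[self._signature(vec)].append((item_id, vec))

    def candidates(self, vec):
        """Only the vectors in the query's own bucket are examined."""
        return self.buckets.get(self._signature(vec), [])

    def query(self, vec, k=1):
        ranked = sorted(self.candidates(vec),
                        key=lambda item: dot(item[1], vec), reverse=True)
        return [item_id for item_id, _ in ranked[:k]]


# Hypothetical data: 1,000 random unit vectors in 16 dimensions.
rng = random.Random(42)
DIM = 16
vectors = {i: normalise([rng.gauss(0.0, 1.0) for _ in range(DIM)])
           for i in range(1000)}

index = HyperplaneLSH(DIM)
for item_id, vec in vectors.items():
    index.add(item_id, vec)

probe = vectors[123]
top = index.query(probe, k=3)
scanned = len(index.candidates(probe))
print("top matches:", top, "| vectors scanned:", scanned, "of", len(vectors))
```

The trade-off is that results become approximate: a true neighbour that lands in a different bucket is missed. Practical systems usually reduce that risk by combining several hash tables or by probing nearby buckets.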

Open source vector databases 

Weaviate:
