Research – Veso AI

Agent Foundations in AI Alignment: A Technical Overview

Jun 27, 2024

—

by

The concept of Agent Foundations addresses critical challenges in AI alignment research, focusing on developing robust mathematical formulations for key concepts in artificial intelligence.This approach aims to create AI systems that reliably adhere to human values and intentions, even under intense optimization pressures. Problem Statement Naive approaches to AI alignment, such as training systems to…

AI Model Quantization: Leaner, faster, cheaper inference

Feb 26, 2024

—

by

Publisher

in Research

AI Model Quantization: Leaner, faster, cheaper inference. Transforming high-precision weights into compact, efficient formats can revolutionize speed, reduce costs, and maintain accuracy. – At its core, model quantization is a technique aimed at reducing the size of a deep learning model. This is achieved by adjusting the model’s weights from high-precision formats (like 32-bit floats)…

Reducing LLM utilization, cost and latency with RAG

Feb 19, 2024

—

by

Publisher

in Research

With RAG ( retrieval augmented generation ), only the most relevant context is added to construct a prompt payload for a language model. This means that, in most cases, your inference will be cheaper, faster, and more accurate, especially when dealing with multiple files, data lakes, and sources.

What is Retrieval Augmented Generation?

Feb 5, 2024

—

by

Publisher

in Research

RAG (Retrieval Augmented Generation) is a technique that enhances the contextual awareness of large language models by feeding them information relevant to your prompt. This method proves more effective for ensuring relevance than traditional fine-tuning techniques. Imagine you have hundreds of documents, many of which reference a specific technique, such as engineering an airplane wing.…

Efficiently Extracting Data from Documents for Use with RAG

Feb 2, 2024

—

by

Publisher

in Research

Overview A new methodology leveraging advanced parsing strategies has been introduced to enhance the efficiency and accuracy of data extraction from various document formats. This method is supported by the latest advancements in parsing research, designed to optimize data for use in Retrieval-Augmented Generation (RAG) applications. PDF Parser Guidelines HTML Parser Guidelines Text Parser Guidelines…

Retrieval Augmented Generation: Accelerating Real-World Task Performance

Jan 26, 2024

—

by

Publisher

in Research

RAG combines the generative prowess of models like GPT with the precision of information retrieval, providing outputs that understand the context better than fine-tuning alone. Overview At Veso AI, we are proud to present our in-depth exploration of Retrieval Augmented Generation (RAG) and its significant impact on real workplace environments, particularly when handling critical tasks.…

Key AI Developments to Watch in 2024

Jan 26, 2024

—

by

Publisher

in Research

1. Retrieval Augmented Generation 2. Holistic AI Solutions 3. Quantized AI Models 4. AI Management Hubs 5. AI Governance Focus 6. AI Legislation Challenges 7. Robust AI Evaluation 8. Diverse AI Model Applications 9. Multimodal AI Growth 10. Orchestrated AI Interactions 11. Multi-agent AI Dynamics 12. Real-Time Model Analytics 13. LLMs in Physical Spaces 14.…

Generative AI Systems on Closed vs. Open Networks: A Comparative Analysis

Jan 25, 2024

—

by

Publisher

in Research

Overview In the rapidly evolving field of artificial intelligence, the dichotomy between closed and open networks is a subject of considerable debate. This article presents a comparative analysis of generative AI systems within these contrasting environments, with particular emphasis on the merits of data sovereignty and security offered by closed networks. Our investigation encompasses the…

Exploring an open source, local, semantic search powered file browser

Jan 24, 2024

—

by

Publisher

in Research

There is a rapidly growing demand in the market for open source, localised GenAI applications. Partially driven by: We are exploring an open source port of our proprietary toolchain which aims to run on an operating system layer, across all unstructured documents, to allow for immediate, secure, semantic search capabilities. This implementation remedies a lot…

Understanding Reciprocal Rank Fusion (RRF) for Semantic Hybrid Search

Jan 23, 2024

—

by

Publisher

in Research

Hybrid search is an innovative technique that blends the precision of full-text search with the nuanced understanding of vector search to provide well-rounded search results. One of the simplest libraries can be found here: https://github.com/Raudaschl/rag-fusion By executing a single query that incorporates both search and vector query parameters in parallel, and scoring the combined results…

Category: Research