With the number of new subnets being added, it can be hard to keep information current across all of them, so some data may be slightly out of date from time to time.
This subnet is dedicated to advancing Retrieval-Augmented Generation (RAG) by encouraging the development and provision of advanced chunking solutions. Its goal is to create, host, and deploy a smart chunking system that enhances similarity within chunks and maximizes dissimilarity between chunks.
VectorChat is building a vertically integrated solution: as both a consumer and a leading provider of intelligent Retrieval-Augmented Generation, it creates the full demand loop for the Chunking subnet.
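To make that objective concrete, here is a minimal sketch of how "high similarity within chunks, high dissimilarity between chunks" could be scored. The hashed bag-of-words `embed` is a toy stand-in for a trained sentence-embedding model, and the score itself is illustrative, not the subnet's actual reward function:

```python
import numpy as np

def embed(text: str, dim: int = 64) -> np.ndarray:
    """Toy hashed bag-of-words embedding, for illustration only.
    A real system would use a trained sentence-embedding model."""
    v = np.zeros(dim)
    for tok in text.lower().split():
        v[hash(tok) % dim] += 1.0
    norm = np.linalg.norm(v)
    return v / norm if norm else v

def cos(a: np.ndarray, b: np.ndarray) -> float:
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom else 0.0

def chunk_quality(chunks: list[list[str]]) -> float:
    """Mean cosine similarity of sentence pairs inside each chunk,
    minus mean cosine similarity between chunk centroids."""
    embs = [[embed(s) for s in chunk] for chunk in chunks]
    intra = [cos(a, b)
             for chunk in embs
             for i, a in enumerate(chunk)
             for b in chunk[i + 1:]]
    centroids = [np.mean(chunk, axis=0) for chunk in embs]
    inter = [cos(centroids[i], centroids[j])
             for i in range(len(centroids))
             for j in range(i + 1, len(centroids))]
    return (float(np.mean(intra)) if intra else 0.0) - \
           (float(np.mean(inter)) if inter else 0.0)
```

A good chunker pushes this kind of score up by keeping each chunk about one topic while separating unrelated topics into different chunks.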
Chunking involves breaking data into smaller, manageable “chunks” to facilitate processing and analysis. This method is crucial in natural language processing (NLP) and is especially beneficial for large language models (LLMs). Chunking can involve segmenting content such as dividing an article into sections, a screenplay into scenes, or a musical recording into movements.
Why Use Chunking?
For LLMs to deliver accurate responses, they must access relevant information. When the needed information exceeds the model’s training data, it must be included in the query to avoid inaccuracies. However, including the entire dataset with each query is impractical due to inference costs.
Chunking addresses this by dividing data into smaller, meaningful chunks, which are then converted into vectors and stored in a vector database. When a query is made, it is also converted into a vector, and the system retrieves the most relevant chunks from the database. This approach allows the model to process only the pertinent sections of text, significantly reducing the number of tokens processed per query.
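Below is a minimal sketch of that flow, with brute-force cosine search standing in for a real vector database; the hashed bag-of-words embedding is again a toy placeholder for a trained model:

```python
import numpy as np

def embed(text: str, dim: int = 64) -> np.ndarray:
    """Toy hashed bag-of-words embedding; a stand-in for a trained model."""
    v = np.zeros(dim)
    for tok in text.lower().split():
        v[hash(tok) % dim] += 1.0
    norm = np.linalg.norm(v)
    return v / norm if norm else v

class VectorStore:
    """Brute-force stand-in for a real vector database."""
    def __init__(self) -> None:
        self.chunks: list[str] = []
        self.vectors: list[np.ndarray] = []

    def add(self, chunk: str) -> None:
        self.chunks.append(chunk)
        self.vectors.append(embed(chunk))

    def top_k(self, query: str, k: int = 3) -> list[str]:
        q = embed(query)
        # vectors are unit-norm, so the dot product is cosine similarity
        sims = np.array([q @ v for v in self.vectors])
        return [self.chunks[i] for i in np.argsort(sims)[::-1][:k]]
```

Only the k best-matching chunks are passed to the LLM, so the number of tokens processed per query stays roughly flat as the corpus grows.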
Benefits of Chunking
Chunking makes querying more efficient and cost-effective by letting the model focus on only the relevant portions of text. It is a vital preprocessing step for many machine learning tasks that involve large datasets.
What is Retrieval-Augmented Generation (RAG)?
Retrieval-Augmented Generation (RAG) is a framework that enhances the performance of a Generative AI application’s large language model (LLM) by supplying it with the most relevant and contextually important proprietary, private, or dynamic data. This architecture improves the model’s accuracy and effectiveness in performing various tasks.
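As a sketch of the "augmented" step, the retrieved chunks are simply spliced into the prompt sent to the model before generation. The prompt template and example below are illustrative, not a prescribed format:

```python
def build_rag_prompt(question: str, retrieved_chunks: list[str]) -> str:
    """Splice retrieved context into the prompt sent to the LLM."""
    context = "\n\n".join(retrieved_chunks)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )

# Example: the chunks would normally come from a vector-store lookup.
prompt = build_rag_prompt(
    "When was the bridge completed?",
    ["The bridge opened to traffic in 1937."],
)
```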
VectorChat is dedicated to delivering the most immersive conversational AI experience. Their upcoming platform, Toffee, utilizes Retrieval-Augmented Generation (RAG) to provide users with expansive memory, extended conversation lengths, and domain-specific knowledge.
In developing Toffee, they found that while RAG has seen many improvements, chunking solutions lagged significantly behind. Existing methods were either too basic or overly resource-intensive, making the RAG pipeline costly and less accurate. Traditional chunking approaches (e.g., splitting every X tokens with Y tokens of overlap) were cheap to run but drove up inference costs by dragging unnecessary context into LLM queries. Meanwhile, advanced semantic chunking solutions like Unstructured.io were prohibitively expensive and slow, limiting file uploads for users.
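For reference, the traditional baseline described above takes only a few lines; whitespace splitting stands in for a real model tokenizer here:

```python
def fixed_size_chunks(text: str, chunk_tokens: int = 256,
                      overlap: int = 32) -> list[str]:
    """Naive baseline: a window of `chunk_tokens` tokens that slides
    forward by `chunk_tokens - overlap`, so consecutive chunks share
    `overlap` tokens."""
    tokens = text.split()  # stand-in for a real tokenizer
    step = chunk_tokens - overlap
    return [" ".join(tokens[i:i + chunk_tokens])
            for i in range(0, max(len(tokens) - overlap, 1), step)]
```

Because the window is blind to meaning, it routinely cuts across topic boundaries, and the stray context it drags into each retrieved chunk is exactly what inflates per-query token costs.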
To address these issues and fulfill Toffee’s vision, VectorChat’s team created an algorithm that outperforms current industry solutions. Instead of developing proprietary models, they capitalized on the underdeveloped state of the field. The documentation provided includes all necessary information to develop solutions that match or exceed their current model.
Their aim is to reduce costs, enhance accuracy, and unlock new possibilities. As LLMs expand to include diverse datasets (e.g., audio, images, video), intelligent chunking becomes increasingly crucial.
The subnet is designed with a clear, transparent, and fair incentive system to push beyond current achievements. They are eager to see how miners will advance this technology.
Their goal is to establish this subnet as the leading provider of advanced chunking solutions, aiming for profitability in the near future.
Phase 1: Foundation
Phase 2: Demand
Phase 3: Expansion