PSEEDR

ModelBest Targets Agentic AI with BMTools and WebCPM Frameworks

New open-source frameworks aim to decouple Chinese AI development from OpenAI's infrastructure by enabling tool use and interactive search.

· Editorial Team

ModelBest, the commercial entity driving the OpenBMB community, has formally entered the autonomous agent landscape with the release of BMTools and WebCPM. Positioned as an open-source alternative to the closed ecosystem of ChatGPT Plugins, BMTools provides a framework for Large Language Models (LLMs) to manipulate external utilities, while WebCPM introduces a retrieval-based architecture specifically optimized for Chinese-language interactive search. The methodology underpinning WebCPM has been accepted by the Association for Computational Linguistics (ACL) for its 2023 conference.

The release marks a distinct shift in the open-source sector from developing static text generators to engineering 'Agents', meaning systems capable of executing multi-step tasks through external interfaces. ModelBest describes BMTools as a "self-developed large model tool learning engine", explicitly framing it as a "ChatGPT Plugins domestic substitute" for the Chinese market. This positioning suggests a strategic move to reduce domestic AI development's reliance on OpenAI’s API infrastructure.

The Tool Learning Paradigm

While frameworks like LangChain have popularized the concept of chaining LLM prompts to Python scripts or APIs, BMTools aims to systematize the learning process itself. Rather than relying solely on prompt engineering to coerce a model into using a calculator or a calendar, BMTools appears to focus on the underlying capability of the model to recognize when and how to deploy a tool.

The architecture addresses a critical bottleneck in current LLM deployment: hallucination. By offloading factual queries to external databases or search engines, a model can ground its answers in retrieved data rather than in what it memorized during training, which should reduce factual errors. ModelBest’s release includes a repository of pre-built tools, though the extent of compatibility with non-CPM (Chinese Pre-trained Model) architectures, such as LLaMA or BLOOM, remains unclear.
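To make the tool-use idea concrete, the following is a minimal sketch of a tool registry with a dispatch step. It is not the BMTools API: the names Tool, REGISTRY, choose_tool, and answer are hypothetical, and the keyword rules in choose_tool merely stand in for the decision a trained model would make about when and how to call a tool.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class Tool:
    """Hypothetical tool description (an assumption, not the BMTools schema)."""
    name: str
    description: str
    run: Callable[[str], str]


def calculator(expression: str) -> str:
    # Toy arithmetic: handles only "a + b" style integer addition.
    left, right = expression.split("+")
    return str(int(left) + int(right))


def capital_lookup(country: str) -> str:
    # Stand-in for an external database or search engine.
    facts = {"france": "Paris", "japan": "Tokyo"}
    return facts.get(country.strip().lower(), "unknown")


REGISTRY: Dict[str, Tool] = {
    "calculator": Tool("calculator", "Adds two integers.", calculator),
    "capital_lookup": Tool("capital_lookup", "Returns a capital city.", capital_lookup),
}

CAPITAL_PREFIX = "capital of"


def choose_tool(query: str) -> Optional[str]:
    """Placeholder for the model's decision about which tool, if any, to use."""
    if "+" in query:
        return "calculator"
    if query.lower().startswith(CAPITAL_PREFIX):
        return "capital_lookup"
    return None


def answer(query: str) -> str:
    tool_name = choose_tool(query)
    if tool_name is None:
        # No tool applies: a real system would fall back to the model itself.
        return "MODEL_ONLY: " + query
    argument = query[len(CAPITAL_PREFIX):] if tool_name == "capital_lookup" else query
    return REGISTRY[tool_name].run(argument)
```

Calling answer("2 + 3") routes to the calculator, answer("capital of France") routes to the lookup table, and any other query falls through to the model-only path. In a tool-learning system, the hand-written rules in choose_tool are the part the model itself is expected to learn.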

WebCPM: Interactive Search and Retrieval

Running in parallel to the tool-learning engine is WebCPM, which ModelBest claims is the "first open-source model in the Chinese domain based on interactive web search QA". Unlike standard Retrieval-Augmented Generation (RAG), which simply fetches documents based on a query, WebCPM mimics human search behavior. It performs interactive searching, browsing, and result synthesis to answer complex questions.
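The difference between single-shot retrieval and interactive search can be illustrated with a toy example. The sketch below is not WebCPM's code: the in-memory PAGES dictionary stands in for a search engine, and the scripted list of (query, keyword) steps stands in for the actions a trained model would choose on its own.

```python
from typing import Dict, List, Tuple

# Toy "web": page title -> page text. A stand-in for a real search engine.
PAGES: Dict[str, str] = {
    "Tea history": "Tea drinking originated in China. It spread to Japan later.",
    "Coffee history": "Coffee was first cultivated in Ethiopia and Yemen.",
}


def search(query: str) -> List[str]:
    """Return titles of pages that share at least one word with the query."""
    words = set(query.lower().split())
    return [title for title in PAGES if words & set(title.lower().split())]


def single_shot(query: str) -> str:
    """Plain retrieval: one query, first hit, no follow-up."""
    hits = search(query)
    return PAGES[hits[0]] if hits else ""


def interactive(steps: List[Tuple[str, str]]) -> str:
    """Scripted search session: search, open results, quote, then synthesize."""
    evidence: List[str] = []
    for query, keyword in steps:
        for title in search(query):                # action: issue a search
            page = PAGES[title]                    # action: open a result
            for sentence in page.split(". "):      # action: scan the page
                if keyword.lower() in sentence.lower():
                    evidence.append(sentence.rstrip("."))  # action: quote
    return "; ".join(evidence)                     # action: synthesize
```

Here single_shot("tea history") returns one page verbatim, while interactive([("tea history", "originated"), ("coffee history", "cultivated")]) issues two searches, opens every result, and combines only the relevant sentences. The multi-step loop is what the article means by mimicking human search behavior; in a real system the model, not a script, decides each action.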

The academic rigor behind WebCPM distinguishes it from many rapid-release open-source projects. The specific methodology was "recently accepted by ACL 2023", a top-tier venue for NLP research. Acceptance indicates that the model’s approach to information retrieval and synthesis has passed peer review, although it does not by itself establish how the system performs in production. The system appears to be modeled after OpenAI’s WebGPT, adapting the paradigm of long-form question answering via browser interactions for the Chinese linguistic landscape.

Competitive Landscape and Limitations

The timing of this release coincides with an industry-wide pivot toward autonomous agents, evidenced by the rise of AutoGPT and HuggingGPT, the latter published by Microsoft under the JARVIS project name. However, most existing agent frameworks rely heavily on OpenAI’s GPT-4 as the reasoning engine. BMTools attempts to bring that reasoning capability in-house, potentially utilizing OpenBMB’s own foundation models.

The utility of these tools outside the Chinese ecosystem may also be limited initially. The explicit focus on the "Chinese domain" for WebCPM suggests the training data and retrieval logic are optimized for domestic search engines (likely Baidu or Sogou) rather than Google or Bing. Furthermore, while the code is open-source, latency metrics for the interactive search component, often a pain point in agentic workflows, have not been detailed in the initial release materials.

Strategic Implications

For technical executives, the launch of BMTools signals the maturation of the Chinese open-source AI stack. It demonstrates that the focus is moving beyond simply training models with ever more parameters to building the middleware required for application integration. If BMTools achieves adoption, it could fragment the plugin standard, creating a bifurcation where Western applications optimize for OpenAI’s standard and Chinese applications build against the BMTools specification.

Sources

  • ModelBest (OpenBMB) Official Announcement
  • WebCPM Academic Paper (ACL 2023)
  • BMTools Code Repository

Key Takeaways

  • ModelBest has released BMTools, an open-source engine designed to enable LLMs to use external tools, positioning it as a competitor to ChatGPT Plugins.
  • WebCPM, a new model for interactive web search and QA, has been accepted to ACL 2023, validating its methodology via peer review.
  • The release addresses the 'hallucination' problem by enabling models to retrieve real-time data, specifically optimizing for the Chinese linguistic domain.
  • The framework represents a strategic move to build an autonomous agent ecosystem independent of OpenAI's API infrastructure.
  • Compatibility with Western foundation models (like LLaMA) and latency performance for interactive search remain unverified variables.
