Product Overview

"Shu Jia" is an all-encompassing SAAS platform that consolidates web-wide information from both public and private domains. It employs state-of-the-art big data frameworks and applies deep learning-driven NLP to efficiently manage extensive data flows. The platform excels at:

- High-Trust Source Identification:

Ensuring information reliability by filtering trustworthy sources.

- Overloaded Information Filtering:

Curating relevant content from the data deluge.

- Structural Indexing:

Structuring data within a multi-dimensional knowledge base, augmented by intelligent semantics.

It serves general industry users with a suite of cloud services tailored for:

- Content supply

- Clue discovery

- Hotspot analysis

- Dissemination analysis

- Ranking analysis

- Special topic tracking

- Content monitoring

- Open API access

Architected on a multi-tenant management system and micro-service architecture, "Shu Jia" guarantees scalability and adaptability. It is designed as an integrated solution for data management, analysis, and utilization, focusing on bolstering content dissemination strategies and monetization, thereby maximizing data's inherent value.

Product Advantages
High Timeliness Data Resources

The platform features a dynamic physical address collection system with prioritized scheduling. It employs parallel collection techniques, followed by secondary scheduling of sub-modules, to ensure the high timeliness of large-scale data aggregation. Key data is collected at a minute-level frequency, with news and interactive data interconnected in real-time.

The platform features a dynamic physical address collection system with prioritized scheduling. It employs parallel collection techniques, followed by secondary scheduling of sub-modules, to ensure the high timeliness...

Comprehensive Information Coverage

The platform's reach extends to over 1,100 domestic digital newspapers, more than 300,000 media and government website collection points, over 3,000 domestic APP clients, and it nearly fully encompasses Weibo and WeChat accounts. It also covers distribution channels such as Toutiao, Baijiahao, TikTok, and Kwai. Internationally, the platform boasts a vast database of overseas news media and social platforms, with coverage in 93 languages across 183 countries. It includes over 150,000 overseas collection points, more than 7,000 overseas websites, a significant number of overseas APP clients, over 10 million overseas social media accounts, 200,000 pieces of overseas physical data, and a substantial volume of overseas data in the entire database.

The platform's reach extends to over 1,100 domestic digital newspapers, more than 300,000 media and government website collection points, over 3,000 domestic APP clients, and it nearly fully encompasses Weibo and WeCh...

Data Refined Processing and Intelligent Indexing

The platform utilizes a hybrid approach of expert-standardized indexing and machine intelligence indexing to process content information with precision. The data is sourced purely and comprehensively, with multi-dimensional and highly accurate data tags. Each piece of data is endowed with intelligent knowledge attributes, making it particularly suitable for high-trust data applications in specialized vertical markets.

The platform utilizes a hybrid approach of expert-standardized indexing and machine intelligence indexing to process content information with precision. The data is sourced purely and comprehensively, with multi-dimen...

In-Depth Content Mining with Precise and Continuous Iteration

Leveraging machine learning and deep learning frameworks, along with big data distributed computing frameworks, the platform performs in-depth content mining and denoising. A team of data analysts and algorithm engineers operates in a continuous daily iterative cycle to guarantee the precision and reliability of the analysis outcomes. By refining these aspects, the platform establishes itself as a reliable and efficient tool for real-time data collection, comprehensive coverage, intelligent data processing, and precise content analysis, catering to the needs of various industries and applications.

Leveraging machine learning and deep learning frameworks, along with big data distributed computing frameworks, the platform performs in-depth content mining and denoising. A team of data analysts and algorithm engine...

Product Functions

Content supply Cloud Service

The platform meets users' needs for Internet information content by seamlessly integrating a wide array of data from various sources. It encompasses a diverse range of information channels, such as:

- Digital newspapers
- Websites
- Apps
- Social media platforms like WeChat and Weibo
- Content platforms including Toutiao
- Short video platforms like TikTok and Kwai

The data is meticulously organized and categorized for easy navigation and use.The platform's core services are designed to provide a comprehensive and tailored experience:

1. Fine-Grained Content Retrieval: Allows users to search and access specific content with precision.
2. Information Subscription: Enables users to receive updates and notifications about topics of interest.
3. Basic Data Provision: Offers fundamental data services to support user operations.
4. Feature-Rich Data Provision: Extends beyond basic data to include advanced features and functionalities.

In addition to these services, the platform is equipped with the capability to interface automatically with the user's locally deployed application systems. This integration is customized to align with the user's unique requirements, ensuring a seamless and efficient workflow. By offering these capabilities, the platform enhances the user's ability to manage and utilize Internet information content effectively.

Clue Discovery Cloud Service

The platform harnesses the power of the entire network's public data to identify high-value content on the Internet in real-time. It utilizes cutting-edge clue discovery technology, which, when combined with intelligent semantic analysis and a robust multi-dimensional knowledge base, enables the platform to perform a range of analytical tasks:

- Clue Aggregation: Consolidating relevant data points to form a coherent picture.
- Tracking Analysis: Monitoring and analyzing the evolution of trends and topics over time.
- Forecasting Development Trends: Predicting future patterns and shifts in public discourse and interest.

The platform provides a wealth of insights, offering users a comprehensive view of current and emerging topics, including:

- Real-time Hotspots: The most discussed and relevant issues at the moment.
- Netizen Interests: The interests and preferences of social media.
- Weibo Tipping Points: Critical moments in discussions on the popular Chinese microblogging site, Weibo.
- Emergency Incidents: Immediate and urgent events requiring attention.
- Natural Disasters: Significant environmental events with potential impacts on communities.
- Recent Policy Announcements: Updates on new or proposed regulations and directives.
- Notable Meetings: Important gatherings and conferences that may influence public and private sectors.
- Historical Events of the Day: Commemoration of significant historical occurrences that align with the current date.

By delivering these insights, the platform serves as a valuable tool for users seeking to stay informed about the latest developments and trends across various domains.

Hotspot Analysis Cloud Service

The platform employs big data analytics to pinpoint current information hotspots from an extensive dataset of Internet content. It performs a comprehensive analysis and clustering these hotspots across different industries and geographical areas. The analytical process includes:

- Extraction of Fundamental Attributes: Identifying the core characteristics of each hotspot.
- Multi-Level Analysis: Examining hotspots at various levels of detail to understand their scope and impact.
- In-Depth Semantic Exploration: Delving into the meaning and context of the information to gain a deeper understanding.

The services provided by the platform are multifaceted:

- Industry-Specific and Regional Hotspot Identification: Pinpointing trends and topics that are particularly relevant to specific industries or regions.
- Trend Prediction for Hot Topics: Forecasting the trajectory of emerging topics to anticipate future discussions and interests.
- Provision of Front-Page Headlines: Supplying users with the most pertinent and timely information to stay ahead.
- Coverage of Social Media Hotspots: Including analysis of popular discussions on platforms like Weibo and WeChat.
- Ranking Lists: Offering ranking list data published on the Internet that based on various metrics, such as engagement or influence.
- Aid in Content Creation: Supporting the process of generating content that resonates with current trends and audience interests.

By offering these services, the platform acts as a strategic asset for users looking to navigate the dynamic landscape of online information, enabling them to make informed decisions and create content that is both timely and engaging.

Dissemination Analysis Cloud Service

The service leverages the power of big data analytics to assess the real-time dissemination impact of original content. It utilizes advanced technologies such as:

1. Original Content Identification: To distinguish and verify the source of the content.
2. Text Similarity Analysis: To measure the uniqueness and variation of content across different platforms.
These technologies are seamlessly integrated with a comprehensive dissemination analysis model and a robust index system to provide a detailed assessment of content performance.
The suite of services offered by the platform includes:
Calculation of dissemination Power Index:The dissemination Power Index is derived from a weighted assessment of various interaction metrics, which collectively capture the extent of audience engagement with the original content.
The calculation takes into account the following key factors:
Interaction Volume: The total number of actions such as forwarding, commenting, reading, and liking, which reflect the direct engagement from the audience.
Citation Frequency: The count of times the article is referenced or cited by other sources, indicating its influence and reach within the broader discourse.
Other Significant Indicators: Additional relevant metrics that may contribute to the overall impact of the content, such as the duration of user engagement or the diversity of platforms where the content is shared.
3. Citation Analysis: Examining how often and where the original content is referenced or shared.
4. Visualization of Content Dissemination Trajectories: Graphically representing the spread and reach of the content.
5. Analysis of User Interaction Data: Evaluating user engagement through metrics like reposts, comments, reads, and likes.

In addition to these services, the platform has the capability to:

- Automatically Generate Comprehensive Analysis Reports: Providing users with detailed insights and summaries of their content's dissemination and impact.

By delivering these services, users can gain a clear understanding of how effectively their original content resonates with the public and disseminates across different platforms, facilitating more strategic and impactful content creation and distribution strategies.

Ranking Analysis Cloud Service

The platform actively monitors a substantial volume of Internet data in real-time and offers a comprehensive set of rankings based on a tailored evaluation index system. These rankings are characterized by their multi-dimensional nature and their ability to span across different time periods, providing a nuanced perspective on various metrics.

The system encompasses several types of rankings to cater to different user needs:

Account Ranking: This focuses on the influence and reach of accounts or entities on the Internet.
Content Ranking: It evaluates and ranks content based on its popularity, engagement, and relevance.
Customized Thematic Ranking: This allows users to create and view rankings based on specific themes or topics of interest.
Cross-Channel Organization Ranking: This ranking compares and evaluates organizations across different channels or platforms. These rankings serve as a valuable tool for users to gauge the dissemination power of various entities and to uncover high-quality content. They are instrumental in aiding strategic content operations by offering insights into what resonates with audiences and how to optimize content for better visibility and impact.

Special Topic Tracking Cloud Service

The platform delves into the dynamics of information dissemination surrounding significant events, themes, and activities. It conducts a thorough analysis of the various elements that influence this process:

- Channels: The different platforms or mediums through which information is disseminated.
- Nodes: The key points or entities that facilitate the spread of information.
- Paths: The trajectories that information takes as it moves from source to audience.
- Sources: The origins of the information being disseminated.
- Key Factors: The critical variables that affect the spread and impact of information.
- Topic Drift: The shifts in the focus or narrative of a theme over time.

The platform offers a suite of services to enhance users' thematic analysis capabilities:

1. Thematic Tracking: Continuous monitoring of specific themes to identify trends and patterns in information dissemination.
2. In-Depth Analysis: A detailed examination of the factors influencing the spread and reception of information within a particular theme.
3. Automated Thematic Reports: The generation of comprehensive reports that include analytical insights and opinions on the theme under study.
4. Enhanced Analysis Capabilities: Tools and resources designed to improve users' ability to analyze and understand the complexities of thematic information dissemination.

By providing these services, the platform empowers users with a deeper understanding of how information flows and evolves within their areas of interest, enabling more informed and strategic content creation and distribution.

Content monitoring cloud service

The platform is equipped with real-time monitoring capabilities for data disseminated across a multitude of channels, ensuring efficient and precise content oversight and review. It is designed to swiftly identify a range of issues, including:

- Various Errors: Typographical, grammatical, and factual inaccuracies.
- Sensitive Information: Content that may be inappropriate or not suitable for public dissemination.
- Modification Suggestions: Recommendations for improving the content's quality and adherence to guidelines.
- Source References: Providing credible references to support the content's claims and statements.

In addition to detection and suggestions, the platform also offers intelligent monitoring and analysis Reports. Furthermore, the platform supports multimodal content proofreading, which means it can handle different types of content, from text to multimedia elements, ensuring a thorough review process that is adaptable to various content formats.

Open API Cloud Service

The platform offers a suite of standardized Application Programming Interface (API) services tailored to Internet information services, encompassing a wide range of fields and scenarios. It operates on a one-stop access model, designed to streamline the process for developers.

One-Stop Access Service: The platform delivers a seamless experience with professional technical support, comprehensive API documentation, and a suite of tools that includes debugging utilities and illustrative examples. These resources are designed to expedite the development process and enhance the efficiency of program creation.

Extensive API Offerings: Currently, the platform boasts an extensive catalog of over 200 API interfaces, catering to various business needs and applications. The API services are categorized as follows:

1. Basic Data API Services: Fundamental services that provide core data access.
2. Value-Added Scenario API Services: Specialized APIs that offer additional value in specific scenarios.
3. Intelligent Analysis API Services: APIs that incorporate advanced analytics to support data interpretation and decision-making.
4. Dissemination Analysis API Services: Tools focused on analyzing how information spreads across different channels.
5. Thematic Analysis API Services: APIs designed to analyze and understand trends related to specific themes or topics.
6. Targeted Collection API Services: APIs that facilitate the collection of data based on specific criteria or targets.

This comprehensive API ecosystem is engineered to fully empower a diverse array of business scenarios, offering flexibility and customization to meet the unique requirements of different users and applications.

Application Scenario
Media Industry
Government Affairs
Enterprises
Security Industries
The platform collects and refines Internet information data, harnessing natural language processing capabilities for analysis and indexing. It offers a spectrum of data-driven services that cover the entire media workflow, aiding media organizations in advancing their digital transformation and integrated media development.Application Scenarios: Intelligent content creation assistance, lead identification, hotspot mining and analysis, news dissemination impact assessment, personalized content delivery, rank list analysis, and thematic tracking services.
It delivers Internet data support services for government sectors, encompassing data evaluation, content indexing, policy analysis, public communication impact assessment, and government performance evaluation, among others. This assists government bodies in accelerating digital transformation and data-driven governance.Application Scenarios: Provision of government-released content data, assessment of government website influence, analysis of government new media impact, and evaluation of the impact of policies and regulations.
The platform offers data services tailored to industry policies, think tanks, intelligence, brand promotion, and enterprise/industry hotspot analysis, aimed at facilitating digital management and transformation for businesses.Application Scenarios: Customized industry information processing, think tank data analysis, brand influence assessment, enterprise risk control and alerting, and key event analysis.
With a focus on global security and defense think tanks, the platform enables public safety departments to promptly detect and address potential sensitive information and harmful content.It provides targeted information monitoring and event analysis capabilities, offering robust informational and data support for intelligence analysis and Internet space governance.Application Scenarios: Customized open-source intelligence collection, information processing and handling, think tank data analysis, and Internet space governance. By offering these tailored services, the platform empowers different sectors to leverage data effectively, enhancing their operational efficiency, strategic decision-making, and responsiveness to emerging trends and challenges.
X