Scale: 1 million+ users and 1,+ enterprise users. Popularity is high: on Baidu Index it equals the combined total of the competing products.
1. Many data entities: more than 159 million, with the newest entity types, such as social organizations and Hong Kong, Macao, and Taiwan enterprises, already launched. By comparison, one competing product currently holds 1 million entities in practice, and the other only a little over 8 million.
2. Many dimensions: in addition to common dimensions such as business registration information, court announcements, competing-product information, recruitment, dishonest debtors, patents, and trademarks, Tianyancha has 8+ further dimensions, including qualification certificates, software copyrights, work copyrights, WeChat official accounts, Weibo company accounts, Weibo personal accounts, Weibo accounts, social organizations, law firms, judicial auctions, Hong Kong enterprises, and import and export. These are going online one after another. Competing products cover only a little over 5 dimensions.
3. High update frequency: thousands of crawler processes are dynamically allocated to collect data 24 hours a day. Every day, 2,-1, new records are added and 4-5 million records are updated, so the whole database can be refreshed in about a week.
4. Rich data sources: in addition to crawling, Tianyancha, as the industry leader, has signed exclusive cooperation agreements with many data source companies and obtains data from official sources or industry-leading enterprises. For example, administrative penalty data comes not only from the industry and commerce authorities but also from Credit China, and there are more than 2 kinds of qualification certificates. Listed-company data is exchanged directly with Great Wisdom (Dazhihui) as data sheets, trademark data comes directly from the Trademark Office through cooperation with Master Quan (Quandashi), and case data comes from direct cooperation with Peking University Magic Weapon (PKULaw, Beida Fabao). All of these are updated synchronously.
5. Leading data collection and cleaning technology: independently developed and patented, with self-learning cracking of verification codes, a massive pool of IP addresses (18 million) to get around firewalls, and simulation of user login behavior to obtain data.
6. Leading data storage technology: TSTN, a large-scale spatio-temporal relational network with traceable relational data, combines graph storage and graph analysis technology. It holds billions of entities and relationships carrying tens of billions of attributes, while still providing efficient storage and fast graph-based relational queries.
7. Leading data analysis technology: independently developed and patented. In a performance comparison of the same association analysis algorithm on three data storage systems (traditional relational databases represented by Oracle and MySQL, key-value storage represented by Hadoop and HBase, and Tianyancha's TSTN system), the running time dropped from 28 hours to less than 3 seconds.
8. Leading data synchronization technology: relational data and traditional data are synchronized and updated together using independently developed, patented technology. This approach does not conflict with traditional statistics-based macro big data. It unifies the two, covering macro big data while highlighting micro big data.
9. Location advantage: Tianyancha is headquartered in Beijing, which makes after-sales follow-up, communication on technical issues, and communication with leading industry partners convenient, and allows on-site visits in both directions.
10. Brand advantages: Tianyancha is legal and compliant, with a large market share, 1 million users, ample funding, and high-quality products at low prices. Its Baidu popularity is twice the combined total of the two competing products, and its high brand value supports stable long-term cooperation.
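
To make the "graph-based relational query" in points 6 and 7 concrete, here is a minimal Python sketch of one kind of association analysis: finding the shortest chain of relationships between two entities with a breadth-first search over an adjacency list. This is only an illustration under stated assumptions. The source does not describe how TSTN works internally, and the entity names, relationship labels, and function names below are all hypothetical.

```python
from collections import deque

# Hypothetical toy data: entities and relationship labels are invented
# for illustration and do not come from any real database.
EDGES = [
    ("Company A", "Person X", "legal representative"),
    ("Person X", "Company B", "shareholder"),
    ("Company B", "Company C", "investment"),
    ("Company D", "Person Y", "executive"),
]


def build_graph(edges):
    """Build an undirected adjacency list: node -> [(neighbor, relation)]."""
    graph = {}
    for source, target, relation in edges:
        graph.setdefault(source, []).append((target, relation))
        graph.setdefault(target, []).append((source, relation))
    return graph


def find_association(graph, start, goal):
    """Return the shortest chain of (from, relation, to) hops, or None.

    Uses breadth-first search, so the first time the goal is reached
    the path has the fewest possible hops.
    """
    if start not in graph or goal not in graph:
        return None
    parents = {start: None}  # node -> (previous node, relation used)
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            break
        for neighbor, relation in graph[node]:
            if neighbor not in parents:
                parents[neighbor] = (node, relation)
                queue.append(neighbor)
    if goal not in parents:
        return None  # the two entities are not connected
    # Walk back from the goal to the start to rebuild the path.
    path = []
    node = goal
    while parents[node] is not None:
        previous, relation = parents[node]
        path.append((previous, relation, node))
        node = previous
    path.reverse()
    return path


if __name__ == "__main__":
    graph = build_graph(EDGES)
    for hop in find_association(graph, "Company A", "Company C") or []:
        print(" -[{1}]- ".format(*hop).join((hop[0], hop[2])))
```

Because each entity's neighbors are read directly from the adjacency list, every hop is a lookup rather than a table join. That is the general reason graph storage tends to suit multi-hop relationship queries, though the timings quoted in point 7 are the source's own claim and are not reproduced by this sketch.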