Big data, or massive data, massive data and big data, refers to the huge amount of data involved, which cannot be intercepted and managed by human resources within a reasonable time. "Big data" in the Internet industry refers to the phenomenon that Internet companies generate and accumulate user network behavior data in their daily operations. The scale of these data is so huge that it can't be measured by g or t.
Big data in China
Characteristics of big data:
Specifically, big data has four basic characteristics:
First, the amount of data is huge. Baidu data shows that its new homepage navigation needs to provide more data every day than.
1, 5PB( 1PB= 1024TB), these data will exceed 500 billion A4 sheets if printed. It has been confirmed that up to now, the data volume of all printed matter produced by human beings is only 200PB.
It only takes 133 days to deduce history.
Second, the data types are diverse. Today's data types are not only text,
More are pictures, videos, audio, geographical location information and other types of data, and personalized data accounts for an absolute majority.
Third, the processing speed is fast.
Data processing follows the "1 second law", and high-value information can be quickly obtained from all kinds of data. Fourth, the value density is low. Take video as an example. An hour of video, in the process of uninterrupted monitoring, may only have one or two seconds of useful data.
The only useful data is 1/3600. Are you ready for the arrival of the era of big data?
The era of big data is coming.
McKinsey, a world-renowned consulting firm, first proposed the arrival of the era of "big data". McKinsey said: "Data has penetrated into every industry and business functional field today and become an important factor of production. The mining and application of massive data indicates a new wave of productivity growth and the arrival of consumer surplus. "
In 20XX, the word bigdata was mentioned more and more. People use it to describe and define the massive data generated in the era of information explosion, and to name the related technological development and innovation. It once appeared on the cover of the The New York Times Wall Street Journal column, entered the White House news, appeared in some Internet-themed lecture salons in China, and was even written into the investment recommendation report by the sensitive smell of Guo Jin Securities, Guotai Junan and Galaxy Securities.
2. The rapid expansion of data determines the future development of the enterprise. Although many enterprises may not be aware of the hidden dangers brought by the explosive growth of data, as time goes by, people will become more and more aware of the importance of data to enterprises. As The New York Times said in a column in February 20XX, the era of "big data" has arrived. In business, economy and other fields, decisions will be based on data and analysis, not on experience and intuition. Gary king, a sociology professor at Harvard University, said: "This is a revolution. The huge data resources have begun the quantification process in various fields, and all fields, whether academic, commercial or government, will start this process. "
Big data application case:
1, medical industry
In a hospital in Toronto, Canada, there are more than 3000 premature babies' data readings every second. Through the analysis of these data, hospitals can know in advance which premature babies have problems and take targeted measures to prevent premature babies from dying.
2. Smart grid in the energy industry has now realized the terminal in Europe, which is the so-called smart meter. In Germany, in order to encourage the use of solar energy, solar energy will be installed at home. In addition to selling electricity to you, you can buy back the surplus electricity from your solar energy. Data is collected every five minutes or ten minutes through the power grid, and the collected data can be used to predict customers' electricity consumption habits, so as to infer how much electricity the whole power grid needs in the next 2-3 months. With this forecast, you can buy a certain amount of electricity from power generation or power supply enterprises. Because electricity is a bit like futures, it will be cheaper to buy it in advance and more expensive to buy it in stock. Through this forecast, the procurement cost can be reduced.
3. Communication industry
Telecom operators can analyze a variety of user behaviors and trends through tens of millions of customer data and sell them to enterprises in need. This is a brand-new information economy. Through big data analysis, China Mobile conducts targeted monitoring, early warning and tracking of the whole business operated by enterprises. The system automatically captures the market changes at the first time, and then pushes them to the designated person in charge in the fastest way, so that he can understand the market situation in the shortest time and move out the night traffic package in combination with his own experience.
4. Retail industry
Retail enterprises also monitor customers' in-store walking and interaction with goods. They combine these data with transaction records for analysis, so as to give suggestions on which goods to sell, how to place them and when to adjust the price. This method has helped a retail enterprise to reduce the inventory of 17%, and at the same time, on the premise of maintaining market share, it has increased the proportion of self-owned brand goods with high profit margin.
Three-minute speech on big data (2)
Since we talk about big data, we must first understand the definition of big data. What is big data? Imagine if in the future, a person has more computer equipment than the current global computing power, a person generates more data than the current global data, and even your pet dog generates more information than the current global data. What will happen in the world? So let's take a look at some definitions of big data given by official website.
So as to export data useful for the business. A purposeful definition of big data business: "Use big data to solve economic and social problems. Improve business added value. Or supporting the cause "Big data not only refers to its size, but also includes what data is made of and how to use it. This is different from the current system. It is very difficult to deal with complex data sets by using data-based management tools that have been sold or applications that have been processing data so far.
Let's look at two more from China.
So what is big data? In fact, it is a lot of data. It stores a lot of information in the form of data, and then accumulates it until the data is too big to be used simply. Big data is nothing magical. Just like the question in the movie Endless: Humans usually only use 20% of their brains. What will the world be like if the remaining 80% of their brain potential is stimulated? In the management of enterprises, industries and countries, usually less than 20% of the data is effectively used (even less). What will the world be like if the value of the remaining 80% data is stimulated? Especially with the new Moore's law of massive data, the data exploded, and then the data was applied more effectively. What will the world be like? Think with your imagination.
So how is big data formed?
In the past, we paid attention to the data generated by trading system and business system, and analyzed and displayed it through data warehouse. In fact, terminals, especially individuals, have all kinds of assembly line operations, such as purchasing lists, browsing history on the Internet, photos and Weibo. But we didn't pay attention to them. In the era of big data, we pay more attention to these large amounts of data and hope to analyze them and find value. Therefore, big data is actually generated in systems, terminals and individuals. I used to have it, but I have never used it. This is the reason and significance of the formation of big data. No matter how much data is blocked or not used, it is worthless. The flights in China are very late, compared with the punctuality of American flights. Among them, a good practice of American air traffic control agencies has played a positive role. To put it simply, the United States will announce the delay rate and average delay time of airlines and flights in the past year, so that customers can naturally choose flights with high punctuality rate when purchasing air tickets, thus attracting airlines to strive to improve punctuality rate through market means. This simple method is more direct and effective than any management means (such as the macro-control means of China government).
Let's start with this printed textbook and give two examples. For the advantages of Fukushima nuclear power plant and convenience store, big data is used. For example, supermarkets have to buy a lot of goods, and then some people who buy them are not very good at piling them up. Even if the price is reduced, they may not be able to sell them, resulting in losses. Then convenience stores will choose to buy goods according to the people living around them. Eat more fast food near office workers' apartments, and eat more daily necessities in places where there are many elderly or middle-aged people. For Fukushima nuclear power plant, if the information of monitoring nuclear power plant is collected and analyzed in time, problems may be found and leaked, so as to solve the problems and avoid losses. This is to predict possible accidents in the future so as to avoid them effectively. For another example, astronomers study the trajectories of celestial bodies and find more planets. Collecting these data together is big data. By analyzing these data, it can be calculated that there may be planets hitting the earth in the future. Then take some necessary measures to eliminate this devastating blow. How to recognize the present? For example, in medicine, cells are observed by medical equipment, and then whether cells are cancer cells is inferred according to other tests. If we turn the basis of judgment into data, input it into a computer and give her some learning ability, such as giving him a cancer cell, he will remember the characteristics of these cells, and over time, his accuracy will be higher and higher. The earlier it plays a role in medicine, the more obvious it will be. This is a simple understanding of his application. After that, Zhao Fei will talk about this in more depth and detail.
So, is big data good for us? That's not true. Just like that medical example. The developed machine will make the staff who diagnose cancer cells lose their jobs. Just as factories were gradually automated in the 1980s, workers would lose their jobs.
Very serious. If you are a horse, then you certainly don't like the industrial revolution.
Next, let's talk about the problems faced by big data now, that is, how to make good use of these big data to serve people and benefit us. Take LSST as an example. It is a wide-area astronomical telescope jointly developed by many countries. Now it's on a mountain in Chile. Construction began in 20XX. In 20XX, Charles Simonyi and Bill Gates donated USD 20XX and USD 654.38+million to the LSST project respectively. The LSST project is still seeking nearly $400 million from the National Science Foundation. . It can shoot all day every three days. By 20XX, it will be equipped with a super digital camera, and the photos taken will reach 3.2 billion pixels. Such photos need 1500 HDTV screen to be displayed. LSST's camera takes more than 200,000 photos every year, which is beyond the analytical ability of researchers. If we make good use of these data, it will be a huge wealth, but there is no effective solution at present. There is not so much manpower to analyze, and there is no such equipment to analyze directly. So this pile of data is useless and worthless. So big data is a science with great potential. Next, watch a video of TED talk with you.
The speaker is Kenneth Cookeye. I work for the Wall Street Journal (Asia Edition) and the Tribune International Pioneer. He is a member of the American Council on Foreign Relations and a regular business and technical commentator of CNN, BBC and NPR. He has a deep research on big data. Listen to what he said about big data. After reading this speech, please ask Zhao Fei to bring a deeper understanding of big data.