Current location - Trademark Inquiry Complete Network - Futures platform - What is a fitting index?
What is a fitting index?
Fitting index simulation index/fitting index/consistency index

Fitting is the research field of econometrics. The so-called fitting index is simply the correlation between the selected variable and the explained variable.

Stock \ Fund Fitting Index:

Index fund is a kind of fund that matches the target index, tracks the change of the target index and realizes the synchronous growth with the market. The investment of index funds adopts the investment strategy of fitting the target index return rate, and invests in the constituent stocks of the target index in a diversified way, so that the stock portfolio return rate fits the average return rate of the capital market represented by the target index.

Simple operation and high transparency.

Theoretically speaking, the operation method of index fund is very simple, as long as you buy the corresponding proportion of securities according to the proportion of each securities in the index and hold it for a long time.

Second, index funds are cheap. Because index funds adopt holding strategy, they don't need to exchange shares frequently, and the transaction cost is much lower than that of active management funds.

In addition, the performance of index funds is highly transparent. When investors see that the target benchmark index tracked by index funds has gone up, they will know how much their index funds can go up today. Therefore, many institutional investors and some individual investors who can see clearly the general trend and individual stocks prefer to invest in index funds, and there is no need to worry about "earning the index but not making money".

Effectively avoid non-systematic risks

Compared with other funds, the advantage of index funds is that they can effectively avoid non-systematic risks, so index funds widely diversify their investments, and the fluctuation of any stock will not affect the overall performance of index funds, thus diversifying risks. On the other hand, because the indexes pegged by index funds generally have a long tracking history, the risks of index funds can be predicted to some extent.

Therefore, in the long run, the investment performance of index funds is better than other funds. In 2006, index funds became the most profitable fund varieties in the market with an average annual cumulative net growth rate of 125.87%. Such funds will not invest too much money in certain securities or industries. Generally, full investment will be maintained, and there is no market speculation.

Empirical study on key factor fitting index investment method

Indexing investment is a kind of securities investment that attempts to completely copy a certain securities price index or build a portfolio according to the principle of compiling securities price index. Funds invested in this way are called index funds, and their income level target is the change range of the underlying index. Since 1990s, the performance of most equity fund managers on Wall Street in the United States has been lower than the market index in the same period. In this way, the index fund with the core idea of copying the market index trend has developed rapidly around the world, which has formed a huge impact and challenge to the traditional thinking of securities investment. In the United States, index funds are becoming more and more popular because their returns exceed 65~80% of similar funds. Among the new funds flowing into the same fund market, the proportion flowing into index funds increased from 2% in 1994 to 3 1% in 1999. At the end of 1999, the total amount of American index funds reached $338 billion, accounting for 8.37% of the total amount of American equity funds. The largest index fund in the United States and the largest * * * mutual fund Vanguard S&; P 500 manages $654.38+005 billion.

The emergence of indexed investment in China is relatively late, mainly because China's securities market is still relatively young and is still being explored and developed. The investor group in China is still immature, lacking scientific investment ideas and imperfect supervision of market behavior. Non-market behaviors such as banker's speculation have a great influence on the stock index. Due to these reasons, China's stock index often deviates from the market and cannot reflect the real situation of the market.

As far as the indexed investment method is concerned, the common method in the market is to completely copy a certain securities price index, or to build a portfolio according to the principle of compiling securities price index. This traditional indexed investment method is passive and can play a good role in the normal operation of the market, but when some sample stocks rise or fall abnormally fast, they will lose the opportunity to make further profits and stop losses in time. In order to make up for this deficiency, various alternative methods came into being.

Francesco Corielli and Massimiliano Marcellino(2002) think that the tracking index is a copy of the index, which contains far fewer stocks than the index, and the tracking error does not contain non-recurring components. They used the dynamic factor extraction method to establish an index replacement portfolio, and verified it with the Monte Carlo empirical index and the euro STOXX50 index. The verification results are encouraging, and the alternative combinations basically complete the tracking curve [7]. Wu Chongfeng (2000) analyzed the sample stocks of SSE 30 Index from1July 8, 1998 to1March 29, 1999, and concluded that SSE 30 Index was replaced by a combination of six stocks [6].

From the above research, we find that the indexed investment method does not necessarily need to build a portfolio according to the principle of compiling the securities price index, but can track the index by building an alternative portfolio. On this basis, the author puts forward the key factor fitting index investment method, and holds that the stock index is composed of its sample stocks according to the principle of compiling stock price index, and its trend reflects the * * * interaction of these sample stocks, but not every sample stock contributes the same to the index. There are key factors in the stock index, and the influence of these key factors on the stock index is reflected in the performance of the sample stocks they represent. Similarly, not every key factor contributes equally to it. Among the key factors, there are the most representative key sample stocks, which play a decisive role in the stock index. As long as we catch them, we will catch the stock index. In other words, as long as we invest in the combination of these key factors, we will invest in the stock index. In addition, the representative key sample stocks in the same key factor can be replaced, which can make stock index investment more flexible without affecting portfolio indexation, and make up for the shortcomings of traditional methods to some extent.

Next, we will take the SSE 50 Index as the research object, and make an empirical study on the key factor indexation investment method. The structure of the paper is as follows: firstly, research design is carried out to determine the research procedures, models, samples and data; Then, factor analysis is carried out on the data to extract the key factors of the SSE 50 Index. On this basis, we will make correlation test and regression analysis between the portfolio constructed according to key factors and the actual SSE 50 index to verify this method. Finally draw a conclusion.

research design

I. Research Project and Model Design

The first step is to find out the key factors that affect the trend of the SSE 50 index.

Based on the daily returns of the constituent stocks of the SSE 50 Index, we conducted factor analysis, and extracted n * * * identical factors reflecting the trend of the SSE 50 Index, representing n key factors influencing the trend of the SSE 50 Index. The multi-factor model is constructed as follows:

index 50 = a 1 * f 1+A2 * F2+……+An * Fn+ε

Among them: Index50 is the SSE 50 index; Fn is the nth * * * cofactor; An is the contribution rate of the n * * * th same factor to the SSE 50 Index; ε is the residual.

After finding out these N key factors, further find out the sample stocks represented by these N key factors. The correspondence is as follows:

F 1~a 1 (stock 1 1)+a2 (stock12)+...

F2~b 1 (stock 2 1)+b2 (stock 22)+…

…………………………………

fn ~ n 1(stock n 1)+N2(stock N2)+……

Where: Fn is the nth * * * cofactor; Stock is a sample stock represented by the same factor; A, B ... N is the contribution rate of sample stocks to the same factor, that is, the factor load.

By observing the factor load of the same factor, we can analyze and judge the key factors reflected by each same factor and make corresponding explanations.

The second step is to prove whether the N key factors we found can really reflect the trend of the SSE 50 Index. We use the most representative sample stocks to construct portfolio Portfolio50, and compare it with SSE 50 Index 50 to verify whether Portfolio50 is equivalent to Index 50.

To this end, we find out the most representative I sample stocks among the N key factors, and construct the portfolio according to the proportion of their variance contribution to the total variance as the weight, as follows:

Combination 50 = w 1 * stock 1+w2 * stock 2+…+wi * stock 1

Among them: Portfolio50 is the daily return rate of the constructed portfolio; STOCKi is the daily return rate of the I-th most representative sample stock participating in the construction of the portfolio; Wi is the weight of the first sample stock.

Calculate the daily return rate of portfolio 50 and the daily return rate of SSE 50 index 50. After the correlation test, Portfolio50 and Index50 were analyzed by linear regression. The regression model is constructed as follows:

Portfolio50=a+b*(Index50)+ε

Among them: Portfolio50 is the daily return rate of the constructed portfolio; Index50 is the daily yield of SSE 50 Index; A is a constant term; B is the regression coefficient; ε is the residual.

If the model is verified, A approaches 0 and B approaches 1, then Portfolio50≈Index50, that is, Portfolio50 is equivalent to Index50, indicating that the key factors we found can truly reflect the trend of SSE 50 index, and Portfolio50 can replace SSE 50 index for indexed investment.

Second, the model variable calculation

The daily yield of the constituent stocks of SSE 50 Index is calculated by relative yield. In the case of rights issue, share delivery and cash dividend, it is calculated according to the following formula:

In which: rit is the T-day yield of Class I stocks; Pt and Pt- 1 are the closing prices on T and t- 1 respectively; C is the T-day cash dividend per share based on t- 1; As is based on the share allotment ratio of t- 1 day; S is the corresponding price per share on t- 1; Ad is the proportion of shares sent per share on t- 1

The daily yield index 50 of SSE 50 Index is also calculated by relative yield, and the formula is as follows:

Among them: Rt is the T-day yield of SSE 50 Index; Pt and Pt- 1 are the closing prices of SSE 50 Index on T day and t- 1 day respectively.

Thirdly, study sample selection.

The original trading data such as the closing price of SSE 50 index, the closing price of constituent stocks, and cash dividends required in this study come from the "Great Wisdom Securities Information Platform V5.00" made by Shanghai Wanguo Stock Market Appraisal and Consulting Co., Ltd.

In the process of factor analysis, the sample data period is from February 3, 2002 to March 3, 2004, and each sample stock contains 309 data records. Missing values caused by temporary suspension for various reasons are filled by the adjacent data average method.

Considering that the listing date of sample shares of some newly listed companies is too short, the number of sample data is insufficient, the performance is prone to abnormal fluctuations, and the operating mechanism of all aspects of the company is not perfect, in order to prevent a few data from interfering with the inspection and exclude sample shares, after the key factors are established, their attributes are judged according to professional knowledge. Five sample stocks were excluded, namely: Baiyun Airport (600004), Huaxia Bank (6000 15), China Southern Airlines (600029), CITIC Securities (600030) and Changjiang Electric Power (600900).

To sum up, the sample stocks of factor analysis include 45 sample stocks of SSE 50 Index, each of which contains 309 daily income records. There are 309 groups, 13905 daily retirement records.

In the process of correlation test and regression analysis, the SSE 50 index was officially released on June 2, 2004, with the index code of 0000 16 and the benchmark date of June 365438+February 0, 2003. So far, the amount of data is too small to be calculated directly. However, for the smooth launch of SSE 50, SSE released SSE 50 plate concept index 993265 from 65438+2003 10/2 October. Its compilation method and trend are basically the same as those of SSE 50, but the cardinality adopted is different. Here, we use the data of SSE 50 concept index 993265 instead of the data of SSE 50 index 0000 16 for calculation. The calculation time span is from July 22, 2003 to March 2004 12. Similarly, the processing method of missing values adopts the method of average filling of adjacent data, and * * * statistics 155 groups of data.

factor analysis

Table 1 KMO statistics and bartlett spherical test table

Kaiser-Meyer-holguin sampling adequacy measurement. .958

Bartlett sphericity test about. Chi-square test 9857.426

df 990

Sign. .000

Firstly, we use KMO statistics and Bartlett's spherical test to determine whether the sample data meets the prerequisite of factor analysis. It can be seen that the KMO statistic of partial correlation between the test variables in the table is 0.958, which is close to 1, indicating that there is not much difference in the degree of correlation between variables, and the data is very suitable for factor analysis. At the same time, the result of Bartlett's spherical hypothesis test is also rejected, which strongly identifies the correlation between variables, indicating that there is * * * the same information between the daily returns of sample stocks, which meets the premise of extracting * * * the same factor. See table 1.

The factor extraction method used in this paper is principal component analysis. Considering the interpretability of * * * identity factors, orthogonal rotation is adopted in the process of extracting factors, and the specific rotation method is orthogonal rotation with maximum variance. According to the standard that the cumulative contribution rate of the extracted principal component * * * is above 85%, 20 factors with the same * * * are extracted from one * * *. The information extraction adequacy test table (omitted) tells us that according to the appeal * * * same factor extraction standard, the information extraction of sample stocks is basically sufficient.

Table 2 *** Table of percentage differences explained by the same factor

Factor f1f2f3f4f6fF7 F8 F9f10

The variance percentage is 42.3116.849 4.540 3.208 2.395 2.856 2.367 2.133 2.0351.844.

Cumulative% 42.31149.160 53.700 56.908 59.764 62.158 64.525 66.658 68.693 70.537

The factor f11f12 f13 f14 f15 f16 f17 f18 f/.

Percentage of variance1.7281.6741.5531.491.41.3241

Cumulative% 72.265 73.939 75.49176.982 78.392 79.71681.002 82.263 83.464 84.618.

We use the percentage of variance explained by the same factor (Table 2) as the weight of the factor's contribution to the index, and the corresponding multi-factor model is as follows:

index 50 = 0.423 1 * f 1+0.0685 * F2+0.0454 * F3+0.032 1 * F4+0.0286 * F5+0.0239 * F6+0.0237 * F7+0.02 13 * F8+0.0204 * F9+0.0 184 * f 654

After orthogonal rotation with the largest variance, the variable with factor load greater than 0.4 between factors is proposed, and then a relatively large value is taken according to the contribution of the same sample stock to the same factor. We get the following list of 20 sample stocks, mainly expressed by the same factor, as shown in Table 3.

Table 4 *** List of sample stocks represented by the same factor

F 1 600028 China Petrochemical F5 600664 Harbin Pharmaceutical Group

600808 Maanshan Iron and Steel Co., Ltd. 600038 Hafei Co., Ltd.

600688 Shanghai Petrochemical F6 600839 Sichuan Changhong

6000 19 Baoshan Iron and Steel 600033 Fujian Expressway

600026 China Shipping Development 600008 share capital

600569 Angang F7 60059 1 Shanghai Airlines

600050 China Unicom 60022 1 Hainan Airlines

600036 China Merchants Bank F8 600795 Guodian Power

600350 Shandong Infrastructure 6000 1 1 Huaneng International

600649 Raw water shares 600642 Shenneng shares

600000 Shanghai Pudong Development Bank F9 600643 Aijian shares

F2 600602 Radio and Television Electronics F 10 600887 Yili Stock

600832 Oriental Pearl 600597 Bright Dairy Industry

600637 Radio and Television Information F 1 1 6000 16 Minsheng Bank

600 100 Tsinghua Tongfang F 12 6008 1 1 Oriental Group

600 17 1 Shanghai Belling F 13 600652 ASHI shares

60060 1 Founder Technology F 14 600006 Dongfeng Motor

F3 600609 Jinbei Automobile F 15 6008 12 Huabei Pharmaceutical

600805 Da Yue Investment F 16 600705 North Asia Group

600 104 Shanghai Auto F 17 600895 Zhangjiang Hi-Tech

F4 6007 17 Tianjin Port F 18 600863 Inner Mongolia Huadian

6000 18 Shanghai Port Container F 19 600098 Guangzhou Holdings

Shanghai Airport F20-

The corresponding relationship between each * * * same factor and sample stock factor load is as follows:

f 1 ~ 0.84(600028)+0.84(600808)+0.83(600688)+0.82(6000 19)+0.65(600026)+0.6 1(600569)+0.6 1(600050)+0.55(60000

F2 ~ 0.88(600602)+0.86(600832)+0.85(600637)+0.78(600 100)+0.69(600 17 1)+0.49(60060 1)

F3 ~ 0.8 1(600609)+0.75(600805)+0.63(600 104)

F4 ~ 0.76(6007 17)+0.67(6000 18)+0.46(600009)

F5~0.88(600664)+0.85(600038)

F6 ~ 0.66(600839)+0.49(600033)+0.46(600008)

F7 ~ 0.72(60059 1)+0.67(60022 1)

F8 ~ 0.56(600795)+0.55(6000 1 1)+0.52(600642)

F9~0.83(600643)

f 10 ~ 0.75(600887)+0.40(600597)

f 1 1 ~ 0.80(6000 16)

f 12 ~ 0.8 1(6008 1 1)

F 13~0.8 1(600652)

F 14~0.97(600006)

F 15~0.80(6008 12)

F 16~0.77(600705)

F 17~0.78(600895)

F 18~0.75(600863)

F 19~0.52(600098)

F20~ -

Observing the corresponding relationship between the sample stock list represented by the * * * same factor and the factor load, we can analyze and judge the key factors reflected by each * * * same factor as follows:

F 1 The corresponding sample stocks are: 600028 China Petrochemical, 600808 Maanshan Iron and Steel Co., Ltd., 600688 Shanghai Petrochemical, 6000 19 baoshan iron & steel, 600026 China Shipping Development, 600569 Anyang Iron and Steel, 600050 China Unicom, 60036 China Merchants Bank and 600356. These are well-known large-cap blue-chip stocks with excellent operating performance and high return on net assets, including several bank stocks, which can be said to be large-cap stocks in the large-cap market and blue-chip stocks in the blue-chip. We can define the factor F 1 as "large-cap dark blue stocks".

Sample stocks corresponding to F2 are: 600602 Radio and Television Electronics, 600832 Oriental Pearl, 600637 Radio and Television Information, 600 100 Tsinghua Tongfang, 600 17 1 Shanghai Belling, 60060 1 Founder Technology. These stocks are outstanding representatives of high-tech industries, mainly engaged in computers.

Sample stocks corresponding to F3 are: 600609 Jinbei Automobile, 600805 Da Yue Investment and 600 104 Shanghai Automobile, which are typical automobile stocks. With the rise of the automobile industry in recent years, the performance shows steady growth, and we can define the factor F3 as "blue chip of automobile".

The sample stocks corresponding to F4 are: 6007 17 Tianjin Port, 600 18 Shanghai Port Container and 600009 Shanghai Airport, which are closely related to the logistics and transportation of land, sea and air ports. We can define F4 factor as "port logistics unit".

F5' s corresponding sample stocks are: 600664 Harbin Pharmaceutical Group and 600038 Hafei Shares, which have obvious regional color and touch the pulse of the development of the old industrial base in Northeast China. We can define the factor F5 as "Northeast Old Industrial Stock".

F6' s corresponding sample shares are: 600839 Sichuan Changhong, 600033 Fujian Expressway and 600008, of which 600033 Fujian Expressway and 600008 are mainly engaged in public welfare undertakings and infrastructure. We can define F6 factor as "basic public welfare stock". However, 600839 Sichuan Changhong's main business is TV sets, air conditioners and other household appliances, with outstanding performance. Being classified into this category can be regarded as an exception caused by reasons other than statistics.

The sample stocks corresponding to F7 are: 60059 1 Shanghai Airlines, 60022 1 Hainan Airlines, two high-quality stocks in the domestic air transport industry. We can define F7 factor as "air transport unit".

The sample stocks corresponding to F8 are: 600795 Guodian Power, 60001kloc-0/Huaneng International, and 600642 Shenneng Shares, which obviously represent electric energy. We can define the factor F8 as "power share".

The sample stock corresponding to F9 is: 600643 Aijian, which is a non-bank financial stock among the 50 constituent stocks in Shanghai Stock Exchange. We can define factor F9 as "non-bank financial stocks".

The sample stocks corresponding to F 10 are: 600887 Yili and 600597 Bright Dairy, both leading dairy products. The consumption of dairy products is closely related to the daily life of ordinary people, and its performance also reflects the affluence of ordinary people's lives from a certain angle. We can define the factor F 10 as "dairy consumer stock".

F 1 1 The corresponding sample stocks are: 6000 16 Minsheng Bank, banking stocks. The sample stock corresponding to F 12 is: 6008 1 1 Oriental Group, which is a comprehensive stock and involved in the fields of finance, e-commerce, building materials and communication. The sample stock corresponding to F 13 is: 600652 Aishi, which is mainly engaged in computer hardware and network equipment. The sample stocks corresponding to F 14 are: 600006 Dongfeng Motor, automobile industry stocks. The sample stock corresponding to F 15 is: 6008 12 Huabei Pharmaceutical, which produces and sells pharmaceutical and chemical products. The sample stock corresponding to F 16 is: 600705 North Asia Group, mainly engaged in transportation, logistics and trade. The sample stocks corresponding to F 17 are: 600895 Zhangjiang Hi-Tech, real estate stocks. The sample stock corresponding to F 18 is: 600863 Inner Mongolia Huadian, mainly engaged in dynamic power generation and heating. The sample stock corresponding to F 19 is: 600098 Guangzhou Holdings, which is engaged in comprehensive stocks such as energy, logistics and infrastructure. The stocks represented by these factors are highly targeted. Although some stocks can be attributed to the above factors, from a statistical point of view, they should be listed separately to ensure a complete reflection of the original information. The factor loads of the sample stocks corresponding to F20 are all less than 0.4, indicating that the explanatory power is very small, and the sample stocks reflected are scattered, so they have no analytical value from a professional point of view, so they are excluded.

As for Baiyun Airport (600004), Huaxia Bank (6000 15), China Southern Airlines (600029), CITIC Securities (600030) and Changjiang Electric Power (600900), which were rejected because of their short listing time, we can use our professional knowledge to classify them and verify them in future analysis. Baiyun Airport (600004) is mainly engaged in airport logistics, which can be classified as F4; Huaxia Bank (6000 15) is a banking stock, which can be classified as F11; China Southern Airlines (600029) is mainly engaged in air transportation, which can be classified as F7; CITIC Securities (600030) is a non-bank financial stock, which can be classified as F9, and Changjiang Electric Power (600900) is mainly engaged in electric energy, which can be classified as F8.

To sum up, through the factor analysis of the daily return data of the constituent stocks of the SSE 50 Index, we have extracted the same factor of 19 * * * from F 1 to F 19, which represents the key factor of 19 that affects the trend of the SSE 50 Index. The multi-factor model is constructed as follows:

index 50 = 0.423 1 * f 1+0.0685 * F2+0.0454 * F3+0.032 1 * F4+0.0286 * F5+0.0239 * F6+0.0237 * F7+0.02 13 * F8+0.0204 * F9+0.0 184 * f 654

Correlation test and regression analysis

We combine 19 representative sample stocks extracted from factor analysis to construct Portfolio50, and the weight of each sample stock is equal to the percentage of variance explained by each * * * factor in the cumulative percentage. For example, the weight of the factor F 1 is equal to (42.311/83.464 = 0.5069). Considering that the number of stocks represented by F 1 factor is relatively large and the weight ratio is relatively large, the first four stocks are selected, and the weight of each stock is one quarter of the weight of F 1 factor, with a total of 22 sample stocks.

The composition of the portfolio is as follows:

portfolio 50 = 0. 1267 *(600028)+(600808)+(600688)+(6000 19))+0.082 1 *(600602)+0.0544 *(600609)+0.0384 *(6007 17)+0.0342

The correlation test table of Portfolio50 and Index50 (omitted) shows that the correlation coefficient of Portfolio50 and Index50 is 0.943 at the confidence level of 0.0 1, which indicates that Portfolio50 is highly correlated with Index50.

Table 4 Regression Model and Test Results Table

Sum of squares of model.

1 regression 0.0251.0251238.863.000

Residual .003153.000

Total .028154

Table 5 Regression Coefficient and Test Results Table

Non-standardized coefficient of model. be relevant

B standard. Zero-order part of error β

1 (constant) 7.235E-04 .000 2.004 .047

index 50 1 . 02 1 . 029 . 943 35 . 197 . 000 . 943 . 943。

From the regression model and test results (Table 4), we can see that the regression model has obvious statistical significance. It can be seen from the regression coefficient and test results (Table 5) that the coefficient b of the regression model has obvious statistical significance, and the value of b is 1.02 1. Although the test of constant term is not statistically significant, it is irrelevant. For common sense, we usually keep it in the equation, and the value is 0.0007235.

Based on this, we can establish the following regression model:

Combination 50 = 0.0007235+1.021* (index 50)

Where: constant term a=0.0007235, which is very close to 0, and regression coefficient b= 1.02 1, which is also close to 1. So we can think that Portfolio50≈Index50.

Finally, we evaluate and analyze the effectiveness of regression model fitting (the process is abbreviated). From the goodness of fit briefing and Durbin-Watson statistics of the fitting model, it is known that the determined coefficient R2 is 0.89, and the adjusted determined coefficient R2 is 0.889, which shows that the fitting effect of the model is remarkable. The Durbin-Watson statistic is 1.786, and the value is around 2. It can be seen that there is no obvious correlation between residuals. In order to further analyze the normality of the model, that is, whether the residual ε obeys normal distribution, we have made a histogram of residual distribution and a normal PP diagram (see figure 1 and figure 2). It can be seen that the residual of this model basically obeys normal distribution.

Figure 1 histogram of residual distribution Figure 2 Normal PP diagram of residual.

conclusion

Based on the above empirical research, we draw the following conclusions:

1. During the period from 65438+February 3, 2002 to 65438+March 8, 2004, the returns of 50 sample stocks of the SSE 50 Index were affected by the key factor of 19. The most representative of these 19 key factors are 22 sample stocks such as 600028 China Petrochemical and 600602 Radio and Television Electronics. From another perspective, the overall trend of these 22 sample stocks basically reflects the trend of 50 sample stocks in the SSE 50 Index.

2. The key factors affecting the SSE 50 Index have a strong plate effect. The stock trends with the same or similar enterprise nature, main business, regional characteristics and operating performance are highly correlated and can be classified as the same key factor. At the same time, however, individual stocks performed equally well. Almost every department has a unique performance. Due to many reasons such as management and capital operation, these maverick stocks have stepped out of their own characteristics and become an indispensable bright spot in the market, making important contributions to the index.

3. Judging from the influence of individual stocks on the key factors of SSE 50 Index, if the number of sample stocks represented by a key factor is small, it means that these sample stocks are more representative. On the contrary, if a key factor represents a large number of sample stocks, it means that these sample stocks are replaceable, that is to say, if the portfolio needs to be adjusted, it can be adjusted among the factors representing most sample stocks without affecting the representativeness of the portfolio.

4. If you want to index the SSE 50 Index, you don't need to invest in all 50 sample stocks, but only need to invest in 22 key sample stocks that best represent the key factors of 19. The investment portfolio is as follows: investment portfolio 50 = 0.1267 * (600028)+(600808)+(600688)+(600019))+0.0821* (600602). The test results show that the return rate of Portfolio50 constructed by these 22 representative key sample stocks basically reflects the return rate of Index50 of SSE 50 index, and their risks are at the same level, that is, Portfolio50 can be used to replace SSE 50 index for indexed investment. In addition, because the stocks represented by the same key factor are substitutable, the structure of portfolio Portfolio50 is more flexible, and we can adjust portfolio Portfolio50 according to the specific situation of the market without affecting its reflection on the index.

The above conclusions show that we have verified the key factor fitting index investment method from the perspective of empirical research, that is, index investment does not have to completely copy the stock index, and there are key factors in the stock index. The investment portfolio constructed by these key factors can fit the corresponding stock index for indexed investment. This method can be applied to various indexes, and the operation is flexible and active. Fund managers can also combine other analytical tools to adjust the fitted portfolio according to the specific situation of the market, so as to achieve the best investment performance.