{"id":264,"date":"2020-12-09T05:45:49","date_gmt":"2020-12-08T20:45:49","guid":{"rendered":"https:\/\/www.ritzcolor.net\/?p=264"},"modified":"2020-12-09T05:52:38","modified_gmt":"2020-12-08T20:52:38","slug":"kaggle%e3%83%88%e3%83%a9%e3%82%a4-home-credit-4","status":"publish","type":"post","link":"https:\/\/www.ritzcolor.net\/?p=264","title":{"rendered":"kaggle\u30c8\u30e9\u30a4 -Home credit 4-"},"content":{"rendered":"\n<p>\u5c11\u3057\u9593\u304c\u7a7a\u3044\u3066\u3057\u307e\u3044\u307e\u3057\u305f\u304c\u3001\u30d9\u30fc\u30b9\u30e2\u30c7\u30eb\u3092\u4f5c\u6210\u3057\u307e\u3057\u305f\u306e\u3067\u3001\u305d\u306e\u30b3\u30fc\u30c9\u3068\u6d41\u308c\u3092\u7d39\u4ecb\u3057\u307e\u3059\u3002\u4e0b\u8a18\u306e\u624b\u9806\u306e\u3046\u3061\u30013,4,5,6\u3092\u3055\u3055\u3063\u3068\u884c\u3044\u30017\u3092\u5b9f\u884c\u3057\u305f\u3068\u3044\u3046\u5f62\u306b\u306a\u308a\u307e\u3059\u3002<\/p>\n\n\n\n<ol id=\"block-797a07eb-7056-4722-bc67-9e6130130475\"><li>\u660e\u3089\u304b\u306b\u3057\u305f\u3044\u554f\u3044\u3084\u3001\u554f\u984c\u306e\u5b9a\u7fa9<\/li><li>\u8a13\u7df4\u304a\u3088\u3073\u30c6\u30b9\u30c8\u30c7\u30fc\u30bf\u306e\u53d6\u5f97<\/li><li>\u30c7\u30fc\u30bf\u306e\u6574\u5f62\u3001\u4f5c\u6210\u3001\u30af\u30ec\u30f3\u30b8\u30f3\u30b0<\/li><li>\u30d1\u30bf\u30fc\u30f3\u306e\u5206\u6790\u3001\u7279\u5b9a\u3001\u307e\u305f\u63a2\u7d22\u7684\u306b\u30c7\u30fc\u30bf\u3092\u5206\u6790\u3059\u308b<\/li><li>\u554f\u984c\u306e\u30e2\u30c7\u30eb\u5316\u3001\u4e88\u6e2c\u3001\u89e3\u6c7a<\/li><li>\u554f\u984c\u89e3\u6c7a\u306e\u30b9\u30c6\u30c3\u30d7\u3068\u6700\u7d42\u7684\u306a\u89e3\u6c7a\u65b9\u6cd5\u3092\u8996\u899a\u5316\u3001\u5831\u544a<\/li><li>\u7d50\u679c\u306e\u63d0\u51fa<\/li><\/ol>\n\n\n\n<p>\u6700\u521d\u306bcsv\u306e\u7a2e\u985e\u3092\u304a\u3055\u3089\u3044\u3067\u3059\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-python\" data-lang=\"Python\"><code>train_df = pd.read_csv(&quot;application_train.csv&quot;)\ntest_df = pd.read_csv(&quot;application_test.csv&quot;)\nbur = pd.read_csv(&quot;bureau.csv&quot;)\nbur_b = pd.read_csv(&quot;bureau_balance.csv&quot;)\ncd_b = pd.read_csv(&quot;credit_card_balance.csv&quot;)\nins_p = pd.read_csv(&quot;installments_payments.csv&quot;)\npos_c = pd.read_csv(&quot;POS_CASH_balance.csv&quot;)\nprv_a = pd.read_csv(&quot;previous_application.csv&quot;)<\/code><\/pre><\/div>\n\n\n\n<p>\u4e0a\u8a18\u306e\u3046\u3061\u3001\u307e\u305a\u306ftrain_df\u3068test_df\u4ee5\u5916\u3092\u6574\u5f62\u3057\u3066\u3044\u304d\u307e\u3059\u3002\u6700\u521d\u306fbureau\u3068bureau_balance\u306b\u3064\u3044\u3066\u3002\u3053\u3061\u3089\u306f\u30ab\u30fc\u30c9\u306e\u8abf\u67fb\u4f1a\u793e\u306e\u60c5\u5831\u306e\u3088\u3046\u3067\u4e8c\u3064\u306fSK_ID_BUREAU\u3068\u3044\u3046ID\u3067\u3064\u306a\u304c\u3063\u3066\u3044\u308b\u306e\u3067\u3001\u3053\u308c\u3092\u7d50\u5408\u3057\u307e\u3059\u3002\u4eca\u6c17\u3065\u304d\u307e\u3057\u305f\u304c\u3001\u305b\u3063\u304b\u304f\u4f5c\u3063\u305f\u306e\u306b\u3053\u306e\u30c6\u30fc\u30d6\u30eb\u3092train, test\u306b\u7d50\u5408\u3059\u308b\u306e\u5fd8\u308c\u3066\u307e\u3057\u305f\u3002\u6b21\u56de\u4fee\u6b63\u3002\u3002\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-python\" data-lang=\"Python\"><code>bur_com = pd.merge(bur, bur_b, on=&quot;SK_ID_BUREAU&quot;, how=&quot;left&quot;)<\/code><\/pre><\/div>\n\n\n\n<p>\u7d9a\u3044\u3066\u3001\u6b8b\u308a4\u3064\u306ecredit_card_balance.csv\u3001installments_payments.csv\u3001 POS_CASH_balance.csv\u3001previous_application.csv\u3092\u6574\u5f62\u3057\u3066\u3044\u304d\u307e\u3059\u3002\u3053\u308c\u3089\u306fSK_ID_CURR\u3068SK_ID_PREV\u304c\u30ad\u30fc\u3068\u306a\u3063\u3066\u3044\u308b\u3088\u3046\u3067\u3059\u3002<\/p>\n\n\n\n<p>\u4e00\u65b9\u3067\u3001train_df, test_df\u306fSK_ID_CURR\u3067\u30e6\u30cb\u30fc\u30af\u306b\u306a\u3063\u3066\u3044\u308b\u306e\u3067\u5f8c\u3005\u7d50\u5408\u3059\u308b\u3053\u3068\u3092\u8003\u3048\u308b\u3068\u305d\u3061\u3089\u306b\u5408\u308f\u305b\u3066SK_ID_CURR\u3067\u30e6\u30cb\u30fc\u30af\u306b\u3057\u3066\u304a\u304f\u3068\u3084\u308a\u3084\u3059\u305d\u3046\u3067\u3059\u3002<\/p>\n\n\n\n<p>\u307e\u305f\u4e2d\u8eab\u3082\u6708\u6b21\u6b8b\u9ad8\u306a\u3069\u306a\u306e\u3067\u3001SK_ID_CURR\u3067\u5e73\u5747\u5316\u3057\u3066\u304a\u304d\u307e\u3059\u3002\u3053\u3053\u3067\u6570\u5024\u306e\u60c5\u5831\u306f\u5e73\u5747\u5316\u3055\u308c\u3066\u307e\u3059\u304c\u3001\u30aa\u30d6\u30b8\u30a7\u30af\u30c8\u306e\u60c5\u5831\u306f\u7834\u68c4\u3055\u308c\u3066\u307e\u3059\u3002\u30ab\u30e9\u30e0\u3092\u5206\u5272\u3059\u308b\u306a\u3069\u3092\u3057\u3066\u60c5\u5831\u3092\u6709\u52b9\u6d3b\u7528\u3059\u3079\u304d\u306a\u306e\u3067\u3001\u53cd\u7701\u30dd\u30a4\u30f3\u30c8\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-python\" data-lang=\"Python\"><code>cd_b2 = cd_b.groupby([&quot;SK_ID_CURR&quot;], as_index=False).mean()\nins_p2 = ins_p.groupby([&quot;SK_ID_CURR&quot;], as_index=False).mean()\npos_c2 = pos_c.groupby([&quot;SK_ID_CURR&quot;], as_index=False).mean()\nprv_a2 = prv_a.groupby([&quot;SK_ID_CURR&quot;], as_index=False).mean()<\/code><\/pre><\/div>\n\n\n\n<p>\u5408\u308f\u305b\u3066train, test\u3068\u7d50\u5408\u3057\u305f\u3068\u304d\u306b\u91cd\u8907\u3059\u308b\u30ab\u30e9\u30e0\u304c\u3042\u308b\u3068\u30ab\u30e9\u30e0\u540d+x, \u30ab\u30e9\u30e0\u540d+y\u306e\u3088\u3046\u306b\u3001\u51fa\u3069\u3053\u308d\u3082\u308f\u304b\u3089\u306a\u304f\u306a\u3063\u3066\u3057\u307e\u3046\u306e\u3067\u3001\u4e88\u3081\u30ab\u30e9\u30e0\u540d\u3092\u5909\u66f4\u3057\u3066\u304a\u304d\u307e\u3059\u3002\u3055\u3089\u306b\u7d50\u5408\u5f8c\u306eSK_ID_PREV\u306f\u4e0d\u8981\u306e\u305f\u3081\u524a\u9664\u3057\u307e\u3059\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-python\" data-lang=\"Python\"><code>#SK_ID_PREV\u306fID\u3067\u4e0d\u8981\u306e\u305f\u3081\u524a\u9664\u3059\u308b\ncd_b2 = cd_b2.drop(&quot;SK_ID_PREV&quot;, axis=1)\nins_p2 = ins_p2.drop(&quot;SK_ID_PREV&quot;, axis=1)\npos_c2 = pos_c2.drop(&quot;SK_ID_PREV&quot;, axis=1)\nprv_a2 = prv_a2.drop(&quot;SK_ID_PREV&quot;, axis=1)\n\n#\u30ab\u30e9\u30e0\u540d\u304c\u91cd\u8907\u3059\u308b\u305f\u3081\u3001\u3069\u3053\u306e\u3082\u306e\u304b\u308f\u304b\u308b\u3088\u3046\u306b\u540d\u79f0\u5909\u66f4\ncd_b3 = cd_b2.rename(columns={&quot;MONTHS_BALANCE&quot;:&quot;MONTHS_BALANCE_CRE&quot;, &quot;SK_DPD&quot; : &quot;SK_DPD_CRE&quot;, &quot;SK_DPD_DEF&quot; : &quot;SK_DPD_DEF_CRE&quot;})<\/code><\/pre><\/div>\n\n\n\n<p>\u3053\u3053\u307e\u3067\u4f5c\u3063\u305f\u30c6\u30fc\u30d6\u30eb\u3092\u7d50\u5408\u3057\u307e\u3059\u3002\u4e0b\u8a18\u306f\u4fee\u6b63\u3057\u3066\u3044\u307e\u3059\u304chow\u3092right\u306b\u3057\u3066\u3044\u307e\u3057\u305f\u3002right\u3060\u3068\u3001\u5148\u306b\u8a18\u8ff0\u3057\u305f\u30c6\u30fc\u30d6\u30eb(\u4e0b\u8a18\u3067\u306fins_p2\u3084df)\u3092\u53f3\u304b\u3089\u304f\u3063\u3064\u3051\u308b\u305f\u3081\u5927\u91cf\u306eNaN\u304c\u767a\u751f\u3057\u3066\u3057\u307e\u3044\u307e\u3059\u3002\u5f8c\u307b\u3069\u5927\u91cf\u306e\u6b20\u640d\u5024\u51e6\u7406\u3092\u3059\u308b\u3053\u3068\u306b\u306a\u3063\u3066\u307e\u3059\u3002\u3053\u308c\u307e\u305f\u53cd\u7701\u3002\u3002\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-python\" data-lang=\"Python\"><code>#\u7d50\u5408\u3059\u308b\u305f\u3081\u306b\ndf = pd.merge(ins_p2, cd_b3, on=&quot;SK_ID_CURR&quot;, how=&quot;left&quot;)#\u91cd\u8907\u306a\u3057. df.info()\u3067x, y\u304c\u3042\u308b\u304b\u30c1\u30a7\u30c3\u30af\ndf2 = pd.merge(df, pos_c2, on=&quot;SK_ID_CURR&quot;, how=&quot;left&quot;)#MONTHS_BALANCE, SK_DPD_x, SK_DPD_DEF_x \u304c\u91cd\u8907\ndf3 = pd.merge(df2, prv_a2, on=&quot;SK_ID_CURR&quot;, how=&quot;left&quot;)<\/code><\/pre><\/div>\n\n\n\n<p>\u7d9a\u3044\u3066\u3001train\u3068df3\u3092\u7d50\u5408\u3059\u308b\u305f\u3081\u306b\u3082\u3001\u6539\u3081\u3066\u91cd\u8907\u30ab\u30e9\u30e0\u3092\u51e6\u7406\u3057\u307e\u3059\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-python\" data-lang=\"Python\"><code>#\u30ab\u30e9\u30e0\u540d\u304c\u91cd\u8907\u3059\u308b\u305f\u3081\u3001\u3069\u3053\u306e\u3082\u306e\u304b\u308f\u304b\u308b\u3088\u3046\u306b\u540d\u79f0\u5909\u66f4\n#AMT_CREDIT_x, AMT_ANNUITY_x, AMT_GOODS_PRICE_x \u304c\u91cd\u8907\u3057\u3066\u3044\u308b\u306e\u3067\u3001\u7d50\u5408\u524d\u306b\u540d\u524d\u3092\u5909\u66f4\u3057\u3066\u304a\u304f\ndf4 = df3.rename(columns={&quot;AMT_CREDIT&quot;:&quot;AMT_CREDIT_PRE&quot;, &quot;AMT_AN\nNUITY&quot; : &quot;AMT_ANNUITY_PRE&quot;, &quot;AMT_GOODS_PRICE&quot; : &quot;AMT_GOODS_PRICE_PRE&quot;})<\/code><\/pre><\/div>\n\n\n\n<p>\u7d9a\u3044\u3066train, test\u3068\u7d50\u5408\u3057\u307e\u3059\u3002\u4e0b\u8a18\u30e1\u30e2\u306b\u66f8\u3044\u3066\u3042\u308a\u307e\u3059\u304c\u3001how\u306e\u65b9\u5411\u3092\u5b8c\u5168\u306b\u9593\u9055\u3063\u3066\u3044\u305f\u6a21\u69d8\u3067\u3059\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-plane\"><code>#train, test\u3068df4\u3092\u7d50\u5408\n#right\u306b\u3057\u305f\u3089train\u306b\u304a\u3044\u3066\u3001TARGET\u3067NA\u304c\u767a\u751f\u3002\u3069\u3046\u3084\u3089\u6700\u521d\u306b\u66f8\u3044\u305f\u65b9\u3092\u7d50\u5408\u3059\u308b\u65b9\u5411\u3089\u3057\u3044\ntrain_df2 =  pd.merge(train_df, df4, on=&quot;SK_ID_CURR&quot;, how=&quot;left&quot;)\ntest_df2 =  pd.merge(test_df, df4, on=&quot;SK_ID_CURR&quot;, how=&quot;left&quot;)<\/code><\/pre><\/div>\n\n\n\n<p>\u3053\u308c\u3067\u5fc5\u8981\u306a\u60c5\u5831\u3092train, test\u306b\u8ffd\u52a0\u3059\u308b\u3053\u3068\u304c\u3067\u304d\u307e\u3057\u305f\u3002\u73fe\u5728\u8003\u3048\u3066\u3044\u308b\u30e2\u30c7\u30eb\u306f\u30e9\u30f3\u30c0\u30e0\u30d5\u30a9\u30ec\u30b9\u30c8\u306a\u306e\u3067\u3001\u30aa\u30d6\u30b8\u30a7\u30af\u30c8\u306f\u30e9\u30d9\u30eb\u30b3\u30fc\u30c7\u30a3\u30f3\u30b0(\u4f8b\u3048\u3070MALE, FAMELE\u30920, 1\u306b\u5909\u63db\u3059\u308b\u306a\u3069)\u3092\u884c\u3044\u3001\u6570\u5024\u60c5\u5831\u306b\u5909\u63db\u3059\u308b\u5fc5\u8981\u304c\u3042\u308a\u307e\u3059\u3002<\/p>\n\n\n\n<p>\u4eca\u56de\u53cd\u6620\u3055\u305b\u308b\u3053\u3068\u306f\u3067\u304d\u307e\u305b\u3093\u3067\u3057\u305f\u304c\u3001\u6570\u5024\u5316\u3057\u3066\u3082\u9023\u7d9a\u6027\u3092\u6301\u305f\u305b\u3066\u306f\u884c\u3051\u306a\u304b\u3063\u305f\u306e\u3067\u3001\u305d\u308c\u3082\u5bfe\u5fdc\u5fc5\u8981\u3067\u3057\u305f\u3002\u4f8b\u3048\u3070\u7537\u6027:0, \u5973\u6027:1, \u672a\u56de\u7b54:2\u3068\u3044\u3046\u3075\u3046\u306b\u5909\u63db\u3057\u305f\u3068\u3057\u307e\u3059\u3002\u3053\u306e\u3068\u304d\u3001\u7537\u6027(0)\u3068\u672a\u56de\u7b54(2)\u306e\u5e73\u5747\u3092\u3068\u3063\u305f\u5834\u5408\u3001\u5973\u6027(1)\u3068\u3044\u3046\u4e88\u6e2c\u306b\u306a\u3063\u3066\u3057\u307e\u3044\u307e\u3059\u3002\u3053\u308c\u3067\u306f\u306a\u3093\u306e\u3053\u3068\u304b\u308f\u304b\u3089\u306a\u3044\u306e\u3067\u3001\u306a\u3093\u3089\u304b\u306e\u5bfe\u5fdc\u3092\u8003\u3048\u306a\u304f\u3066\u306f\u884c\u3051\u306a\u304b\u3063\u305f\u3067\u3059\u3002\u53cd\u7701\u3070\u304b\u308a\u3067\u3059\u304c\u3001\u53cd\u7701\u3002\u3002\u3002<\/p>\n\n\n\n<p>\u3068\u308a\u3042\u3048\u305a\u7121\u7406\u3084\u308a\u6570\u5024\u306b\u5909\u63db\u3057\u3066\u3044\u308b\u30b3\u30fc\u30c9\u304c\u3053\u3061\u3089\u3067\u3059\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-plane\"><code>#\u30aa\u30d6\u30b8\u30a7\u30af\u30c8\u3092\u542b\u3093\u3067\u3044\u308b\u30ab\u30e9\u30e0\u3092\u62bd\u51fa\u3002\ntrain_df2.select_dtypes(include=object)<\/code><\/pre><\/div>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-plane\"><code>#\u3069\u3093\u306a\u7a2e\u985e\u304c\u3042\u308b\u304b\u78ba\u8a8d\nprint(train_df2[&quot;NAME_CONTRACT_TYPE&quot;].value_counts())\n#Null\u304c\u3042\u308b\u3068\u5909\u63db\u3067\u304d\u306a\u3044\u70ba\u3001Unknown\u3068\u3057\u3066\u7f6e\u63db\ntrain_df2[&quot;NAME_CONTRACT_TYPE&quot;] = train_df2[&quot;NAME_CONTRACT_TYPE&quot;].fillna(&quot;Unknown&quot;)\n##Cash loans\u30921, Revolving\u30922\u3001Unknown\u30920\u3068\u3057\u3066\u7f6e\u63db\ntrain_df2[&quot;NAME_CONTRACT_TYPE&quot;] = train_df2[&quot;NAME_CONTRACT_TYPE&quot;].map( {&#39;Unknown&#39;: 0, &#39;Cash loans&#39;: 1, &#39;Revolving loans&#39;: 2} ).astype(int)<\/code><\/pre><\/div>\n\n\n\n<p>\u3053\u308c\u3092\u3042\u306815\u56de\u3084\u308b\u306e\u304b\u3068\u601d\u3063\u3066\u3044\u307e\u3057\u305f\u304c\u3001\u7c21\u5358\u306b\u5b9f\u884c\u3059\u308b\u30b3\u30fc\u30c9\u304c\u898b\u3064\u304b\u3063\u305f\u306e\u3067\u3001\u305d\u3061\u3089\u3092\u7d39\u4ecb\u3057\u307e\u3059\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-python\" data-lang=\"Python\"><code>#\u5909\u63db\u3057\u305f\u3044\u30ab\u30e9\u30e0\u3092\u30ea\u30b9\u30c8\u306b\u5165\u308c\u307e\u3059\u3002\u4e0b\u8a18\u306f\u5168\u3066\u3067\u306f\u306a\u3044\u306e\u3067\u6ce8\u610f\u3002\ncolumns = [&quot;NAME_TYPE_SUITE&quot;,&quot;NAME_INCOME_TYPE&quot;, &quot;NAME_EDUCATION_TYPE&quot;,&quot;NAME_FAMILY_STATUS&quot;, &quot;NAME_HOUSING_TYPE&quot;,&quot;OCCUPATION_TYPE&quot;, &quot;WEEKDAY_APPR_PROCESS_START&quot;,&quot;ORGANIZATION_TYPE&quot;,&quot;FONDKAPREMONT_MODE&quot;,&quot;HOUSETYPE_MODE&quot;, &quot;WALLSMATERIAL_MODE&quot;, &quot;EMERGENCYSTATE_MODE&quot;]<\/code><\/pre><\/div>\n\n\n\n<p>\u4e0b\u8a18\u3092\u5b9f\u884c\u3059\u308b\u3053\u3068\u3067\u3001\u305d\u308c\u305e\u308c\u3092\u30a8\u30f3\u30b3\u30fc\u30c9\u3057\u3066\u304f\u308c\u307e\u3059\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-python\" data-lang=\"Python\"><code>from sklearn.preprocessing import LabelEncoder\n\nfor c in columns:\n    lbl = LabelEncoder() \n    lbl.fit(list(train_df2[c].values))\n    train_df2[c] = lbl.transform(list(train_df2[c].values))\n\nfor c in columns:\n    lbl = LabelEncoder() \n    lbl.fit(list(test_df2[c].values))\n    test_df2[c] = lbl.transform(list(test_df2[c].values))    <\/code><\/pre><\/div>\n\n\n\n<p>\u6700\u5f8c\u306bNull\u78ba\u8a8d\u3057\u305f\u3068\u3053\u308d\u3001\u9014\u4e2d\u306e\u7d50\u5408\u30df\u30b9\u306e\u305b\u3044\u3067\u5927\u91cf\u306e\u6b20\u640d\u5024\u304c\u3002\u3002\u3002\u306a\u3093\u3068\u306a\u304f0\u3067\u7f6e\u63db\u3057\u3061\u3083\u3063\u3066\u307e\u3059\u304c\u3001\u672c\u6765\u5168\u3066\u306e\u6b20\u640d\u5024\u306e\u50be\u5411\u3092\u307f\u306a\u304c\u3089\u7269\u306b\u3088\u3063\u3066\u306f\u524a\u9664\u3082\u8003\u616e\u3057\u3064\u3064\u5e73\u5747\u3001\u4e2d\u592e\u5024\u306a\u3069\u306e\u51e6\u7406\u304c\u5fc5\u8981\u3067\u3059\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-plane\"><code>#\u6700\u7d42\u7684\u306bNull\u3092\u78ba\u8a8d\n#\u3068\u308a\u3042\u3048\u305aNulll\u3092\u5168\u30660\u3067\u7f6e\u63db\ntrain_df3 = train_df2.fillna(0)\ntest_df3 = test_df2.fillna(0)<\/code><\/pre><\/div>\n\n\n\n<p>\u3053\u3053\u307e\u3067\u6765\u305f\u3089\u3001\u6b20\u640d\u5024\u304c\u306a\u3044\uff0b\u30c7\u30fc\u30bf\u304c\u5168\u3066\u6570\u5024\u306b\u306a\u3063\u3066\u3044\u308b\u306e\u3067\u6a5f\u68b0\u5b66\u7fd2\u3092\u3068\u308a\u3042\u3048\u305a\u56de\u3059\u3053\u3068\u304c\u3067\u304d\u305d\u3046\u3067\u3059\u3002\u4eca\u56de\u306f\u30c7\u30fc\u30bf\u5b9a\u7fa9\u5f8c\u3001\u30e9\u30f3\u30c0\u30e0\u30d5\u30a9\u30ec\u30b9\u30c8\u3092\u5b9f\u884c\u3057\u3066\u3044\u307e\u3059\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-python\" data-lang=\"Python\"><code>from sklearn.ensemble import RandomForestClassifier\n\n#\u76ee\u7684\u3067\u3042\u308bTARGET\u4ee5\u5916\u3092X_train\u3068\u3057\u3066\u5b9a\u7fa9\nX_train = train_df3.drop(&quot;TARGET&quot;, axis=1)\n\n#\u76ee\u7684\u3067\u3042\u308bTARGET\u3092Y_train\u3068\u3057\u3066\u5b9a\u7fa9\nY_train = train_df3[&quot;TARGET&quot;]\n\n#test\u3092X_test\u3068\u3057\u3066\u5b9a\u7fa9\nX_test = test_df3\n\n#\u30c7\u30fc\u30bf\u5f62\u72b6\u3092\u78ba\u8a8d\u3002X_train\u3068X_test\u3067\u30ab\u30e9\u30e0\u6570\u304c\u6574\u5408\u3057\u3066\u306a\u3044\u3068\u30a8\u30e9\u30fc\u8981\u56e0\u3068\u306a\u308a\u307e\u3059\u3002\nprint(X_train.shape, Y_train.shape,X_test.shape)\n#\u51fa\u529b\u7d50\u679c\u2192171\u304c\u4e0d\u4e00\u81f4\u3060\u3068\u5f8c\u306b\u30a8\u30e9\u30fc\u306b\u306a\u308a\u307e\u3059\u3002(307511, 171) (307511,) (48744, 171)<\/code><\/pre><\/div>\n\n\n\n<p>\u30e9\u30f3\u30c0\u30e0\u30d5\u30a9\u30ec\u30b9\u30c8\u3067\u4e88\u6e2c\u3092\u5b9f\u884c\u3002\u4eca\u56de\u306f\u5404\u7a2e\u30d1\u30e9\u30e1\u30fc\u30bf\u3092\u8abf\u6574\u3057\u3066\u3044\u307e\u305b\u3093\u3002\u3053\u3061\u3089\u3082\u5f8c\u3005\u5fc5\u8981\u3067\u3059\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-python\" data-lang=\"Python\"><code># \u30e9\u30f3\u30c0\u30e0\u30d5\u30a9\u30ec\u30b9\u30c8\u306b\u3088\u308b\u5206\u985e\u30e2\u30c7\u30eb\u3092\u4f5c\u6210\u3057\u3001\u30e2\u30c7\u30eb\u306e\u7cbe\u5ea6\u3092\u78ba\u8a8d\u3057\u3066\u4e0b\u3055\u3044\u3002\nrandom_forest = RandomForestClassifier(n_estimators=100)\nrandom_forest.fit(X_train, Y_train)\n\nY_pred_r = random_forest.predict(X_test)\nrandom_forest.score(X_train, Y_train)\nacc_random_forest = round(random_forest.score(X_train, Y_train) * 100, 2)\n\nprint(acc_random_forest)<\/code><\/pre><\/div>\n\n\n\n<p>\u3053\u3053\u307e\u3067\u6765\u307e\u3057\u305f\u3089\u3001submission\u306b\u5408\u308f\u305b\u3066\u30c7\u30fc\u30bf\u3092\u51fa\u529b\u3057\u307e\u3059\u3002\u30ab\u30e9\u30e0\u540d\u79f0\u304csample_submission\u3068\u9055\u3046\u3068\u63d0\u51fa\u6642\u306b\u30a8\u30e9\u30fc\u306b\u306a\u308a\u307e\u3059\u306e\u3067\u3054\u6ce8\u610f\u3002<\/p>\n\n\n\n<div class=\"hcb_wrap\"><pre class=\"prism line-numbers lang-python\" data-lang=\"Python\"><code># CSV\u30d5\u30a1\u30a4\u30eb\u306b\u4fdd\u5b58\nsubmission = pd.DataFrame({\n        &quot;SK_ID_CURR&quot;: test_df3[&quot;SK_ID_CURR&quot;],\n        &quot;TARGET&quot;: Y_pred\n    })\n\nsubmission.to_csv(&#39;my_submission1.csv&#39;, index=False)<\/code><\/pre><\/div>\n\n\n\n<p>\u3068\u3044\u3046\u3053\u3068\u3067\u63d0\u51fa\u3057\u305f\u3068\u3053\u308d\u3001\u30b9\u30b3\u30a2\u306f0.50000\u3002\u7d42\u308f\u3063\u3066\u3044\u308b\u30b3\u30f3\u30da\u3060\u304b\u3089\u304b\u9806\u4f4d\u306f\u51fa\u305a\u3002\u6c17\u306b\u306a\u308b\u6240\u304c\u305f\u304f\u3055\u3093\u3042\u308b\u306e\u3067\u4fee\u6b63\u3092\u884c\u3063\u3066\u3044\u304d\u305f\u3044\u3068\u601d\u3044\u307e\u3059\u3002<\/p>\n\n\n\n<p>\u5177\u4f53\u7684\u306b\u601d\u3044\u3064\u304f\u4fee\u6b63\u5185\u5bb9\u306f\u3053\u3061\u3089\u3002<br>\u30fb\u9069\u3057\u305f\u7d50\u5408\u306e\u5b9f\u65bd-&gt;how=left\u306b\u3057\u3066\u3044\u305f\u306e\u3067\u6b20\u640d\u5024\u5927\u91cf<br>\u30fb\u5e73\u5747\u5316\u3057\u305f\u969b\u306e\u30aa\u30d6\u30b8\u30a7\u30af\u30c8\u60c5\u5831\u306e\u6271\u3044-&gt;\u4eca\u306f\u5168\u3066\u7834\u68c4\u3057\u3061\u3083\u3063\u3066\u307e\u3059<br>\u30fb\u30ab\u30e9\u30e0\u306e\u7d5e\u308a\u8fbc\u307f-&gt;171\u306e\u30ab\u30e9\u30e0\u3067\u51e6\u7406\u3002\u30d2\u30fc\u30c8\u30de\u30c3\u30d7\u306a\u3069\u3067\u7d5e\u308a\u8fbc\u307f\u30c8\u30e9\u30a4<br>\u30fb\u30e9\u30d9\u30eb\u30a8\u30f3\u30b3\u30fc\u30c9\u306e\u9069\u5207\u51e6\u7406-&gt;\u96e2\u6563\u5024\u3068\u9023\u7d9a\u5024\u306e\u533a\u5225<br>\u30fb\u6a5f\u68b0\u5b66\u7fd2\u30e2\u30c7\u30eb\u3068\u30d1\u30e9\u30e1\u30fc\u30bf\u30c1\u30e5\u30fc\u30cb\u30f3\u30b0-&gt;\u30af\u30ed\u30b9\u30d0\u30ea\u30c7\u30fc\u30b7\u30e7\u30f3 \u306a\u3069\u30c8\u30e9\u30a4<\/p>\n\n\n\n<p>\u3068\u3044\u3046\u3053\u3068\u3067\u6b21\u56de\u4ee5\u964d\u3067\u4e0a\u8a18\u3092\u4fee\u6b63\u3057\u3066\u3044\u304d\u305f\u3044\u3068\u601d\u3044\u307e\u3059\u3002<\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u5c11\u3057\u9593\u304c\u7a7a\u3044\u3066\u3057\u307e\u3044\u307e\u3057\u305f\u304c\u3001\u30d9\u30fc\u30b9\u30e2\u30c7\u30eb\u3092\u4f5c\u6210\u3057\u307e\u3057\u305f\u306e\u3067\u3001\u305d\u306e\u30b3\u30fc\u30c9\u3068\u6d41\u308c\u3092\u7d39\u4ecb\u3057\u307e\u3059\u3002\u4e0b\u8a18\u306e\u624b\u9806\u306e\u3046\u3061\u30013,4,5,6\u3092\u3055\u3055\u3063\u3068\u884c\u3044\u30017\u3092\u5b9f\u884c\u3057\u305f\u3068\u3044\u3046\u5f62\u306b\u306a\u308a\u307e\u3059\u3002 \u660e\u3089\u304b\u306b\u3057\u305f\u3044\u554f\u3044\u3084\u3001\u554f\u984c\u306e\u5b9a\u7fa9 \u8a13\u7df4\u304a\u3088\u3073\u30c6 [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":175,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":""},"categories":[3],"tags":[5,6,7],"aioseo_notices":[],"jetpack_featured_media_url":"https:\/\/i0.wp.com\/www.ritzcolor.net\/wp-content\/uploads\/2020\/10\/character_program_smart.png?fit=400%2C400&ssl=1","_links":{"self":[{"href":"https:\/\/www.ritzcolor.net\/index.php?rest_route=\/wp\/v2\/posts\/264"}],"collection":[{"href":"https:\/\/www.ritzcolor.net\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ritzcolor.net\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ritzcolor.net\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ritzcolor.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=264"}],"version-history":[{"count":3,"href":"https:\/\/www.ritzcolor.net\/index.php?rest_route=\/wp\/v2\/posts\/264\/revisions"}],"predecessor-version":[{"id":267,"href":"https:\/\/www.ritzcolor.net\/index.php?rest_route=\/wp\/v2\/posts\/264\/revisions\/267"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.ritzcolor.net\/index.php?rest_route=\/wp\/v2\/media\/175"}],"wp:attachment":[{"href":"https:\/\/www.ritzcolor.net\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=264"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ritzcolor.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=264"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ritzcolor.net\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=264"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}