You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

101 lines
3.4 KiB

This file contains ambiguous Unicode characters!

This file contains ambiguous Unicode characters that may be confused with others in your current locale. If your use case is intentional and legitimate, you can safely ignore this warning. Use the Escape button to highlight these characters.

# 第三届阿里云磐久智维算法大赛(亚军)
## [官网地址](<https://tianchi.aliyun.com/competition/entrance/531947/introduction>)
## [官方数据地址](https://tianchi.aliyun.com/competition/entrance/531947/information?lang=zh-cn)
## 项目目录结构
```
├── Dockerfile
├── README.md
├── code
│   ├── catboost_fs.py +++++++++++++++++++++++++++++++ 模型训练代码
│   ├── generate_feature.py ++++++++++++++++++++++++++ 特征生成代码
│   ├── generate_pseudo_label.py ++++++++++++++++++++ 伪标签代码
│   ├── get_crashdump_venus_fea.py +++++++++++++++++++ 新数据特征生成代码
│   ├── requirements.txt +++++++++++++++++++++++++++++ python包版本
│   ├── stacking.py ++++++++++++++++++++++++++++++++++ 模型融合代码
│   └── utils.py +++++++++++++++++++++++++++++++++++++ 小工具脚本
├── data
│   ├── preliminary_a_test +++++++++++++++++++++++++++ 初赛A榜测试数据集
│   ├── preliminary_b_test +++++++++++++++++++++++++++ 初赛B榜测试数据集
│   └── preliminary_train ++++++++++++++++++++++++++++ 训练集数据
├── docker_push.sh +++++++++++++++++++++++++++++++++++++++++ Docker镜像构建、push脚本
├── feature
│   └── generation +++++++++++++++++++++++++++++++++++ 特征生成文件夹
├── log ++++++++++++++++++++++++++++++++++++++++++++++++++++ 日志文件夹
│   ├── catboost.log +++++++++++++++++++++++++++++++++ 模型运行日志
├── model ++++++++++++++++++++++++++++++++++++++++++++++++++ 模型文件
├── prediction_result ++++++++++++++++++++++++++++++++++++++ 模型预测结果文件夹
│   ├── cat_prob_result.csv ++++++++++++++++++++++++++ CATBOOST模型预测概率
│   ├── catboost_result.csv ++++++++++++++++++++++++++ CATBOOST模型预测结果
│   └── stacking_result.csv ++++++++++++++++++++++++++ 模型融合结果
├── run.log ++++++++++++++++++++++++++++++++++++++++++++++++ 代码运行日志
├── run.sh ++++++++++++++++++++++++++++++++++++++++++++++++ 代码运行脚本
├── tcdata ++++++++++++++++++++++++++++++++++++++++++++++++ 复赛测试集数据文件夹(具体文件请使用初赛相关文件更改文件名替换)
│   ├── final_crashdump_dataset_b.csv ++++++++++++++++ 复赛B榜新数据文件
│   ├── final_sel_log_dataset_b.csv ++++++++++++++++++ 复赛测试集日志文件
│   ├── final_submit_dataset_b.csv +++++++++++++++++++ 复赛测试集ID
│   └── final_venus_dataset_b.csv ++++++++++++++++++++ 复赛B榜新数据文件
├── user_data
│   └── tmp_data +++++++++++++++++++++++++++++++++++++ 临时文件
└── 答辩PPT
└── 悦智AI实验室_20220525.pdf
```
## 运行环境
Python版本为3.8各个Python包版本见requirements.txt使用如下命令即可安装
```
pip install -r code/requirements.txt
```
## 构建镜像运行代码
### 构建镜像
```
docker build -t [你的镜像仓库]:[TAG] .
```
### 运行镜像
```
docker run [你的镜像ID] sh run.sh
```
### push 镜像
```
docker push [你的仓库地址]:[TAG]
```
### 运行&push 镜像
```
bash docker_push.sh
```
## 运行代码
```
bash run.sh
```
## 终榜亚军
<img src="assets/亚军盖章.jpg" alt="亚军盖章" title="亚军盖章" width="500" height = "700" />