perfspec-learning/README.md

117 lines
4.3 KiB
Markdown
Raw Normal View History

2025-01-27 08:59:18 +00:00
---
gitea: none
include_toc: true
---
2025-01-20 04:31:16 +00:00
# PerfSPEC Learning Phase
2025-01-20 05:51:54 +00:00
Based in [PrefSPEC: Performance Profiling-based Proactive Security Policy Enforcement for Containers](https://ieeexplore.ieee.org/document/10577533) document presented in [1], thir repository contains source files used to generate and process data.
2025-01-27 08:21:39 +00:00
Main Reference: [PrefSPEC document](PerfSPEC.pdf) as [White paper](https://en.wikipedia.org/wiki/White_paper)
2025-01-20 07:53:45 +00:00
[Presentación in Spanish](presentacion.pdf)
2025-01-27 08:21:39 +00:00
[How to install](https://repo.jesusperez.pro/jesus/perfspec-learning/src/branch/main/install.md) covers basic enviroment,tools, and recommendations.
2025-01-20 08:26:40 +00:00
<div style="margin: auto">
2025-01-20 08:31:40 +00:00
<a target="_blank" href="perfspec-learning/src/branch/main/presentacion.pdf"><img src="imgs/prefSPEC-learning.png" width="800"></a>
2025-01-20 08:26:40 +00:00
</div>
2025-01-27 08:21:39 +00:00
__PerfSPEC__
2025-01-27 08:59:18 +00:00
>[!IMPORTANT]
With `PerfSPEC` [Security Policies](https://en.wikipedia.org/wiki/Security_policy) can be managed / watched in **Proactive** mode by using <u>ranking</u>, <u>learning</u> and <u>profiles</u> for safetiness and performance.
2025-01-27 08:21:39 +00:00
2025-01-27 08:59:18 +00:00
It has three phases:
2025-01-20 05:51:54 +00:00
- Ranking
- Learning
- Runtime
This repository is focused in __Learning__ phase with attention on:
- Event logs info load and process
- Predictive learning model
2025-01-27 08:59:18 +00:00
> [!NOTE]
> It is considered that __event data collection__ in `raw-audit-logs.log.xz` are realistic and representative to simulate
2025-01-20 05:51:54 +00:00
administrative operations.
## Files
2025-01-20 06:51:02 +00:00
2025-01-27 08:59:18 +00:00
### Data
2025-01-20 05:51:54 +00:00
- `raw-audit-logs.log` contains raw Kubernetes audit logs collected using the `audit-policy.yaml` audit policy.
2025-01-27 08:59:18 +00:00
### Layout
2025-01-20 06:57:26 +00:00
Tools are distributed in directories:
2025-01-20 06:58:41 +00:00
- [Collect](collect)
- [Process](process)
- [Learning](learning)
2025-01-20 06:57:26 +00:00
2025-01-27 08:59:18 +00:00
<details open>
<summary>Files layout</summary>
2025-01-27 09:01:46 +00:00
Content structure overview with notes
2025-01-27 08:59:18 +00:00
<pre>
├── PerfSPEC.pdf Reference document
├── README.md
├── about.md
├── actions_distribution.pdf Generated actions distribytion
├── collect Collect logs scripts
│   ├── audit-policy.yaml
│   ├── collect.py
│   └── helm-charts.json
├── data Extracted from compress archive
│   ├── actions-dataset-audit.txt
│   ├── actions-logs.log
│   ├── actions_distribution.pdf
│   ├── main-audit-logs.log
│   └── raw-audit-logs.log
├── data_sample.tar.xz Compress archive with 'data'
├── imgs
├── install.md Installation notes
├── intro.md
├── learning
│   └── python
│   ├── __pycache__ Ignored in git
│   ├── lib_perfspec.py
│   ├── model_perfspec.py
│   ├── prepare_perfspec.py
│   ├── run_perfspec.py
│   └── train_perfspec.py
├── models Extracted from compress archive
│   ├── checkpoints
│   │   ├── model_at_epoch_175.keras
│   │   └── model_at_epoch_185.keras
│   ├── history.json
│   └── perfSPEC_model.keras
├── models_sample.tar.xz Comperss archive with 'models'
├── presentacion.pdf Presentation slides
└── raw-audit-logs.log.xz Main Raw Logs file
</pre>
</details>
2025-01-27 08:21:39 +00:00
As some tasks can be used in [Python](https://python.org) or [Rust](https://www.rust-lang.org/) there are or will be directories for each programming languge inside directories tasks.
2025-01-20 06:57:26 +00:00
2025-01-27 08:21:39 +00:00
Each `task/programming-language` use a common __data__ directory where processing output files is generated.
2025-01-20 07:53:45 +00:00
2025-01-27 08:59:18 +00:00
## Collect data
2025-01-20 06:51:02 +00:00
2025-01-20 06:57:26 +00:00
If you wish to [collect](collect) your own dataset, there are several source files that might help:
2025-01-20 06:51:02 +00:00
- `collect/collect.py` is a script to trigger the installation and uninstallation of public Helm repositories.
- `collect/helm-charts.json` is a backup of Helm charts used at the time of the collection.
2025-01-27 08:59:18 +00:00
## Process data
2025-01-20 06:51:02 +00:00
2025-01-27 08:59:18 +00:00
## Learning
2025-01-20 06:51:02 +00:00
2025-01-20 05:51:54 +00:00
## Reference
[1]: [H. Kermabon-Bobinnec et al., "PerfSPEC: Performance Profiling-based Proactive Security Policy Enforcement for Containers," in IEEE Transactions on Dependable and Secure Computing, doi: 10.1109/TDSC.2024.3420712.](https://ieeexplore.ieee.org/document/10577533)