
https://db-benchmarks.com aims to make database and search engine benchmarks:
⚖️ Fair and transparent - it should be clear why a given database / search engine shows a given level of performance
High quality - controlling the coefficient of variation ensures the results stay the same whether you run the queries today, tomorrow or next week
Easily reproducible - anyone can reproduce any test on their own hardware
Easy to understand - the charts are as simple as possible
➕ Extensible - a pluggable architecture allows adding more databases for testing
And all of it stays 100% open source!
This repository provides the test framework that does the job.
Many database benchmarks are not objective. Others don't do enough to ensure the accuracy and stability of their results, which in some cases defeats the whole purpose of benchmarking. Some examples:
https://imply.io/blog/druid-nails-cost-efficiency-challenge-against-clickhouse-and-rockset/:
"In fact, we wanted to benchmark on the same hardware (m5.8xlarge), but the only pre-production configuration we had for m5.8xlarge was actually m5d.8xlarge ... Instead, we ran on c5.9xlarge instances"
Bad news, guys: when you run benchmarks on different hardware, the least you can't do is claim figures like "106.76%" and "103.13%". Even when you test on the same bare-metal server, it's hard to get the coefficient of variation below 5%, so a 3% difference caused by different servers can easily go unnoticed. Given all that, how can you be sure the final conclusions are correct?
https://tech.marksblogg.com/benchmarks.html
Mark has done a great job testing many different databases and search engines with the taxi rides dataset. But since the tests were run on different hardware, the numbers in the results table are not really comparable. You always need to keep that in mind when evaluating the results in the table.
https://clickhouse.com/benchmark/dbms/
When you run each query only 3 times, you are very likely to get a high coefficient of variation for each of them, which means that if you rerun the test a minute later you may get results that differ by 20%. And how can one reproduce the tests on their own hardware? Unfortunately, I couldn't find a way to do that.
We believe that a fair database benchmark should follow a few key principles:
✅ Test different databases on exactly the same hardware
Otherwise, when the difference is small, you can't confirm it exceeds the margin of error.
✅ Purge the full OS cache before each test
Otherwise you can't test cold queries.
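On Linux, purging the full OS cache can be done via /proc/sys/vm/drop_caches. The snippet below is a generic sketch of that approach (an assumption, not necessarily this framework's exact code) and requires root:

```shell
# Drop the OS page cache, dentries and inodes before a cold-query run.
# Generic Linux approach; an assumption, not this framework's exact implementation.
if [ "$(id -u)" -eq 0 ] && [ -w /proc/sys/vm/drop_caches ]; then
    sync                                # flush dirty pages to disk first
    echo 3 > /proc/sys/vm/drop_caches   # 3 = page cache + dentries + inodes
    msg="caches dropped"
else
    msg="skipped: requires root on Linux"
fi
echo "$msg"
```

Without this step, the second and later runs of a query read everything from the page cache and tell you nothing about cold-query performance.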
✅ The databases under test should have all their internal caches disabled
Otherwise you will be measuring cache performance.
✅ It's good to measure cold runs as well. This is especially important for analytical queries, where cold queries happen all the time
Otherwise you completely hide how well the database handles I/O.
✅ Nothing else should be running during the tests
Otherwise your test results may be very unstable.
✅ You need to restart the database before each query
Otherwise previous queries can still affect the current query's response time, even though the internal caches have been purged.
✅ You need to wait until the database fully warms up after starting
Otherwise you may end up competing with the database's I/O warm-up activity, which can heavily skew your test results.
✅ It's good to provide the coefficient of variation, so everyone understands how stable your results are and can make sure it's low enough
The coefficient of variation is a very good metric that shows how stable your test results are. If it's higher than N%, you can't say that one database is N% faster than another.
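For illustration, the coefficient of variation (standard deviation divided by the mean) can be computed from a set of response times like this; the timings below are made-up sample values:

```shell
# Coefficient of variation for a set of hypothetical query timings (ms)
timings="100 102 98 101 99"
cv=$(echo "$timings" | tr ' ' '\n' | awk '
    { sum += $1; sumsq += $1 * $1; n++ }
    END {
        mean = sum / n
        sd = sqrt(sumsq / n - mean * mean)   # population standard deviation
        printf "%.1f", 100 * sd / mean       # CV as a percentage
    }')
echo "CV = ${cv}%"   # prints "CV = 1.4%": low, so these runs are stable
```

A CV of around 1-2% like this means the measured difference between engines is likely real; a CV of 20% means it likely isn't.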
✅ It's better to test at a fixed CPU frequency
Otherwise, if you are using the "ondemand" CPU governor (usually the default), a 500 ms response time can easily turn into 1000+ ms.
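On Linux, fixing the CPU frequency usually means switching the cpufreq governor from "ondemand" to "performance". A generic sketch (requires root and cpufreq support; not tied to this framework):

```shell
# Pin every core to the "performance" governor so the frequency stays fixed.
set -- /sys/devices/system/cpu/cpu*/cpufreq/scaling_governor
if [ -w "$1" ]; then
    for f in "$@"; do
        echo performance > "$f"
    done
    status="governor set to performance on $# core(s)"
else
    status="skipped: cpufreq not writable (need root, or no cpufreq support)"
fi
echo "$status"
```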
✅ It's better to test on SSD/NVMe rather than HDD
Otherwise, depending on where a file sits on the HDD, you can get lower or higher I/O performance (we tested this), which can make at least your cold-query results wrong.
The test framework used behind https://db-benchmarks.com is fully open source (AGPLv3 license) and can be found at https://github.com/db-benchmarks/db-benchmarks. Here is what it does:
- In --limited mode it benchmarks the databases' algorithmic capabilities by emulating a single physical CPU core
- --test saves test results to a file
- --save uploads test results from a file to a remote database (which is not one of the databases under test)
- It runs select count(*) and then select * limit 1 to make sure the data collections are similar across the different databases
- It constrains the databases' resources via cpuset and mem
Before deploying the test framework, make sure you have the following:
PHP 8 with the curl and mysqli modules, docker, docker-compose, sensors, dstat, and cgroups v2.
Installation:
git clone git@github.com:db-benchmarks/db-benchmarks.git
cd db-benchmarks
Copy .env.example to .env. In .env, update mem and cpuset with the default amount of memory (in megabytes) and the CPUs the test framework may use for auxiliary tasks (data loading, getting info about the databases), and ES_JAVA_OPTS (normally the amount of memory the Docker machine is given).
First, you need to prepare a test:
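As an aside, here is what a hypothetical .env might look like for a machine that can spare 32 GB of RAM and two CPU cores for auxiliary tasks (the values are purely illustrative, not recommendations):

```shell
# .env - resource limits for the test framework's auxiliary tasks (illustrative values)
mem=32768                        # memory limit in megabytes
cpuset=0,1                       # CPU cores the framework may use
ES_JAVA_OPTS="-Xms16g -Xmx16g"   # JVM heap for Elasticsearch (illustrative)
```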
Go to the directory of a particular test (all tests must be in directories under ./tests), e.g. "hn_small":
cd tests/hn_small
Run the init script:
./init
This will:
Then run ../../test (it's in the root folder of the project) to see the options:
To run a particular test with specified engines, memory constraints and number of attempts and save the results locally:
/perf/test_engines/test
--test=test_name
--engines={engine1:type,...,engineN}
--memory=1024,2048,...,1048576 - memory constraints to test with, MB
[--times=N] - max number of times to test each query, 100 by default
[--dir=path] - if path is omitted - save to the 'results' directory in the same dir where this file is located
[--probe_timeout=N] - how long to wait for an initial connection, 30 seconds by default
[--start_timeout=N] - how long to wait for a db/engine to start, 120 seconds by default
[--warmup_timeout=N] - how long to wait for a db/engine to warm up after start, 300 seconds by default
[--query_timeout=N] - max time a query can run, 900 seconds by default
[--info_timeout=N] - how long to wait for getting info from a db/engine
[--limited] - emulate one physical CPU core
[--queries=/path/to/queries] - queries to test, ./tests/<test name>/test_queries by default
To save to db all results it finds by path
/perf/test_engines/test
--save=path/to/file/or/dir, all files in the dir recursively will be saved
--host=HOSTNAME
--port=PORT
--username=USERNAME
--password=PASSWORD
--rm - remove after successful saving to database
--skip_calm - avoid waiting until discs become calm
----------------------
Environment variables:
All the options can be specified as environment variables, but you can't use the same option as an environment variable and as a command line argument at the same time.
And run the test:
../../test --test=hn_small --engines=elasticsearch,clickhouse --memory=16384
If you run tests in local mode (for development) and don't care about test inaccuracy, you can skip waiting for the disks to calm down and the CPU checks by setting the --skip_inaccuracy flag:
../../test --test=hn_small --engines=elasticsearch,clickhouse --memory=16384 --skip_inaccuracy
Now you have the test results in ./results/ (in the root of the repository), e.g.:
# ls results/
220401_054753
Now you can upload the results to a database for further visualization. The visualization tool used at https://db-benchmarks.com/ is also open source and can be found at https://github.com/db-benchmarks/ui.
Here is how you can save the results:
username=login password=pass host=db.db-benchmarks.com port=443 save=./results ./test
or
./test --username=login --password=pass --host=db.db-benchmarks.com --port=443 --save=./results
We are eager to see your test results. If you believe they should be added to https://db-benchmarks.com, please submit them to this repository.
Please remember the following:
./results. We will then:
Project structure:
.
|-core <- Core directory, contains base files required for tests.
| |-engine.php <- Abstract class Engine. Manages test execution, result saving, and parsing of test attributes.
| |-helpers.php <- Helper file with logging functions, attribute parsing, exit functions, etc.
|-misc <- Miscellaneous directory, intended for storing files useful during the initialization step.
| |-func.sh <- Meilisearch initialization helper script.
|-plugins <- Plugins directory: if you want to extend the framework by adding another database or search engine for testing, place it here.
| |-elasticsearch.php <- Elasticsearch plugin.
| |-manticoresearch.php <- Manticore Search plugin.
| |-clickhouse.php <- ClickHouse plugin.
| |-mysql.php <- MySQL plugin.
| |-meilisearch.php <- Meilisearch plugin.
| |-mysql_percona.php <- MySQL (Percona) plugin.
| |-postgres.php <- Postgres plugin.
| |-typesense.php <- Typesense plugin.
|-results <- Test results directory. The results shown on https://db-benchmarks.com/ are found here. You can also use `./test --save` to visualize them locally.
|-tests <- Directory containing test suites.
| |-hn <- Hackernews test suite.
| | |-clickhouse <- Directory for "Hackernews test -> ClickHouse".
| | | |-inflate_hook <- Engine initialization script. Handles data ingestion into the database.
| | | |-post_hook <- Engine verification script. Ensures the correct number of documents have been ingested and verifies data consistency.
| | | |-pre_hook <- Engine pre-check script. Determines if tables need to be rebuilt, starts the engine, and ensures availability.
| | |-data <- Prepared data collection for the tests.
| | |-elasticsearch <- Directory for "Hackernews test -> Elasticsearch".
| | | |-logstash_tuned <- Logstash configuration directory for the "tuned" type.
| | | | |-logstash.conf
| | | | |-template.json
| | | |-elasticsearch_tuned.yml
| | | |-inflate_hook <- Engine initialization script for data ingestion.
| | | |-post_hook <- Verifies document count and data consistency.
| | | |-pre_hook <- Pre-check script for table rebuilding and engine initialization.
| | |-manticoresearch <- Directory for testing Manticore Search in the Hackernews test suite.
| | | |-generate_manticore_config.php <- Script for dynamically generating Manticore Search configuration.
| | | |-inflate_hook <- Data ingestion script.
| | | |-post_hook <- Verifies document count and consistency.
| | | |-pre_hook <- Pre-check for table rebuilds and engine availability.
| | |-meilisearch <- Directory for "Hackernews test -> Meilisearch".
| | | |-inflate_hook <- Data ingestion script.
| | | |-post_hook <- Ensures correct document count and data consistency.
| | | |-pre_hook <- Pre-check for table rebuilds and engine start.
| | |-mysql <- Directory for "Hackernews test -> MySQL".
| | | |-inflate_hook <- Data ingestion script.
| | | |-post_hook <- Ensures document count and consistency.
| | | |-pre_hook <- Pre-check for table rebuilds and engine start.
| | |-postgres <- Directory for "Hackernews test -> Postgres".
| | | |-inflate_hook <- Data ingestion script.
| | | |-post_hook <- Verifies document count and data consistency.
| | | |-pre_hook <- Pre-check for table rebuilds and engine availability.
| | |-prepare_csv <- Prepares the data collection, handled in `./tests/hn/init`.
| | |-description <- Test description, included in test results and used during result visualization.
| | |-init <- Main initialization script for the test.
| | |-test_info_queries <- Contains queries to retrieve information about the data collection.
| | |-test_queries <- Contains all test queries for the current test.
| |-taxi <- Taxi rides test suite, with a similar structure.
| |-hn_small <- Test for a smaller, non-multiplied Hackernews dataset, similar structure.
| |-logs10m <- Test for Nginx logs, similar structure.
|-.env.example <- Example environment file. Update the "mem" and "cpuset" values as needed.
|-LICENSE <- License file.
|-NOTICE <- Notice file.
|-README.md <- You're reading this file.
|-docker-compose.yml <- Docker Compose configuration for starting and stopping databases and search engines.
|-important_tests.sh
|-init <- Initialization script. Handles data ingestion and tracks the time taken.
|-logo.svg <- Logo file.
|-test <- The executable file to run and save test results.
You can also start a particular database/search engine manually, e.g.:
test=logs10m cpuset="0,1" mem=32768 suffix=_tuned docker-compose up elasticsearch
will:
- suffix=_tuned: map ./tests/logs10m/es/data/idx_tuned as the data directory
- mem=32768: limit the RAM to 32 GB; if not specified, the default from the .env file is used
- cpuset="0,1": run the Elasticsearch container only on CPU cores 0 and 1 (which is likely the first whole physical CPU)
To stop it, just press Ctrl-C.
Want to participate in the project? You can contribute:
All of these are waiting for your contributions!