
Data Engineering (8)

Setting up pyspark-notebook with Docker Desktop on Windows 11
1. Install Docker Desktop: https://docs.docker.com/desktop/install/windows-install/ Note: the exe file downloaded from the link below failed with the message "This app can't run on your PC", whereas the installer from the link above ran right away. https://www.docker.com/products/docker-desktop/ Docker Desktop: The #1 Con..
2024. 10. 1.
NoSQL
Distributed KVS: Amazon DynamoDB. Wide column store: a NoSQL data store that holds dynamic columns by storing combinations of a row key and a column. "Dynamic columns" means that column names and formats can differ from row to row, even within the same table. Some stores also provide column families, which group columns; because the columns in a family are stored together, columns that are frequently used together should be placed in the same family. Apache Cassandra: a distributed wide column store (open source), TBD. Document store: a NoSQL data store that can store and query complex schemaless data such as JSON as-is. Like an RDBMS, it has indexes for fast lookups, and it can store similar documents together or (Embedd..
2024. 6. 13.
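The wide column store idea in the excerpt above can be sketched in a few lines of Python. This is only a toy in-memory model, not how Cassandra or any real store is implemented; the class name `WideColumnTable`, its methods, and the sample keys and values are all hypothetical names chosen for illustration.

```python
from collections import defaultdict


class WideColumnTable:
    """Toy model (hypothetical): a cell is addressed by row key + column family + column."""

    def __init__(self, families):
        self.families = set(families)
        # family -> row key -> {column name: value}; each family is kept in its own map,
        # mimicking the idea that columns of one family are stored together.
        self._data = {family: defaultdict(dict) for family in self.families}

    def put(self, row_key, family, column, value):
        if family not in self.families:
            raise KeyError(f"unknown column family: {family}")
        self._data[family][row_key][column] = value

    def get_family(self, row_key, family):
        # Reading one family touches only that family's map.
        return dict(self._data[family].get(row_key, {}))


table = WideColumnTable(["profile", "activity"])

# Dynamic columns: the two rows below do not share the same column names.
table.put("user:1", "profile", "name", "Kim")
table.put("user:1", "profile", "email", "kim@example.com")
table.put("user:2", "profile", "name", "Lee")
table.put("user:2", "profile", "phone", "010-0000-0000")
table.put("user:1", "activity", "last_login", "2024-06-13")

print(table.get_family("user:1", "profile"))
print(table.get_family("user:2", "profile"))
```

Because `name` and `email` sit in the same family, a profile read never has to look at the `activity` data, which is the motivation for grouping columns that are used together.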
Architecture of Greenplum Database
This post is a summary of the VMware Docs article "About the Greenplum Architecture". It is based on Chrome's Korean translation, with some of the wording changed. What is Greenplum Database? Greenplum Database is an MPP (massively parallel processing) database server based on PostgreSQL open-source technology. MPP (also called a shared-nothing architecture) refers to a system with two or more processors that cooperate to carry out an operation, where each processor has its own memory, operating system, and disks. Greenplum uses this high-performance system architecture to distribute the load of multi-terabyte data warehouses, and can use all of a system's resources in parallel to process a query..
2023. 11. 10.
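The shared-nothing idea described above can be illustrated with a small Python sketch. It is not Greenplum's implementation: the `Segment` class, the `distribute` helper, and the modulo-based row placement are assumptions made for this example, and threads inside one process only imitate workers that would really have their own memory and disks.

```python
from concurrent.futures import ThreadPoolExecutor


class Segment:
    """Hypothetical worker that owns its own slice of the rows; nothing is shared."""

    def __init__(self):
        self.rows = []

    def local_sum(self, column):
        # Each segment aggregates only the rows it owns.
        return sum(row[column] for row in self.rows)


def distribute(rows, segments, key):
    # Assumed placement rule for illustration: integer key modulo segment count.
    for row in rows:
        segments[row[key] % len(segments)].rows.append(row)


rows = [{"id": i, "amount": i * 10} for i in range(1, 9)]
segments = [Segment() for _ in range(4)]
distribute(rows, segments, "id")

# Every segment computes its partial result in parallel; a coordinator merges them.
with ThreadPoolExecutor(max_workers=len(segments)) as pool:
    partials = list(pool.map(lambda seg: seg.local_sum("amount"), segments))

total = sum(partials)
print(partials, total)
```

The point of the sketch is the shape of the work: the query is split into independent per-segment pieces, and only the small partial results travel back to be combined.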
Denormalization (meaning, purpose, alternatives, methods)
This post is a summary of the DATA ON-AIR article "Denormalization and Performance". Denormalization is a data modeling technique that applies duplication, merging, splitting, and similar changes to normalized entities, attributes, and relationships in order to improve system performance and simplify development and operations. Why denormalize? Data is duplicated, despite the risk of breaking data integrity, when reads are expected to perform poorly for one of three reasons: heavy disk I/O during lookups, join paths that are too long, or columns that must be computed at read time. Normalization by default improves the performance not only of inserts, updates, and deletes but also of reads. However, if only normalization is performed, the number of entities increases and..
2023. 11. 10.
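One of the cases named above, a column that has to be computed at read time, can be sketched with Python's built-in `sqlite3` module. The `orders` and `order_items` tables, their columns, and the sample rows are hypothetical and only serve to contrast the two read paths; this is a minimal sketch, not a recommendation from the summarized article.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer TEXT, total_amount INTEGER);
CREATE TABLE order_items (order_id INTEGER, item TEXT, price INTEGER, qty INTEGER);
INSERT INTO orders (order_id, customer) VALUES (1, 'Kim'), (2, 'Lee');
INSERT INTO order_items VALUES (1, 'pen', 100, 3), (1, 'book', 500, 1), (2, 'bag', 900, 2);
""")

# Normalized read: the total is derived on every query with a join and an aggregate.
normalized = conn.execute("""
    SELECT o.order_id, SUM(i.price * i.qty)
    FROM orders o JOIN order_items i ON i.order_id = o.order_id
    GROUP BY o.order_id
    ORDER BY o.order_id
""").fetchall()

# Denormalized: the derived total is stored redundantly in orders.total_amount.
# It must be refreshed whenever order_items changes, which is the integrity risk.
conn.execute("""
    UPDATE orders
    SET total_amount = (
        SELECT SUM(price * qty) FROM order_items i WHERE i.order_id = orders.order_id
    )
""")
denormalized = conn.execute(
    "SELECT order_id, total_amount FROM orders ORDER BY order_id"
).fetchall()

print(normalized)
print(denormalized)
```

Both queries return the same totals, but the second reads a single table with no join or aggregation, at the cost of keeping `total_amount` in sync by hand.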