๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ
Data Engineering

Windows 11์—์„œ Docker Desktop์œผ๋กœ pyspark-notebook ์„ธํŒ…ํ•˜๊ธฐ

by ํ–‰๋ฑ 2024. 10. 1.

1. Docker Desktop์„ ์„ค์น˜ํ•œ๋‹ค.

https://docs.docker.com/desktop/install/windows-install/

 

Windows

Get started with Docker for Windows. This guide covers system requirements, where to download, and instructions on how to install and update.

docs.docker.com

 

โ€ป ์•„๋ž˜ ๋งํฌ์—์„œ ๋‹ค์šด ๋ฐ›์€ exe ํŒŒ์ผ์€

    "ํ˜„์žฌ PC์—์„œ๋Š” ์ด ์•ฑ์„ ์‹คํ–‰ํ•  ์ˆ˜ ์—†์Šต๋‹ˆ๋‹ค" ๊ฐ€ ๋–ด๋Š”๋ฐ, ์œ„ ๋งํฌ์—์„œ๋Š” ๋ฐ”๋กœ ์‹คํ–‰์ด ๋๋‹ค.

https://www.docker.com/products/docker-desktop/

 

Docker Desktop: The #1 Containerization Tool for Developers | Docker

Docker Desktop is collaborative containerization software for developers. Get started and download Docker Desktop today on Mac, Windows, or Linux.

www.docker.com

 

 

2. Docker Desktop์„ ์‹คํ–‰ํ•˜๊ณ  Learning Center์˜ ํ”„๋กœ๊ทธ๋žจ 3๊ฐœ๋ฅผ ๋”ฐ๋ผ๊ฐ€๋ดค๋‹ค.

    welcome-to-docker ๋ผ๋Š” ์•ฑ์„ ์‹คํ–‰ํ•ด๋ณผ ์ˆ˜ ์žˆ๊ฒŒ ๋˜์–ด์žˆ๋‹ค!

 

 

3. Docker Desktop ์ƒ๋‹จ ๊ฒ€์ƒ‰๋ฐ” (๋˜๋Š” Ctrl+K) ์—์„œ pyspark-notebook์„ ๊ฒ€์ƒ‰ํ•˜๊ณ  Latest Run์„ ํด๋ฆญํ–ˆ๋‹ค.

    Optional Settings์—์„œ Host Port๋ฅผ 8888(:8888) ๋กœ ํ–ˆ๋‹ค.

 

 

4. ๊ทธ๋Ÿผ ์•ฑ์ด ์ž˜ ์‹คํ–‰๋˜๋ฉด์„œ Logs ํƒญ์— ๋กœ๊ทธ๋“ค์ด ๋ณด์ธ๋‹ค.

 

 

5. ๋ธŒ๋ผ์šฐ์ €์—์„œ localhost:8888์„ ์ณ์„œ ๋“ค์–ด๊ฐ€๋ฉด Jupyter๊ฐ€ ๋œฌ๋‹ค.

    ๊ทธ๋Ÿฐ๋ฐ ์—ฌ๊ธฐ์„œ (์บก์ฒ˜๋ฅผ ๋ชปํ–ˆ์ง€๋งŒ..) ํ† ํฐ์„ ์ž…๋ ฅํ•˜๋ผ๊ณ  ๋‚˜์˜ค๊ณ ,

    ํ† ํฐ์„ ํ™•์ธํ•˜๊ธฐ ์œ„ํ•ด์„  jupyter server list ์ปค๋งจ๋“œ๋ฅผ ์‹คํ–‰ํ•ด์„œ ์ถœ๋ ฅ๋˜๋Š” URL์—์„œ ํ† ํฐ์„ ๋ณต์‚ฌํ•ด์˜ค๋ผ๊ณ  ๋‚˜์˜จ๋‹ค.

 

 

6. ๊ทธ๋Ÿฌ๋ฉด ๋‹ค์‹œ Docker Desktop์œผ๋กœ ๊ฐ€์„œ Exec ํƒญ์— ๊ฐ€์„œ jupyter server list ์ปค๋งจ๋“œ๋ฅผ ์น˜๊ณ  ํ† ํฐ์„ ํ™•์ธ ํ›„ ๋ณต๋ถ™ํ•ด์ค€๋‹ค.

 

 

7. ๊ทธ๋Ÿผ Jupyter๊ฐ€ ์ž˜ ์ผœ์ง„๋‹ค.

 

 

8. Notebook - Python 3 (ipykernel) ์„ ์„ ํƒํ•˜๋ฉด Notebook์ด ๋œฌ๋‹ค.

   pyspark๋ฅผ ๋ฐ”๋กœ importํ•˜๊ณ  python ์ฝ”๋“œ๋ฅผ ์ž‘์„ฑํ•  ์ˆ˜ ์žˆ๋‹ค.

 

โ€ป ๊ฐ„๋‹จํ•œ pyspark ์ฝ”๋“œ๋Š” ์•„๋ž˜ ๋งํฌ์˜ 6๋ฒˆ์— ๋‚˜์˜ค๋Š” ์ฝ”๋“œ๋ฅผ ๊ทธ๋Œ€๋กœ ๋ณต๋ถ™ํ–ˆ๋‹ค.

https://it-sunny-333.tistory.com/88

 

jupyter ๋…ธํŠธ๋ถ์—์„œ pyspark ์‚ฌ์šฉํ•˜๊ธฐ

์œ— ๊ธ€์—์„œ docker๋กœ spark-hadoop-cluster๋ฅผ ๊ตฌ์„ฑํ–ˆ๋‹ค. ์—ฌ๊ธฐ์— jupyter ๋…ธํŠธ๋ถ์„ ๋ถ™์—ฌ์„œ pyspark๋ฅผ ์‚ฌ์šฉํ•ด๋ณธ๋‹ค. jupyter ๋…ธํŠธ๋ถ ์—ญ์‹œ ๋„์ปค ์ปจํ…Œ์ด๋„ˆ๋ฅผ ์‚ฌ์šฉํ•  ๊ฒƒ์ด๋‹น. 1. jupyter์šฉ ๋„์ปค ์ปจํ…Œ์ด๋„ˆ ์ƒ์„ฑ docker run -it

it-sunny-333.tistory.com

 

 

๋‚ด ์นœ๊ตฌ ๊ฝฅํ•˜๊ฐ€ ์•Œ๋ ค์ค€๋Œ€๋กœ ๊ทธ๋Œ€๋กœ ํ–ˆ๋Š”๋ฐ ๋๋‹ค...

์•„๋ž˜ ๋งํฌ๋ฅผ ์•Œ๋ ค์คฌ๋Š”๋ฐ ๋˜‘๊ฐ™์ด ํ•˜์ง€๋Š” ์•Š์•˜์Œ!

https://lamanus.kr/85

 

Docker๋กœ ์†์‰ฝ๊ฒŒ Jupyter์™€ Spark ์‚ฌ์šฉํ•˜๊ธฐ

์ด์ „์— ์œˆ๋„์šฐ(Windows) 10 ํ™˜๊ฒฝ์—์„œ ์ŠคํŒŒํฌ(Spark)๋ฅผ ์„ค์น˜ํ•˜๋Š” ๋ฐฉ๋ฒ•์— ๋Œ€ํ•œ ๊ธ€์„ ์ž‘์„ฑํ•œ ์ ์ด ์žˆ์—ˆ๋Š”๋ฐ, ํ•œ ๋ฒˆ ์…‹ํŒ…ํ•ด๋†“๊ณ  ์ž˜ ์“ฐ๋‹ค๊ฐ€ ์ปดํ“จํ„ฐ๋ฅผ ํฌ๋งทํ•œ๋‹ค๊ฑฐ๋‚˜ ํ•˜๋ฉด์„œ ํ™˜๊ฒฝ ๊ตฌ์„ฑ์ด ๋ฐ”๋€Œ๋ฉด ์—ฌ๊ฐ„ ๊ณจ์น˜์•„

lamanus.kr

 

๋Œ“๊ธ€