๐Ÿฅ
close
ํ”„๋กœํ•„ ์‚ฌ์ง„

๐Ÿฅ

  • ๋ถ„๋ฅ˜ ์ „์ฒด๋ณด๊ธฐ (65)
    • linux (7)
    • ์ปดํ“จํ„ฐ๊ตฌ์กฐ (0)
    • ์šด์˜์ฒด์ œ (3)
    • ๋ฐ์ดํ„ฐ (41)
      • Spark (23)
      • ํ•˜๋‘ก (10)
      • database (1)
    • aws (1)
    • ๊ธฐํƒ€ (7)
    • ๋„คํŠธ์›Œํฌ (1)
    • ์–ธ์–ด (0)
      • scala (0)
      • ์˜์–ด (0)
    • Generative AI (4)
  • ํ™ˆ

[Spark] TaskMemoryManager: Failed to allocate a page (2097152 bytes), try again. ์˜ค๋ฅ˜ ๋ฐœ์ƒ ์‹œ ํ•ด๊ฒฐ ๋ฐฉ๋ฒ•

๋ฎจ์ œ์ƒํ™ฉ dataframe ๋‘ ๊ฐœ๋ฅผ ์กฐ์ธํ•˜๋ ค๋Š”๋ฐ ์•„๋ž˜์™€ ๊ฐ™์€ ์›Œ๋‹์ด ์ฃผ๋ฅด๋ฅต ๋ฐœ์ƒํ•˜๋”๋‹ˆ ์˜ค๋ฅ˜๋ฅผ ์ถœ๋ ฅํ•˜๊ณ  ์–ดํ”Œ๋ฆฌ์ผ€์ด์…˜์ด ์ข…๋ฃŒ๋๋‹ค. ... 23/05/22 04:39:35 WARN TaskMemoryManager: Failed to allocate a page (2097152 bytes), try again. 23/05/22 04:39:35 WARN TaskMemoryManager: Failed to allocate a page (2097152 bytes), try again. 23/05/22 04:39:35 WARN TaskMemoryManager: Failed to allocate a page (2097152 bytes), try again. 23/05/22 04:39:36 WARN TaskMemoryMa..

  • format_list_bulleted ๋ฐ์ดํ„ฐ/Spark
  • · 2023. 5. 22.

parquet-cli๋ฅผ ํ†ตํ•ด parquet ํŒŒ์ผ์˜ ์—ฌ๋Ÿฌ๊ฐ€์ง€ ์ •๋ณด ํ™•์ธ (metadata, schema ๋“ฑ)

parquet-tools ๋ฅผ ์จ๋„ ๋˜๋Š”๋ฐ parquet-cli๊ฐ€ ์ข€ ๋” ๊ฐ€๋ฒผ์›Œ์„œ parquet-cli๋ฅผ ์„ค์น˜ํ–ˆ๋‹ค. (์Šคํ‚ค๋งˆ๋งŒ ํ™•์ธํ•˜๋ฉด ๋˜๋Š” ์‚ฌ๋žŒ...) (env) [testuser@test-server-1 ~]$ pip install parquet-cli Collecting parquet-cli Using cached parquet_cli-1.3-py2.py3-none-any.whl (3.6 kB) Collecting pyarrow>=0.9.0.post1 Using cached pyarrow-6.0.1-cp36-cp36m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (25.6 MB) Collecting pandas>=0.22.0 Using cached pandas-..

  • format_list_bulleted linux
  • · 2023. 5. 16.

[Python] ๊ฐ€์ƒํ™˜๊ฒฝ ๋งŒ๋“ค๊ธฐ (virtualenv ์„ค์น˜, ์ƒ์„ฑ, ํ™œ์„ฑํ™”, ์ œ๊ฑฐ)

virtualenv vs venv venv๋Š” ํ‘œ์ค€ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ผ ์„ค์น˜ ๋ถˆํ•„์š”ํ•˜์ง€๋งŒ virtualenv๋Š” ์™ธ๋ถ€ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋กœ ๋ณ„๋„ ์„ค์น˜ ํ•„์š” ์†๋„: virtualenv๊ฐ€ ๋” ๋น ๋ฆ„ ํ™•์žฅ์„ฑ: virtualenv๊ฐ€ ๋” ์ข‹์Œ virtualenv๋Š” python ๋ฒ„์ „์ด ๋‹ค๋ฅธ ๊ฐ€์ƒํ™˜๊ฒฝ์„ ๋งŒ๋“ค ์ˆ˜ ์žˆ์ง€๋งŒ venv๋Š” ๋ถˆ๊ฐ€๋Šฅ virtualenv๋Š” pip ์—…๊ทธ๋ ˆ์ด๋“œ ๊ฐ€๋Šฅํ•˜๋‚˜ venv๋Š” ๋ถˆ๊ฐ€๋Šฅ → virtualenv๊ฐ€ ๋” ์ข‹์•„๋ณด์—ฌ์„œ virtualenv๋ฅผ ์‚ฌ์šฉํ•˜๊ธฐ๋กœ ํ–ˆ๋‹ค. 1. virtualenv ์„ค์น˜ [testuser@test-server-1 ~]$ python3 -m pip install --user -U virtualenv Collecting virtualenv Downloading https://files.pythonhost..

  • format_list_bulleted linux
  • · 2023. 5. 16.

[Spark] CSV ํŒŒ์ผ ๋กœ๋“œํ•˜๊ธฐ

data_3 = hc.read.\ csv('/my/path/partition={PARTITION}/*'.format(PARTITION=my_partition), header=False, schema=customSchema) # header๊ฐ€ ์žˆ์„ ๊ฒฝ์šฐ schema๋Š” ๋”ฐ๋กœ ์—†์–ด๋„ ๋จ (header=True)

  • format_list_bulleted ๋ฐ์ดํ„ฐ/Spark
  • · 2023. 4. 25.
[Hive] set hive.msck.repair.batch.size ๋ช…๋ น์–ด ์‚ฌ์šฉ ๋ถˆ๊ฐ€ ์กฐ์น˜

[Hive] set hive.msck.repair.batch.size ๋ช…๋ น์–ด ์‚ฌ์šฉ ๋ถˆ๊ฐ€ ์กฐ์น˜

set hive.msck.repair.batch.size=1; set hive.msck.path.validation=ignore; Hive ์—์„œ ์œ„ ๋ช…๋ น์–ด ์‚ฌ์šฉ ์‹œ์— ์•„๋ž˜์™€ ๊ฐ™์€ ์˜ค๋ฅ˜๊ฐ€ ๋ฐœ์ƒํ–ˆ๋‹ค. Error: Error while processing statement: Cannot modify hive.msck.path.validation at runtime. It is not in list of params that are allowed to be modified at runtime (state=42000,code=1)โ€‹ hive-site์— ์•„๋ž˜ ์„ค์ •๊ฐ’ ์ถ”๊ฐ€ ํ›„ ์ •์ƒ ๋™์ž‘ ํ™•์ธํ•จ key: hive.security.authorization.sqlstd.confwhitelist.append value: h..

  • format_list_bulleted ๋ฐ์ดํ„ฐ/ํ•˜๋‘ก
  • · 2023. 4. 21.
TRINO -> Hive metastore ์‚ฌ์šฉ ์‹œ HIVE_METASTORE_ERROR ์˜ค๋ฅ˜ ์กฐ์น˜

TRINO -> Hive metastore ์‚ฌ์šฉ ์‹œ HIVE_METASTORE_ERROR ์˜ค๋ฅ˜ ์กฐ์น˜

io.trino.spi.TrinoException: testserver-1:9083: java.net.SocketTimeoutException: Read timed out at io.trino.plugin.hive.metastore.thrift.ThriftHiveMetastore.getPartitionNamesByFilter(ThriftHiveMetastore.java:1080) at io.trino.plugin.hive.metastore.thrift.BridgingHiveMetastore.getPartitionNamesByFilter(BridgingHiveMetastore.java:335) at io.trino.plugin.hive.metastore.ForwardingHiveMetastore.getPa..

  • format_list_bulleted ๋ฐ์ดํ„ฐ
  • · 2023. 4. 21.
  • navigate_before
  • 1
  • ยทยทยท
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • navigate_next
๊ณต์ง€์‚ฌํ•ญ
์ „์ฒด ์นดํ…Œ๊ณ ๋ฆฌ
  • ๋ถ„๋ฅ˜ ์ „์ฒด๋ณด๊ธฐ (65)
    • linux (7)
    • ์ปดํ“จํ„ฐ๊ตฌ์กฐ (0)
    • ์šด์˜์ฒด์ œ (3)
    • ๋ฐ์ดํ„ฐ (41)
      • Spark (23)
      • ํ•˜๋‘ก (10)
      • database (1)
    • aws (1)
    • ๊ธฐํƒ€ (7)
    • ๋„คํŠธ์›Œํฌ (1)
    • ์–ธ์–ด (0)
      • scala (0)
      • ์˜์–ด (0)
    • Generative AI (4)
์ธ๊ธฐ ๊ธ€
Copyright ยฉ •8• ๋ชจ๋“  ๊ถŒ๋ฆฌ ๋ณด์œ .
SKIN: Copyright ยฉ ์ญˆ๋ฏธ๋กœ์šด ์ƒํ™œ All rights reserved. Designed by JJuum.
and Current skin "dev-roo" is modified by Jin.

ํ‹ฐ์Šคํ† ๋ฆฌํˆด๋ฐ”