Skip to content

dhkdn9192/data_engineer_career

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

image

Table of Contents


1. Data Engineering

1-1. Hadoop

1-2. Spark

1-3. Kafka

1-4. Parquet

  • All About Parquet (link)
  • I spent 8 hours learning Parquet (link)

1-5. Iceberg

1-6. Airflow

1-7. Hive

  • HiveServer2 (link)
  • Hive Design and Architecture (link)
  • Hive ACID (link)
  • Hive Replication (link)
  • Hive Query Planner and Optimizer (link)
  • Partition, Bucket, Index
  • Partitioning vs Bucketing(CLUSTERED BY) (link1, link2)
  • Which is faster, SORT BY or ORDER BY in HiveQL?
  • What is HCatalog?
  • Hive UDF란?
  • Hive의 View와 Table
  • HiveQL Merge Into
  • STORED AS의 INPUTFORMAT, OUTPUTFORMAT, SERDE (link1, link2)

1-99. others


2. Cloud Computing

2-1. Docker and k8s

2-2. AWS


3. Back-end


4. Computer Science

4-1. Operation System

4-2. Database

4-3. Network

4-4. Data Structure and Algorithm

4-5. Programming Language

4-6. common

객체지향프로그래밍, 디자인패턴, 아키텍처패턴, 개발방법론, 소프트웨어공학 등

  • OOP
  • 객체-관계 매핑 (Object Relational Mapping, ORM) (link)
  • Lambda architecture (link)

5. etc