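⚙ 5. Implementing Cost Minimization - Code

🔨 In the last post we wrote linear regression directly in code. This time, let's look at code that minimizes the cost that model produced.

1. Cost function / computing the cost

🔨 Import the required libraries and define the X and Y data.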

import tensorflow as tf
import numpy as np
import matplotlib as mpl
import matplotlib.pyplot as plt
%matplotlib inline

X = np.array([1,2,3])
Y = np.array([1,2,3])

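1.1 Implementing in Python

1.1.1. Writing the cost function

🔨 Cost function: the mean of the squared errors

$$cost(W) = \frac{1}{m} \sum_{i=1}^m(Wx_i - y_i)^2$$

🔨 Strictly speaking, the hypothesis has the form $H(x) = Wx + b$, but here we drop the bias term and work with the simplified $H(x) = Wx$.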
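
🔨 As a quick sanity check on the formula, here is the cost worked out by hand for the data used in this post, $X = Y = (1, 2, 3)$, at $W = 2$. The value $W = 2$ is only an illustrative choice and is not one of the grid values used later.

$$
\begin{aligned}
cost(2) &= \frac{1}{3}\left[(2\cdot 1 - 1)^2 + (2\cdot 2 - 2)^2 + (2\cdot 3 - 3)^2\right] \\
        &= \frac{1 + 4 + 9}{3} = \frac{14}{3} \approx 4.667
\end{aligned}
$$

Because $y_i = x_i$ for this particular data, the cost simplifies to $cost(W) = \frac{14}{3}(W - 1)^2$, a parabola whose minimum sits at $W = 1$.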

def cost_func(W,X,Y):
    c = 0
    for i in range(len(X)):
        c += np.square(W * X[i] - Y[i])
    return c / len(X)

1.1.2 Computing the cost

cost_values = []

# Define the W values: np.linspace() gives 15 evenly spaced values from -3 to 5
W_values = np.linspace(-3,5,num=15)
print("{:>6} | {:>10}".format("W","cost"))

# Print the cost for each predefined W value
for feed_W in W_values:
    curr_cost = cost_func(feed_W, X, Y)
    cost_values.append(curr_cost)
    print("{:6.3f} | {:10.5f}".format(feed_W, curr_cost))

>>
     W |       cost
-3.000 |   74.66667
-2.429 |   54.85714
-1.857 |   38.09524
-1.286 |   24.38095
-0.714 |   13.71429
-0.143 |    6.09524
 0.429 |    1.52381
 1.000 |    0.00000
 1.571 |    1.52381
 2.143 |    6.09524
 2.714 |   13.71429
 3.286 |   24.38095
 3.857 |   38.09524
 4.429 |   54.85714
 5.000 |   74.66667

🔨 Note that the cost reaches its minimum, 0, at W = 1.

plt.figure(figsize = (10,10))
plt.plot(W_values, cost_values, "b")
plt.ylabel('Cost(W)')
plt.xlabel('W')
plt.show()

🔨 Plotting the cost function shows that at W = 1 the slope is 0 and the cost takes its minimum value, 0.
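
🔨 The minimum can also be found programmatically instead of being read off the table or the plot. The sketch below is a dependency-free illustration, not the post's original code: `cost` and `linspace` are hypothetical helpers that mirror `cost_func` and `np.linspace` so that the block runs on its own.

```python
# Stand-alone sketch: locate the grid point with the smallest cost.
# `cost` and `linspace` are illustrative helpers, not part of the original post.

X = [1, 2, 3]
Y = [1, 2, 3]


def cost(W, xs, ys):
    """Mean squared error of the hypothesis H(x) = W * x."""
    return sum((W * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def linspace(start, stop, num):
    """Evenly spaced values including both endpoints (similar to np.linspace)."""
    step = (stop - start) / (num - 1)
    return [start + i * step for i in range(num)]


W_values = linspace(-3, 5, 15)
cost_values = [cost(W, X, Y) for W in W_values]

# Index of the smallest cost, then the W that produced it
best_index = min(range(len(cost_values)), key=cost_values.__getitem__)
best_W = W_values[best_index]
best_cost = cost_values[best_index]

print("best W = {:.3f}, cost = {:.5f}".format(best_W, best_cost))
```

A grid search like this only works because the grid happens to contain W = 1. Gradient descent, covered in section 2, does not depend on that.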

1.2. Implementing in TensorFlow

# Define the X, Y data
X = np.array([1,2,3])
Y = np.array([1,2,3])

# Cost function: mean of the squared errors, computed with TensorFlow ops
def cost_func_tf(W,X,Y):
    hypothesis = W*X
    return tf.reduce_mean(tf.square(hypothesis-Y))

# Define the W values: np.linspace() gives 15 evenly spaced values from -3 to 5
cost_values = []
W_values = np.linspace(-3,5,num=15)
print("{:>6} | {:>10}".format("W","cost"))

# Print the cost for each predefined W value
for feed_W in W_values:
    # .numpy() turns the scalar tensor into a plain number for formatting and plotting
    curr_cost = cost_func_tf(feed_W, X, Y).numpy()
    cost_values.append(curr_cost)
    print("{:6.3f} | {:10.5f}".format(feed_W, curr_cost))

>>
     W |       cost
-3.000 |   74.66667
-2.429 |   54.85714
-1.857 |   38.09524
-1.286 |   24.38095
-0.714 |   13.71429
-0.143 |    6.09524
 0.429 |    1.52381
 1.000 |    0.00000
 1.571 |    1.52381
 2.143 |    6.09524
 2.714 |   13.71429
 3.286 |   24.38095
 3.857 |   38.09524
 4.429 |   54.85714
 5.000 |   74.66667

plt.figure(figsize = (10,10))
plt.plot(W_values, cost_values, "b")
plt.ylabel('Cost(W)')
plt.xlabel('W')
plt.show()

🔨 The result is identical to that of the Python implementation.

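2. Gradient Descent

🔨 Hypothesis

$$H(x) = Wx$$

🔨 Cost

$$cost(W) = \frac{1}{m}\sum_{i=1}^m(Wx_i - y_i)^2$$

🔨 Gradient

$$Gradient = \frac{1}{m} \sum_{i=1}^m (Wx_i - y_i) x_i$$

🔨 Descent

$$W := W - \alpha \frac{1}{m} \sum_{i=1}^m (Wx_i - y_i) x_i$$

🔨 Differentiating the cost above exactly gives $\frac{2}{m} \sum_{i=1}^m (Wx_i - y_i) x_i$. The constant factor 2 only rescales the step size, so here it is folded into the learning rate $\alpha$.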
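
🔨 To see how the Gradient expression relates to the true slope of the cost, the sketch below compares it with a finite-difference estimate. This is a plain-Python illustration under my own assumptions: the helper names `cost`, `gradient`, and `numeric_derivative`, the test point W = 5.0, and the step `eps` are all hypothetical choices, not part of the original post.

```python
# Stand-alone sketch: compare the post's Gradient expression with a
# finite-difference estimate of d cost / dW. Helper names are illustrative.

X = [1, 2, 3]
Y = [1, 2, 3]


def cost(W, xs, ys):
    """Mean squared error of the hypothesis H(x) = W * x."""
    return sum((W * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def gradient(W, xs, ys):
    """The Gradient expression from the post: mean of (W*x - y) * x."""
    return sum((W * x - y) * x for x, y in zip(xs, ys)) / len(xs)


def numeric_derivative(W, xs, ys, eps=1e-4):
    """Central-difference estimate of the true derivative of cost at W."""
    return (cost(W + eps, xs, ys) - cost(W - eps, xs, ys)) / (2 * eps)


W = 5.0
g = gradient(W, X, Y)
d = numeric_derivative(W, X, Y)

print("Gradient expression :", g)
print("numeric derivative  :", d)
print("ratio               :", d / g)
```

The ratio comes out close to 2: the expression points in the same direction as the true derivative and differs only by a constant factor, which the learning rate absorbs.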

2.1. Initializing W with a random number and updating it

tf.random.set_seed(0)

X = np.array([1,2,3])
Y = np.array([1,2,3])
print("{:>5} | {:>10} | {:>10}".format("step", "cost", "W"))

# Only the initial W is random: one sample from a normal distribution (mean -100, stddev 100)
W = tf.Variable(tf.random.normal([1], -100., 100))

# Update W 300 times
for step in range(300):
    hypothesis = W * X
    cost = tf.reduce_mean(tf.square(hypothesis - Y))
    
    alpha = 0.01  # learning rate
    gradient = tf.reduce_mean(tf.multiply(hypothesis - Y, X))
    descent = W - tf.multiply(alpha, gradient)
    W.assign(descent)
    
    # Every 10 steps, print the cost (computed before the update) and the updated W
    if step % 10 == 0:
        print("{:5} | {:10.4f} | {:10.6f}".format(step, cost.numpy(), W.numpy()[0]))

>>
 step |       cost |          W
    0 | 11716.3086 |  48.767971
   10 |  4504.9126 |  30.619968
   20 |  1732.1364 |  19.366755
   30 |   666.0052 |  12.388859
   40 |   256.0785 |   8.062004
   50 |    98.4620 |   5.379007
   60 |    37.8586 |   3.715335
   70 |    14.5566 |   2.683725
   80 |     5.5970 |   2.044044
   90 |     2.1520 |   1.647391
  100 |     0.8275 |   1.401434
  110 |     0.3182 |   1.248922
  120 |     0.1223 |   1.154351
  130 |     0.0470 |   1.095710
  140 |     0.0181 |   1.059348
  150 |     0.0070 |   1.036801
  160 |     0.0027 |   1.022819
  170 |     0.0010 |   1.014150
  180 |     0.0004 |   1.008774
  190 |     0.0002 |   1.005441
  200 |     0.0001 |   1.003374
  210 |     0.0000 |   1.002092
  220 |     0.0000 |   1.001297
  230 |     0.0000 |   1.000804
  240 |     0.0000 |   1.000499
  250 |     0.0000 |   1.000309
  260 |     0.0000 |   1.000192
  270 |     0.0000 |   1.000119
  280 |     0.0000 |   1.000074
  290 |     0.0000 |   1.000046

2.2. Creating W with tf.Variable() and updating it (1)

X = np.array([1,2,3])
Y = np.array([1,2,3])
print("{:>5} | {:>10} | {:>10}".format("step", "cost", "W"))

# Initialize W with a fixed value, 5.0, instead of a random number
W = tf.Variable(5.0)

# Update W 300 times
for step in range(300):
    hypothesis = W * X
    cost = tf.reduce_mean(tf.square(hypothesis - Y))
    
    alpha = 0.01  # learning rate
    gradient = tf.reduce_mean(tf.multiply(hypothesis - Y, X))
    descent = W - tf.multiply(alpha, gradient)
    W.assign(descent)
    
    # Every 10 steps, print the cost (computed before the update) and the updated W
    if step % 10 == 0:
        print("{:5} | {:10.4f} | {:10.6f}".format(step, cost.numpy(), W.numpy()))

>>
 step |       cost |          W
    0 |    74.6667 |   4.813334
   10 |    28.7093 |   3.364572
   20 |    11.0387 |   2.466224
   30 |     4.2444 |   1.909177
   40 |     1.6320 |   1.563762
   50 |     0.6275 |   1.349578
   60 |     0.2413 |   1.216766
   70 |     0.0928 |   1.134412
   80 |     0.0357 |   1.083346
   90 |     0.0137 |   1.051681
  100 |     0.0053 |   1.032047
  110 |     0.0020 |   1.019871
  120 |     0.0008 |   1.012322
  130 |     0.0003 |   1.007641
  140 |     0.0001 |   1.004738
  150 |     0.0000 |   1.002938
  160 |     0.0000 |   1.001822
  170 |     0.0000 |   1.001130
  180 |     0.0000 |   1.000700
  190 |     0.0000 |   1.000434
  200 |     0.0000 |   1.000269
  210 |     0.0000 |   1.000167
  220 |     0.0000 |   1.000103
  230 |     0.0000 |   1.000064
  240 |     0.0000 |   1.000040
  250 |     0.0000 |   1.000025
  260 |     0.0000 |   1.000015
  270 |     0.0000 |   1.000009
  280 |     0.0000 |   1.000006
  290 |     0.0000 |   1.000004
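
🔨 The steady shrinkage in this table has a simple explanation. Since $y_i = x_i$ in this data, the update rule reduces to $W - 1 := (1 - \alpha \cdot \frac{14}{3})(W - 1)$, so the distance from the optimum is multiplied by the same factor at every step. The sketch below is a plain-Python illustration of that observation, not the post's TensorFlow code: the helper name `descend` is hypothetical, and the closed form only holds for data where Y equals X.

```python
# Stand-alone sketch: replay the update rule from section 2 in plain Python and
# compare it with the closed form W_k = 1 + (W_0 - 1) * r**k.
# `descend` is an illustrative helper, not part of the original post.

X = [1, 2, 3]
Y = [1, 2, 3]
alpha = 0.01


def descend(W, xs, ys, alpha):
    """One update: W - alpha * mean((W*x - y) * x)."""
    gradient = sum((W * x - y) * x for x, y in zip(xs, ys)) / len(xs)
    return W - alpha * gradient


# Per-step contraction factor; valid here because Y equals X, so the optimum is W = 1
r = 1 - alpha * sum(x * x for x in X) / len(X)

W0 = 5.0
W = W0
history = []
for step in range(300):
    W = descend(W, X, Y, alpha)
    history.append(W)

# Closed-form value after k + 1 updates, for k = 0 .. 299
closed_form = [1 + (W0 - 1) * r ** (k + 1) for k in range(300)]
max_gap = max(abs(a - b) for a, b in zip(history, closed_form))

print("contraction factor r   = {:.6f}".format(r))
print("W after 1 update       = {:.6f}".format(history[0]))
print("max gap to closed form = {:.2e}".format(max_gap))
```

With r ≈ 0.9533, ten updates shrink the distance to the optimum by about r^10 ≈ 0.62, which matches the printed values: (3.364572 - 1) / (4.813334 - 1) ≈ 0.62.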

2.2. Creating W with tf.Variable() and updating it (2)

cost_list = []
W_list = []

X = np.array([1,2,3])
Y = np.array([1,2,3])
print("{:>5} | {:>10} | {:>10}".format("step", "cost", "W"))

W = tf.Variable(30.0)

for step in range(300):
    hypothesis = W * X
    cost = tf.reduce_mean(tf.square(hypothesis - Y))
    
    # Record W before the update so that each (W, cost) pair refers to the same W
    W_list.append(W.numpy())
    cost_list.append(cost.numpy())
    
    alpha = 0.01  # learning rate
    gradient = tf.reduce_mean(tf.multiply(hypothesis - Y, X))
    descent = W - tf.multiply(alpha, gradient)
    W.assign(descent)
    
    # Every 10 steps, print the cost (computed before the update) and the updated W
    if step % 10 == 0:
        print("{:5} | {:10.4f} | {:10.6f}".format(step, cost.numpy(), W.numpy()))

>>
 step |       cost |          W
    0 |  3924.6667 |  28.646667
   10 |  1509.0312 |  18.143147
   20 |   580.2216 |  11.630123
   30 |   223.0948 |   7.591529
   40 |    85.7798 |   5.087276
   50 |    32.9823 |   3.534438
   60 |    12.6817 |   2.571555
   70 |     4.8761 |   1.974490
   80 |     1.8749 |   1.604262
   90 |     0.7209 |   1.374691
  100 |     0.2772 |   1.232338
  110 |     0.1066 |   1.144068
  120 |     0.0410 |   1.089334
  130 |     0.0158 |   1.055394
  140 |     0.0061 |   1.034349
  150 |     0.0023 |   1.021299
  160 |     0.0009 |   1.013207
  170 |     0.0003 |   1.008190
  180 |     0.0001 |   1.005078
  190 |     0.0001 |   1.003149
  200 |     0.0000 |   1.001953
  210 |     0.0000 |   1.001211
  220 |     0.0000 |   1.000751
  230 |     0.0000 |   1.000466
  240 |     0.0000 |   1.000289
  250 |     0.0000 |   1.000179
  260 |     0.0000 |   1.000111
  270 |     0.0000 |   1.000069
  280 |     0.0000 |   1.000043
  290 |     0.0000 |   1.000026

🔨 Let's plot the (W, cost) pairs recorded during the descent to see the shape of the cost function.

plt.figure(figsize = (10,12))
plt.plot(W_list, cost_list)
plt.ylabel('Cost(W)')
plt.xlabel('W')
plt.show()

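🔨 That brings us to a working implementation of a cost-minimizing algorithm. Once you know the formula and how to express it in code, the field may not be as grand as its name suggests, and that gives me some confidence🙃!! Of course, that feeling will probably vanish as I keep studying, but I'll take the boost while it lasts, haha.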

Programin9
