KRnet

    Detailed Program

     

    [C2: Deep Learning: Application] Deep Reinforcement Learning
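    Administrator (krnet) Posted : 2018-05-09 14:22:12 Views : 714
    Code number : 2
    Speaker : Jooyoung Park (박주영)
    Affiliation : Korea University
    Department : Control and Instrumentation Engineering
    Position : Professor
    Session time :
    Speaker biography : 1993 - present : Professor, Department of Control and Instrumentation Engineering, Korea University
    1992 : Ph.D., Electrical and Computer Engineering, University of Texas at Austin
    1983 : Bachelor's degree, Electrical Engineering, Seoul National University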
    Lecture summary : Deep Reinforcement Learning is one of the most actively researched areas of modern artificial intelligence, and it is advancing rapidly as reinforcement learning, control theory, and deep learning combine to synergistic effect.
    This lecture reviews the concepts behind the main topics that make up the past and present of deep reinforcement learning, including Controlled Ito Process, Stochastic Optimal Control, Hamilton-Jacobi-Bellman Equation, Markov Decision Process, Model-based & Model-free Reinforcement Learning, Deep Learning, and AlphaGo Zero, and considers the direction of related future technology.