April Rhapsody

2023-04-21 About 2603 words, estimated reading time 6 minutes

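Before I knew it, I had entered the thirtieth year of my life. Perhaps it is just age, but my mental stamina feels far weaker than it used to be. Whether at work or in everyday life, after concentrating on a problem for a while, I always feel…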

Read more

Looking Back on 2022

2022-12-05 About 1313 words, estimated reading time 3 minutes

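Life: The home I bought two and a half years ago will finally be handed over this Sunday, which means it is time to prepare for the inspection, the property certificate, the renovation, and so on. For various reasons I still have not settled on a renovation company, and from here on I have to…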

Read more

Counterfactual Regret Minimization

2022-09-05 About 4072 words, estimated reading time 9 minutes

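Game Theory Basics: Recently I have been looking into AI algorithms for card games (mainly Dou Dizhu). After some preliminary research, I learned that in Texas Hold'em the SOTA methods are still built on the CFR framework, for example De…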

Read more

Thoughts on Work and Life

2022-05-31 About 2743 words, estimated reading time 6 minutes

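I have been working for almost three years and have, rather absurdly, already changed jobs twice, although the job in between lasted only a week. A while ago I once again felt a strong urge to switch jobs, mainly because the work felt meaningless, and in fact…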

Read more

RL: A Horrible Career (Part I)

2022-04-22 About 1166 words, estimated reading time 6 minutes

RL (Reinforcement Learning), which originated in control theory, aims to solve decision problems with machine intelligence. Its origins date back to the middle of the 20th century. Since the rapid rise of deep learning, it has been equipped with deep neural networks and has shown its power and potential to solve many real-world decision problems. The number of research papers related to RL has grown rapidly in recent years, as fancy ideas and algorithms seem to emerge constantly.

Read more