Finisky Garden

NLP, Software Engineering, Product Design

Using different GitHub accounts for different repos on the same machine is a common need. For example, repo1 is hosted under GitHub account x1 while repo2 is under account x2; how can we conveniently git push each repo to its corresponding remote with the right account on the same machine? The straightforward approach is to set the user name with git config in each repo directory, but that has two problems:

  • Every repo has to be configured separately, which is tedious
  • In some cases it cannot be configured at all. For example, when deploying a Hexo site with hexo-deployer-git, the .deploy_git directory is generated dynamically, so the git account and remote URL it uses are hard to change.

Instead, we can use the SSH config file to associate different GitHub accounts with different repos. Simply define multiple host entries in the SSH config; then, when accessing GitHub, use a virtual host as an alias in place of the real host name github.com.

Read more »

How can we use different GitHub accounts for different repositories on the same machine? For instance, we have two GitHub accounts x1 and x2, where x1 is used for repo1 and x2 for repo2. At first glance, we can set git config in each repository folder via git config user.name xxx. However, this approach has two drawbacks:

  • We need to configure the user name/email in every repository
  • In some cases, the git user cannot be configured by git config at all. For example, with hexo-deployer-git the git repo is automatically generated by the deployer, so it's hard to set the user name manually.

Fortunately, we can leverage the SSH config to associate different GitHub accounts with different repos. Defining multiple host entries does the trick: since we log in to GitHub via SSH, we use a virtual host as an alias that represents the real host name github.com.
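
A minimal sketch of such a ~/.ssh/config (the host aliases and key file names here are made up; point them at your own keys):

Host github-x1
    HostName github.com
    User git
    IdentityFile ~/.ssh/id_rsa_x1

Host github-x2
    HostName github.com
    User git
    IdentityFile ~/.ssh/id_rsa_x2

Then point each repo's remote at the matching alias instead of github.com, for example in repo2:

$ git remote set-url origin git@github-x2:x2/repo2.git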

Read more »

Recently I noticed that traffic across the shards of a MongoDB sharded cluster was not well balanced. After some investigation, the root cause turned out to be that the data itself is unevenly distributed. Although balanced data does not imply balanced traffic, we should still try to keep the data roughly balanced across shards. The data distribution of the three shards looked like this:

Shard      Data Size
mongo-0    10.55 GB
mongo-1    25.76 GB
mongo-2    10.04 GB

The data size of the mongo-1 shard is significantly larger than the others, while the chunk counts of the three shards are almost the same, so we need to analyze the chunk size distribution on each shard.

Read more »

Recently we found that traffic was not balanced across the MongoDB cluster shards. After investigation, the root cause is that data on each shard is not evenly distributed (chunk balancing != data balancing != traffic balancing). The data distribution looks like this:

Shard      Data Size
mongo-0    10.55 GB
mongo-1    25.76 GB
mongo-2    10.04 GB

Why is the data size of mongo-1 significantly larger than the others while the chunk counts of the three shards are almost the same? To answer that, we need to analyze the chunk size distribution across these shards.
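
One way to get this distribution is to estimate the size of every chunk with the dataSize command and sum the results per shard. A rough sketch to run in the mongo shell against mongos (the namespace and shard key are placeholders; note that on MongoDB 5.0+ config.chunks references the collection UUID instead of ns):

var ns = "mydb.mycoll";                  // placeholder namespace
var keyPattern = { _id: 1 };             // placeholder shard key
var dbName = ns.split(".")[0];
var totals = {};

db.getSiblingDB("config").chunks.find({ ns: ns }).forEach(function (chunk) {
    var res = db.getSiblingDB(dbName).runCommand({
        dataSize: ns, keyPattern: keyPattern,
        min: chunk.min, max: chunk.max, estimate: true
    });
    totals[chunk.shard] = (totals[chunk.shard] || 0) + res.size;
});
printjson(totals);                       // approximate bytes of data per shard

Alternatively, db.mycoll.getShardDistribution() prints per-shard data size and chunk statistics directly.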

Read more »

After adding new shards to our production MongoDB cluster (v4.4.6-ent, 5 shards with 3 replicas each), we found that the balancer was not working. sh.status() showed many chunk migration errors:

...
  balancer:
        Currently enabled:  yes
        Currently running:  no
        Failed balancer rounds in last 5 attempts:  0
        Migration Results for the last 24 hours:
                7 : Failed with error 'aborted', from mongo-1 to mongo-3
                7208 : Failed with error 'aborted', from mongo-1 to mongo-4
  databases:
        {  "_id" : "X",  "primary" : "mongo-1",  "partitioned" : true,  "version" : {  "uuid" : UUID("xxx"),  "lastMod" : 1 } }
                X.A
                        shard key: { "Uuid" : 1 }
                        unique: false
                        balancing: true
                        chunks:
                                mongo-0       231
                                mongo-1       327
                                mongo-2       230
                                mongo-3       208
...
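
To dig into why the migrations keep aborting, one place to look besides the mongos and shard logs is the config.changelog collection, which records recent chunk migration events and their error details. A hedged sketch for the mongo shell (field names may vary by version):

db.getSiblingDB("config").changelog.find(
    { what: /moveChunk/, "details.errmsg": { $exists: true } },
    { time: 1, what: 1, ns: 1, "details.errmsg": 1 }
).sort({ time: -1 }).limit(5).pretty()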
Read more »

When a pod is in an error state (CrashLoopBackOff), Kubernetes keeps restarting it. If you try to exec into the pod to check the logs or debug it, the following error message appears:

unable to upgrade connection: container not found ("")

This is because the old pod has been killed, so you can no longer exec into it. So how can we prevent the pod from restarting endlessly?
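
One common workaround (not necessarily the one in the full post) is to temporarily override the container's command so it idles instead of crashing, then exec in and debug. A sketch for a pod managed by a hypothetical Deployment named myapp:

$ kubectl patch deployment myapp --type='json' -p='[
    {"op": "add", "path": "/spec/template/spec/containers/0/command", "value": ["sleep", "infinity"]}
  ]'
$ kubectl exec -it <new-pod-name> -- /bin/sh

Remember to revert the patch once the debugging is done.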

Read more »

Multiple Backup Daemons are typically run when the storage requirements or the load generated by the deployment is too much for a single daemon.

Directly scaling the statefulset ops-manager-backup-daemon to multiple instances (e.g. 3) doesn't work: since the mongodb-enterprise-operator is watching the statefulset, the instance count will be scaled back down to 1 by the MongoDB operator several minutes later.

So how do we scale up the backup daemons through the MongoDB Kubernetes operator?
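
A hedged sketch of the idea: the backup daemon count belongs to the MongoDBOpsManager resource spec that the operator reconciles, so it should be changed there rather than on the statefulset (the resource name is hypothetical; verify the field against your operator version):

apiVersion: mongodb.com/v1
kind: MongoDBOpsManager
metadata:
  name: ops-manager            # hypothetical resource name
spec:
  ...
  backup:
    enabled: true
    members: 3                 # desired number of backup daemon instances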

Read more »

Apex Compile Error

The environment (CUDA 10.0):

$ conda install pytorch==1.1.0 torchvision==0.3.0 cudatoolkit=10.0 -c pytorch

The apex repo master HEAD:

commit 0c2c6eea6556b208d1a8711197efc94899e754e1 (HEAD -> master, origin/master, origin/HEAD)
Author: Nan Zheng <80790206+nanz-nv@users.noreply.github.com>
Date:   Sat Jul 17 08:53:59 2021 +0800
...

Read more »

What is a change stream? From the official documentation:

Change streams allow applications to access real-time data changes without the complexity and risk of tailing the oplog. Applications can use change streams to subscribe to all data changes on a single collection, a database, or an entire deployment, and immediately react to them. Because change streams use the aggregation framework, applications can also filter for specific changes or transform the notifications at will.

Here we use change streams to do real-time primary-secondary replication. I could not find an existing solution online, presumably because the straightforward way to achieve this is via a replica set rather than doing the replication by hand. But there are real business needs for it, such as backing up data across regions into a heterogeneous cluster.

The only existing wheel I found is MongoShake. MongoShake, however, is not a commercial product, and after pulling the code and running it, it did not work properly in our environment:

  • TLS authentication had some issues, which we solved by modifying the source code
  • In the all sync mode, only the full sync via the oplog completed; the incremental sync via change streams kept throwing errors and could not run properly

Considering that fixing someone else's wheel might be harder than building our own, I looked into how to do the replication myself. The simplest idea is to read the oplog from the source cluster in real time and replay it on the target. That sounds simple, but implementing it is not so easy, especially when the source is a sharded cluster: you cannot pull the oplog directly through mongos, and have to pull data from each shard manually, which makes the implementation considerably harder.

The good news is that MongoDB v3.6 introduced change streams, and since we manage the sharded cluster with MongoDB Ops Manager, a snapshot restore is easy. So all the replication has to do is replay the real-time changes made after the snapshot point.

It looks like we can build this wheel ourselves.

Read more »

From the official manual:

Change streams allow applications to access real-time data changes without the complexity and risk of tailing the oplog. Applications can use change streams to subscribe to all data changes on a single collection, a database, or an entire deployment, and immediately react to them. Because change streams use the aggregation framework, applications can also filter for specific changes or transform the notifications at will.

Here we leverage change streams to replicate data from one MongoDB cluster to another in real time.

There are existing tools such as MongoShake that do the same thing. However, MongoShake is a bit complicated to use, and we encountered two issues:

  • We had to modify the source code to get TLS authentication working
  • Incremental sync did not work in the all sync_mode

Since our goal is real-time replication, we chose a more straightforward and controllable approach: restore the cluster with MongoDB Ops Manager, then apply the real-time changes with a change stream.
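
A minimal sketch of the replay loop in the mongo shell (database, collection, and connection string are placeholders; a real implementation also needs resume-token checkpointing and error handling):

// connect to the target cluster and open a change stream on the source collection
var target = new Mongo("mongodb://target-host:27017").getDB("mydb");
var source = db.getSiblingDB("mydb");

var cursor = source.mycoll.watch([], {
    // in practice, pass startAtOperationTime (the cluster time of the snapshot)
    // or resumeAfter (a saved resume token) to pick up where the restore ended
    fullDocument: "updateLookup"
});

while (cursor.hasNext()) {
    var event = cursor.next();
    switch (event.operationType) {
        case "insert":
        case "replace":
        case "update":
            // fullDocument may be missing if the document was deleted right afterwards
            if (event.fullDocument) {
                target.mycoll.replaceOne({ _id: event.documentKey._id },
                                         event.fullDocument, { upsert: true });
            }
            break;
        case "delete":
            target.mycoll.deleteOne({ _id: event.documentKey._id });
            break;
    }
}

Writing the looked-up full document instead of replaying the raw update description keeps the sketch simple and idempotent, at the cost of an extra lookup per update on the source.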

Read more »