当前位置：首页 > C++ > 正文

LZW压缩算法详解（C++语言实现LZW无损数据压缩与解压缩）

主机测评网
C++
2025-12-23
762

在计算机科学中，LZW压缩算法（Lempel-Ziv-Welch）是一种广泛使用的无损数据压缩技术。它由 Abraham Lempel、Jacob Ziv 和 Terry Welch 共同提出，因其高效性和简洁性被用于 GIF 图像格式、TIFF 文件等场景。本文将带你从零开始，用 C++语言实现LZW 的完整压缩与解压缩过程，即使是编程小白也能轻松理解！

什么是LZW压缩算法？

LZW 是一种基于字典的压缩方法。它通过构建一个动态字典（也叫“编码表”），将重复出现的字符串用较短的代码表示，从而减少数据体积。其核心思想是：遇到新字符串就加入字典，下次再出现时直接用编号代替。

LZW压缩算法详解（C++语言实现LZW无损数据压缩与解压缩） LZW压缩算法 C++实现LZW LZW编码解码无损数据压缩第1张

LZW压缩的基本步骤

初始化字典：将所有可能的单字符（如 ASCII 0-255）加入字典，编号为 0-255。
读取输入流，从第一个字符开始，尝试匹配当前最长的已在字典中的字符串。
当发现一个新字符串（即当前字符串 + 下一个字符不在字典中）时：

输出当前字符串对应的字典编号；
将这个新字符串加入字典（分配新编号）；
从下一个字符重新开始匹配。

重复直到输入结束，最后输出剩余字符串的编号。

C++实现LZW压缩

下面是一个完整的 C++ 实现，包含压缩和解压缩两个部分。我们使用 std::map 来模拟字典，使用 std::vector<int> 存储输出的编码序列。

#include <iostream>#include <map>#include <vector>#include <string>using namespace std;vector<int> lzw_compress(const string& input) {    map<string, int> dict;    int dict_size = 256;    // 初始化字典：所有单字符    for (int i = 0; i < 256; ++i) {        dict[string(1, char(i))] = i;    }    vector<int> result;    string w = "";    for (char c : input) {        string wc = w + c;        if (dict.find(wc) != dict.end()) {            w = wc;        } else {            result.push_back(dict[w]);            dict[wc] = dict_size++;            w = string(1, c);        }    }    if (!w.empty()) {        result.push_back(dict[w]);    }    return result;}

C++实现LZW解压缩

解压缩过程是压缩的逆操作。它不需要传输字典，因为解压端可以根据压缩码流重建相同的字典。

string lzw_decompress(const vector<int>& compressed) {    vector<string> dict(256);    int dict_size = 256;    // 初始化字典    for (int i = 0; i < 256; ++i) {        dict[i] = string(1, char(i));    }    string result = dict[compressed[0]];    string w = result;    for (size_t i = 1; i < compressed.size(); ++i) {        int k = compressed[i];        string entry;        if (k < dict_size) {            entry = dict[k];        } else if (k == dict_size) {            entry = w + w[0];        } else {            throw runtime_error("Invalid compressed data");        }        result += entry;        dict.push_back(w + entry[0]);        dict_size++;        w = entry;    }    return result;}

完整测试示例

int main() {    string text = "ABABABAABABABBABABABBBABABAAABABABCCC";    cout << "原始文本: " << text << endl;    // 压缩    vector<int> compressed = lzw_compress(text);    cout << "压缩结果: ";    for (int code : compressed) {        cout << code << " ";    }    cout << endl;    // 解压缩    string decompressed = lzw_decompress(compressed);    cout << "解压文本: " << decompressed << endl;    if (text == decompressed) {        cout << "✅ 压缩/解压成功！" << endl;    } else {        cout << "❌ 失败！" << endl;    }    return 0;}