久久久久久亚洲Av无码精品专口 ,亚洲色成人WWW永久在线观看,67194在线午夜亚洲

一個簡單blog備份工具的實現(xiàn)

為了備份blog，簡單寫了一個適用于blogjava等metaWeblog的blog備份工具，功能：

（1）備份post的正文到本地

（2）備份正文中的圖片、css文件到本地

（3）基于以上兩的步驟，修改相關(guān)的鏈接，實現(xiàn)本地脫機(jī)瀏覽

想到了但是未實現(xiàn)的功能：

（1）評論無法保存

（2）合適的話可以考慮以Eclipes RCP形式包裝

一、實現(xiàn)原理

（1）獲取post的方法：使用MetaWeblog提供的API接口

metaWeblog.getRecentPosts (blogid, username, password, numberOfPosts) returns array of structs,

Each struct represents a recent weblog post, containing the same information that a call to metaWeblog.getPost would return.

If numberOfPosts is 1, you get the most recent post. If it's 2 you also get the second most recent post, as the second array element. If numberOfPosts is greater than the number of posts in the weblog you get all the posts in the weblog.

(2) 使用正則表達(dá)式分析獲取下來的post，解析出post中包含的css和圖片文件的地址，執(zhí)行兩步操作

根據(jù)地址，抓取圖片保存到本地
修改post中的地址為本地保存地址

(3) 使用xml-rpc來簡化遠(yuǎn)程調(diào)用過程的編程

二、主要的代碼

public ArrayList<SimplePost> getAllPosts(String blogID, String name,String password, int num) throws XmlRpcException {
        ArrayList<SimplePost> posts = new ArrayList<SimplePost>();
        Object[] params = new Object[] { blogID, name, password,new Integer(num) };
        Object[] result = (Object[]) client.execute("metaWeblog.getRecentPosts", params);

        for (int i = 0; i < result.length; i++) {
            Map map = (Map) result[i];
            String postUrl = (String) map.get("link");
            String title = (String) map.get("title");
            String postId = (String) map.get("postid");

                        // post的內(nèi)容
            String description = (String) map.get("description");

            Map<String, String> images = new HashMap<String, String>();
            images = getImagesURL(description);

            String newDes = handleImagesURL(description,postId);
            String descriptioFileName = savePostContent(savePath, title,postId, newDes, css);

            SimplePost post = new SimplePost(postUrl, title, postId,descriptioFileName);
            //從postContent獲取圖像的地址和名稱，以便獲取圖片并保存
            post.setImages(images);
            posts.add(post);
            log.debug("postID: " + postId + "postTitle :" + title);
        }
        return posts;
    }

public static Map<String, String> getImagesURL(String description) {

       Map<String, String> map = new HashMap<String, String>();
        // img 的正則表達(dá)式
      String imgPattern = "<\\s*img\\s+([^>]+)\\s*>";
        Pattern p = Pattern.compile(imgPattern, Pattern.CASE_INSENSITIVE);
        Matcher matcher = p.matcher(description);

        // img src元素的正則表達(dá)式
        String srcPattern = "\\s*src\\s*=\\s*\"([^\"]+)\\s*\"";
       Pattern p2 = Pattern.compile(srcPattern, Pattern.CASE_INSENSITIVE);

        while (matcher.find()) {
            Matcher matcher2 = p2.matcher(matcher.group());
            // 一定要find(),這是實際的匹配動作
            if (matcher2.find()) {
                String src = matcher2.group();
                log.info(src);
                int i2 = src.lastIndexOf('/');
                int i1 = src.indexOf("http");
                if (i1 != -1) {
                    map.put(src.substring(i2 + 1, src.length() - 1), src
                            .substring(i1, src.length() - 1));
                }
            }
        }
        log.debug("圖片：" + map);
        return map;
    }

/**
     * 替換description的圖片鏈接為本地的相對鏈接，結(jié)構(gòu)為blogFiles/images/postid/
     *
     * @param description
     * @param userName
     * @param postId
     * @return
     */
    public static String handleImagesURL(String description, String postId) {
        String tmp = description;
        String address="images/" + postId + "/";

        String imgPattern = "<\\s*img\\s+([^>]+)\\s*>";
        Pattern p = Pattern.compile(imgPattern, Pattern.CASE_INSENSITIVE);
        Matcher matcher = p.matcher(tmp);

        // img src元素的正則表達(dá)式
        String srcPattern = "\\s*src\\s*=\\s*\"([^\"]+)\\s*\"";
        // String srcPattern = "\\s*src\\s*=\\s*\'([^\']+)\\s*\'";
        Pattern p2 = Pattern.compile(srcPattern, Pattern.CASE_INSENSITIVE);
        while (matcher.find()) {
            Matcher matcher2 = p2.matcher(matcher.group());
            // 一定要find(),這是實際的匹配動作
            if (matcher2.find()) {
                String src = matcher2.group();
                log.info(src);
                int l2=src.lastIndexOf('/')+1;
                log.info(src.substring(l2,src.length()-1));
                tmp=tmp.replace(src,"  src=\""+address+src.substring(l2,src.length()-1)+"\"");
            }
        }
        return tmp;
    }

發(fā)表于 2007-07-16 11:22 憑欄觀海閱讀(1283) 評論(8) 編輯收藏所屬分類: j2se

評論

# re: 一個簡單blog備份工具的實現(xiàn)

前排支持

交口稱贊評論于 2007-07-16 13:20 回復(fù) 更多評論

# re: 一個簡單blog備份工具的實現(xiàn)

支持一下讀取 MetaWeblog 的思路.
http://www.tkk7.com/beansoft/archive/2007/06/20/125255.html
BlogJava 備份文章閱讀器+離線瀏覽備份(含源碼,SWT)
里面已經(jīng)包含了保存 CSS, js, image 的 MHT 文件生成器的API, MHT 文件可以離線瀏覽(IE下). 不過我的所有文章列表都是從 BlogJava 備份文件那個大 XML 里面分析的. 不會 RCP, 交流一下思路先. 我用 HtmlParser 這個項目做的 HTML 解析, 比正則表達(dá)式準(zhǔn)確率高一些.

BeanSoft 評論于 2007-07-16 13:55 回復(fù) 更多評論

# re: 一個簡單blog備份工具的實現(xiàn)

@交口稱贊
多謝！^_^

憑欄觀海評論于 2007-07-16 14:15 回復(fù) 更多評論

# re: 一個簡單blog備份工具的實現(xiàn)

@BeanSoft
寫這個小東西的背景：
當(dāng)時剛剛在blogjava上安家，在blog上已經(jīng)放了點東西了，不過有一天在某個商業(yè)的blog服務(wù)提供商上的主頁上看到一段話，大致的意思是“大家難度都忘了幾年前，免費主頁空間的教訓(xùn)了嗎？”，于是試著google blog的備份工具，就找到了你提供的BlogJava 備份文章閱讀器+離線瀏覽備份，下載試用了下，感覺挺好的，美中不足的是，如果想要真正離線瀏覽，就必須使用mht來保存文章，這點我感覺有點受限了，因為firefox不能打開mht文件，（有時候還要到linux下面耍耍，mht在linux好像也沒有什么方式打開），所以我才決定寫一個東西將文件和圖片，css都下載到本地，同時修改鏈接來實現(xiàn)更加靈活的離線瀏覽。我沒有用過HtmlParser，學(xué)習(xí)學(xué)習(xí)去。。

憑欄觀海評論于 2007-07-16 14:30 回復(fù) 更多評論

# re: 一個簡單blog備份工具的實現(xiàn)

是呀, 我也這樣想, 萬一哪天掛了, 就沒了... 回頭有空的時候改改, 保存成 HTML 格式會好很多的.

BeanSoft 評論于 2007-07-16 15:35 回復(fù) 更多評論

# re: 一個簡單blog備份工具的實現(xiàn)

有點意思.

sitinspring 評論于 2007-07-17 13:00 回復(fù) 更多評論

# re: 一個簡單blog備份工具的實現(xiàn)

怎么沒有提供exe的下載，謝謝，能提供一個下載嗎？

蒙在天涯評論于 2007-07-26 23:24 回復(fù) 更多評論

# re: 一個簡單blog備份工具的實現(xiàn)

@蒙在天涯
如果有需要的話，可以考慮用RCP 封裝了提供出來，在http://www.tkk7.com/beansoft/archive/2007/06/20/125255.html有一個離線備份工具提供下載，原理上有點不一樣，但是目的是一樣的，現(xiàn)在也支持保存圖片等資源到本地了

憑欄觀海評論于 2007-07-27 08:54 回復(fù) 更多評論

新用戶注冊刷新評論列表


只有注冊用戶登錄后才能發(fā)表評論。




網(wǎng)站導(dǎo)航: 博客園 IT新聞 Chat2DB C++博客博問管理
相關(guān)文章: 關(guān)于Java String的intern and ==的筆試題，沒有全答對設(shè)計及設(shè)計模式：關(guān)于Java權(quán)限控制算法(轉(zhuǎn)） java基礎(chǔ)：byte與int 判斷resultset是否含有記錄 java的數(shù)據(jù)類型（備用） java語言中的bit 移位操作 JPA --Java EE 5.0 ORM 規(guī)范（轉(zhuǎn)）一個簡單blog備份工具的實現(xiàn) 正則表達(dá)式的總結(jié) 用HttpClient來模擬瀏覽器GET POST(收藏)

一個簡單blog備份工具的實現(xiàn)

一、實現(xiàn)原理

二、主要的代碼

隨筆分類(124)

隨筆檔案(185)

友情鏈接

工作流

常去的技術(shù)網(wǎng)站

鄰居

積分與排名

最新評論

閱讀排行榜

日出而作兮勤于外，日落而歸兮忙于內(nèi)
BlogJava \| 首頁 \| 發(fā)新隨筆 \| 發(fā)新文章 \| 聯(lián)系 \| 聚合 \| 管理