ipfs-search/ipfs-search

docs/towards_dist_search/Distributed_search.md

Unexpected trailing spaces found.
Open

-   This local index is put on an (IPFS) cluster. 

Unexpected trailing spaces found.
Open

    

Unexpected trailing spaces found.
Open

-   Tokenise the query (split into words)    

Unexpected trailing spaces found.
Open

    -   Number of downloaded bytes per second versus the number of peers        

Unexpected trailing spaces found.
Open

    -   Bring the downed node back online. The data that belongs on this node (hopefully) begins to flow back into the node.        

Unexpected trailing spaces found.
Open

-   [IPFS Cluster Documentation](https://cluster.ipfs.io/documentation/ "https://cluster.ipfs.io/documentation/") 

Unexpected trailing spaces found.
Open

-   Sanitize the query (stemming all the words, making all letters lowercase, removing punctuation)    

Unexpected tabs found.
Open

        return [filename for filename in invertedIndex[word].keys()]

Unexpected tabs found.
Open

        listOfLists.append(one_word_query(word))

Unexpected tabs found.
Open

    setted = set(listOfLists[0]).intersection(*listOfLists)

Unexpected tabs found.
Open

    for filename in setted:

Unexpected trailing spaces found.
Open

-   Node failure    

Unexpected trailing spaces found.
Open

-   No local discovery.    

Unexpected trailing spaces found.
Open

-   [Tika Supported Document Formats](https://tika.apache.org/1.4/formats.html "https://tika.apache.org/1.4/formats.html")    

Unexpected trailing spaces found.
Open

-   IPFS does not guarantee redundancy. We can use IPFS clustering.    

Unexpected tabs found.
Open

    string = pattern.sub(' ',string)

Unexpected tabs found.
Open

    for word in string.split():

Unexpected tabs found.
Open

        temp = []

Unexpected trailing spaces found.
Open

    -   Number of hashes crawled per second per-peer versus the number of peers        

Unexpected trailing spaces found.
Open

    walk through the directory structure to fetch each file_path and 

Unexpected trailing spaces found.
Open

        

Unexpected tabs found.
Open

    if word in invertedIndex.keys():

Unexpected tabs found.
Open

    string = pattern.sub(' ',string)

Unexpected trailing spaces found.
Open

    -   After a while, pull the data from the agent to check that data that was sent to its neighbours when it was down is stored correctly.        

Expected a newline at the end of the file.
Open

### [Source](https://niverel.tymyrddin.space/en/play/stones/upsidedown/testing)

Unexpected trailing spaces found.
Open

-   Provider nodes that wish to participate parse and index only the files that they have added to a dweb (DHT hashes) and that have world file permissions. 

Unexpected trailing spaces found.
Open

    

Unexpected trailing spaces found.
Open

    

Unexpected tabs found.
Open

    pattern = re.compile(r'[\W_]+')

Unexpected trailing spaces found.
Open

-   Scalability indicators    

Unexpected trailing spaces found.
Open

    -   Take an agent offline (check that it has gone down and is inaccessible) and verify that all the data still appears to be available.        

Unexpected tabs found.
Open

        for i in range(len(temp)):

Unexpected additional newlines at the end of the file.
Open

### [Source](https://niverel.tymyrddin.space/en/play/stones/upsidedown/testing)

Unexpected tabs found.
Open

        return []

Unexpected tabs found.
Open

    result = []

Unexpected trailing spaces found.
Open

-   Performance indicators    

Unexpected tabs found.
Open

        result += one_word_query(word)

Unexpected tabs found.
Open

            for ind in range(len(temp[i])):

Unexpected tabs found.
Open

        for word in string.split():

Unexpected tabs found.
Open

        if set(temp[0]).intersection(*temp):

Unexpected trailing spaces found.
Open

    -   If automated, this may require adding data entry points in the API that are only used for testing.        

Unexpected trailing spaces found.
Open

-   Only popular indexes will be able to get a decent speed.    

Unexpected tabs found.
Open

    pattern = re.compile(r'[\W_]+')

Unexpected tabs found.
Open

    return list(set(result))

Unexpected tabs found.
Open

    for word in string.split():

Unexpected trailing spaces found.
Open

    -   We can run a few web-agent-type IPFS nodes in a cluster that pin all the indexes. Given enough bandwidth, these serve as base nodes that can act as mirrors and can also be served via HTTPS (the internet-facing demo version).        

Unexpected tabs found.
Open

            result.append(filename)

Unexpected trailing spaces found.
Open

-   No transports other than, for example, Tor (an overlay network of more than seven thousand relays that conceals a user's location and usage from anyone conducting network surveillance or traffic analysis). 

Unexpected tabs found.
Open

    else:

Unexpected tabs found.
Open

            temp.append(invertedIndex[word][filename][:])

Unexpected trailing spaces found.
Open

    -   Number of hashes crawled per second versus different CPU loads/platforms        

Unexpected trailing spaces found.
Open

-   Predictive analysis 

Unexpected tabs found.
Open

    return rankResults(result, string)

Unexpected trailing spaces found.
Open

        

Unexpected trailing spaces found.
Open

    

Unexpected tabs found.
Open

                temp[i][ind] -= i

Unexpected trailing spaces found.
Open

-   [Yggdrasil Version 0.3.6](https://yggdrasil-network.github.io/2019/08/03/release-v0-3-6.html "https://yggdrasil-network.github.io/2019/08/03/release-v0-3-6.html"), August 2019, first version with API    

Unexpected trailing spaces found.
Open

    -   Throughput of a peer versus the number of crawled job queues (to determine the optimal number of crawl job queues) per platform (differentiate using agent attributes).        

Unexpected tabs found.
Open

    word = pattern.sub(' ',word)

Unexpected tabs found.
Open

    pattern = re.compile(r'[\W_]+')

Unexpected tabs found.
Open

    listOfLists, result = [],[]
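
The findings above fall into four kinds: trailing spaces, tabs, a missing newline at the end of the file, and extra newlines at the end of the file. A minimal sketch of how they could be cleaned up in one pass is shown below. The function name `normalise_whitespace` and the `tab_width` parameter are hypothetical and not part of the repository, and the choice of four spaces per tab is an assumption. Note that in Markdown two or more trailing spaces before a newline produce a hard line break, so stripping them may change rendering where a break was intended.

```python
def normalise_whitespace(text: str, tab_width: int = 4) -> str:
    """Return text with tabs expanded, trailing whitespace stripped,
    and exactly one newline at the end.

    tab_width is an assumption; adjust it to the project's convention.
    """
    # Expand tabs first, then strip trailing spaces from every line.
    lines = [line.expandtabs(tab_width).rstrip() for line in text.splitlines()]

    # Drop blank lines at the end so the file ends with a single newline.
    while lines and not lines[-1]:
        lines.pop()

    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    sample = "-   Node failure    \n\tresult = []\n\n\n"
    print(repr(normalise_whitespace(sample)))
```

Applied to the contents of `docs/towards_dist_search/Distributed_search.md`, a pass like this would be expected to address the whitespace findings listed here, although the result should be re-checked with the linter.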
