MongoDB with millions of rows -


i've got mongodb 52m documents in it. can't queries ever finish. there no proper indexes, think might why. however, when try build index seemed failed well. left 2 hours , database still locked when came back. had restart service.

any ideas or things for? thought mongo should able query on millions of rows.

sample document

{     "_id" : objectid("54dfbcfe8b0de0cd148b4567"),     "status" : "success",     "articles" : [          {             "sequenceid" : "423671544757",             "id" : "20450468768",             "language" : "danish",             "title" : "se det spøjse billede: bendtner og co. tvunget midtergangen",             "content" : "de er millionærer med dyre biler og sikkert også råd til en flybillet ved nødudgangen, men da wolfsburg-spillerne skulle transporteres til lørdagens kamp mod leverkusen skete det med tog til köln. \n \ndet blev ikke en helt almindelig tur nicklas bendtner og holdkammeraterne, den togvogn, som wolfsburg havde bestilt sine pladsbilletter til, kom aldrig frem. stedet vogn 858 kom vogn 848, og så var der ingen reserverede pladser til de dyre fodboldben. \n \nderfor måtte blandt andre en af holdets største stjerner kevin de bruyne sætte sig mellemgangen toget, som du kan se på billedet fra den noget alternative tur wolfsburg-holdet. \n \nden anderledes togtur kom dog ikke til @ få den store betydning wolfsburg, der var masser af mål holdet, da de slog leverkusen 5-4 lørdag.",             "tags" : [],             "publisheddate" : "2015-02-14t21:17:00z",             "harvestdate" : "2015-02-14t21:22:45z",             "url" : "http://ct.moreover.com/?a=20450468768&p=20x&v=1&x=pmgzeu9pwnwzzrjkgeshea",             "outboundurls" : [],             "dataformat" : "text",             "loginstatus" : "",             "duplicategroupid" : "20450468768",             "media" : {                 "audio" : [],                 "images" : [                      {                         "url" : "http://b.bimg.dk/node-images/301/8/140x80/8301685-fbl-ger-bundesliga-leverkusen-wolfsburg.jpg"                     }                 ],                 "video" : []             },             "publishingplatform" : [],             "adultlanguage" : "false",             "topics" : [],             "companies" : [],             "locations" : [],             "semantics" : {                 "events" : [],                 "entities" : []             },             "author" : {                 "name" : "michel wikkelsø davidsen",                 "publishingplatform" : []             },             "licenses" : [],             "source" : {                 "name" : "b.t. online",                 "homeurl" : "http://www.bt.dk",                 "publisher" : "",                 "category" : "trade",                 "editorialrank" : "3",                 "location" : {                     "country" : "denmark",                     "countrycode" : "dk",                     "region" : "europe",                     "subregion" : "northern europe",                     "state" : "",                     "ziparea" : "",                     "zipcode" : ""                 },                 "feed" : {                     "id" : "169662017",                     "name" : "b.t. online",                     "mediatype" : "news",                     "tags" : [],                     "language" : "unassigned",                     "dataformat" : "text",                     "rank" : {                         "inboundlinkcount" : "8901"                     },                     "inwhitelist" : "true",                     "autotopics" : [],                     "editorialtopics" : [                          "miscellaneous"                     ],                     "genre" : "general"                 }             }         }     ] } 

indexing can take long time, specially if server not have enough computing power. slow / inadequate disk, inadequate ram, cpu can slow down index creation. if running on better server, fields choose index, can cause long runtime.

i avoid multi key indexes , try running indexing in background. since have not provided sample document, cannot specifics.


Comments

Popular posts from this blog

c# - Validate object ID from GET to POST -

node.js - Custom Model Validator SailsJS -

php - Find a regex to take part of Email -